A Generative Approach to Credit Prediction with Learnable Prompts for Multi-scale Temporal Representation Learning
Abstract: Recent industrial credit scoring models remain heavily reliant on manually tuned statistical learning methods. While deep learning offers promising solutions, its effectiveness is often limited by the complexity of financial data, particularly in long-horizon scenarios. In this work, we propose FinLangNet, which addresses credit scoring by reframing it as the task of generating multi-scale distributions of a user's future behavior. Within this framework, tabular data is transformed into sequential representations, enabling the generation of user embeddings across multiple temporal scales. Inspired by the recent success of prompt-based training in LLMs, FinLangNet also introduces two types of prompts to model and capture user behavior at both the feature-granularity and user-granularity levels. Experimental results demonstrate that FinLangNet outperforms the online XGBoost benchmark, achieving a 7.2\% improvement in KS metric performance and a 9.9\% reduction in the relative bad debt rate. Furthermore, FinLangNet exhibits superior performance on public UEA archives, underscoring its scalability and adaptability in time series classification tasks.
- SFE: A simple, fast and efficient feature selection algorithm for high-dimensional data. IEEE Transactions on Evolutionary Computation (2023).
- Wide and deep learning for peer-to-peer lending. Expert Systems with Applications 134 (2019), 209–224.
- Credit risk evaluation: a comprehensive study. Multimedia Tools and Applications 82, 12 (2023), 18217–18267.
- Deep neural networks and tabular data: A survey. IEEE Transactions on Neural Networks and Learning Systems (2022).
- Contagious chain risk rating for networked-guarantee loans. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2715–2723.
- On supervised class-imbalanced learning: An updated perspective and some key challenges. IEEE Transactions on Artificial Intelligence 3, 6 (2022), 973–993.
- Investigating the role of Fintech in the banking industry: what do we know? European Journal of Innovation Management 26, 5 (2023), 1365–1393.
- Fang Fang and Yuanyuan Chen. 2019. A new approach for credit scoring by directly maximizing the Kolmogorov–Smirnov statistic. Computational Statistics & Data Analysis 133 (2019), 180–194.
- SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary. Journal of artificial intelligence research 61 (2018), 863–905.
- Standardizing fairness-evaluation procedures: interdisciplinary insights on machine learning algorithms in creditworthiness assessments for small personal loans. AI and Ethics (2023), 1–17.
- Revisiting deep learning models for tabular data. Advances in Neural Information Processing Systems 34 (2021), 18932–18943.
- Albert Gu and Tri Dao. 2023. Mamba: Linear-time sequence modeling with selective state spaces. arXiv preprint arXiv:2312.00752 (2023).
- DeepFM: a factorization-machine based neural network for CTR prediction. arXiv preprint arXiv:1703.04247 (2017).
- Improving credit risk prediction in online peer-to-peer (P2P) lending using feature selection with deep learning. In 2019 21st International Conference on Advanced Communication Technology (ICACT). IEEE, 511–515.
- A novel ensemble method for credit scoring: Adaption of different imbalance ratios. Expert Systems with Applications 98 (2018), 105–117.
- A filter feature selection for high-dimensional data. Journal of Algorithms & Computational Technology 17 (2023), 17483026231184171.
- Wonkeun Jo and Dongil Kim. 2022. OBGAN: Minority oversampling near borderline with generative adversarial networks. Expert Systems with Applications 197 (2022), 116694.
- Aleum Kim and Sung-Bae Cho. 2019. An ensemble semi-supervised learning method for predicting defaults in social lending. Engineering applications of Artificial intelligence 81 (2019), 193–199.
- Predicting mortgage default using convolutional neural networks. Expert Systems with Applications 102 (2018), 207–217.
- Heterogeneous ensemble learning with feature engineering for default prediction in peer-to-peer lending in China. World Wide Web 23 (2020), 23–45.
- Credit risk prediction based on loan profit: Evidence from Chinese SMEs. Research in International Business and Finance 67 (2024), 102155.
- DeRisk: An Effective Deep Learning Framework for Credit Risk Prediction over Real-World Financial Data. arXiv preprint arXiv:2308.03704 (2023).
- itransformer: Inverted transformers are effective for time series forecasting. arXiv preprint arXiv:2310.06625 (2023).
- Study on a prediction of P2P network loan default based on the machine learning LightGBM and XGboost algorithms according to different high dimensional data cleaning. Electronic Commerce Research and Applications 31 (2018), 24–39.
- Credit risk classification: an integrated predictive accuracy algorithm using artificial and deep neural networks. Annals of Operations Research 330, 1 (2023), 609–637.
- M Senthil Murugan et al. 2023. Large-scale data-driven financial risk management & analysis using machine learning strategies. Measurement: Sensors 27 (2023), 100756.
- Forecasting groundwater quality using automatic exponential smoothing model (AESM) in Xianyang City, China. Human and Ecological Risk Assessment: An International Journal 29, 2 (2023), 347–368.
- Sequential multi-dimensional self-supervised learning for clinical time series. In International Conference on Machine Learning. PMLR, 28531–28548.
- The role of fintech on bank risk-taking: Mediating role of bank’s operating efficiency. Human Behavior and Emerging Technologies 2023 (2023).
- Using sequences of life-events to predict human lives. Nature Computational Science (2023), 1–14.
- Loan default prediction using a credit rating-specific and multi-objective ensemble learning scheme. Information Sciences 629 (2023), 599–617.
- A deep learning approach to competing risks representation in peer-to-peer lending. IEEE transactions on neural networks and learning systems 30, 5 (2018), 1565–1574.
- Credit card fraud detection-machine learning methods. In 2019 18th International Symposium INFOTEH-JAHORINA (INFOTEH). IEEE, 1–5.
- Attention is all you need. Advances in neural information processing systems 30 (2017).
- Temporal-aware graph neural network for credit risk prediction. In Proceedings of the 2021 SIAM International Conference on Data Mining (SDM). SIAM, 702–710.
- Fintech inputs, non-performing loans risk reduction and bank performance improvement. International Review of Financial Analysis 90 (2023), 102849.
- Forecasting SMEs’ credit risk in supply chain finance with a sampling strategy based on machine learning techniques. Annals of Operations Research 331, 1 (2023), 1–33.
- A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring. Expert systems with applications 78 (2017), 225–241.
- Loan default prediction of Chinese P2P market: a machine learning methodology. Scientific Reports 11, 1 (2021), 18759.
- Predict and Optimize Financial Services Risk Using AI-driven Technology. Academic Journal of Science and Technology 10, 1 (2024), 299–304.
- A novel method for credit scoring based on cost-sensitive neural network ensemble. IEEE Access 9 (2021), 78521–78537.
- A review of recurrent neural networks: LSTM cells and network architectures. Neural computation 31, 7 (2019), 1235–1270.
- An attention-based Logistic-CNN-BiLSTM hybrid neural network for credit risk prediction of listed real estate enterprises. Expert Systems 41, 2 (2024), e13299.
- Fintech, macroprudential policies and bank risk: Evidence from China. International Review of Financial Analysis 87 (2023), 102648.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.