Abstract
This paper presents methods of banks discrimination according to the rate of NonPerforming Loans (NPLs), using Gaussian Bayes models and different approaches of multiclass Support Vector Machines (SVM). This classification problem involves many irrelevant variables and comparatively few training instances. New variable selection strategies are proposed. They are based on Gaussian marginal densities for Bayesian models and ranking scores derived from multiclass SVM. The results on both toy data and real-life problem of banks classification demonstrate a significant improvement of prediction performance using only a few variables. Moreover, Support Vector Machines approaches are shown to be superior to Gaussian Bayes models.
Original language | English (US) |
---|---|
Pages (from-to) | 3087-3099 |
Number of pages | 13 |
Journal | Expert Systems with Applications |
Volume | 39 |
Issue number | 3 |
DOIs | |
State | Published - Feb 15 2012 |
Externally published | Yes |
Keywords
- Gaussian Bayes classifier
- Multiclass SVM
- Multiclass bank's risk
- Risk factors
- Stepwise algorithm
- Variable selection
ASJC Scopus subject areas
- General Engineering
- Computer Science Applications
- Artificial Intelligence