PREDIKSI KELAYAKAN NASABAH KREDIT MENGGUNAKAN TEKNIK PENGGABUNGAN KLASTERISASI EXPECTATION-MAXIMIZATION DAN KLASIFIKASI NAÏVE BAYES

  • Raditia Vindua Universitas Pamulang
Keywords: Data Mining, EM Clustering, Naïve Bayes Classification, Credit Clients

Abstract

Customer Data Credit has not been utilized to recognize patterns that can be used to predict New Credit Clients. However, Credit Client data does not have a class label for the classification. Therefore, the Credit Client data is processed first using EM clusteration (expectation-maximization) method to classify the class so that it can be classified using the Naïve Bayes classification. The purpose of this research is to create a data mining model for the prediction of new Credit Customers Feasibility with combined method of EM Classification and Naïve Bayes Classification. From 540 Credit Customers data, there are 142 data of Credit Clients that become cluster 0 or Bad Credit Customer and there are 398 Data of Credit Clients that become cluster 1 or good Credit Customers. The accuracy level of Naïve Bayes classification after using EM clusterization has an average accuracy of 99.2437% and an average error rate of 0.7563%. Thus the combined method of EM clustering and Naïve Bayes classification using WEKA can be concluded to be used to predict the feasibility of New Credit Clients.

Downloads

Download data is not yet available.

References

Berry, M. J. A., & Linoff, G. S. (2004). Data mining techniques: For marketing, sales, and customer support. New York: John Wiley and Sons, Inc.
D.J. Hand. (2005). Good practice in retail credit scorecard assessment, J. Oper. Res. Soc, 56 (9), 1109–1117 .
Dm Crews. (2004). Modul dan Jurnal Praktek Data Mining T.A 2004/2005. STT Telkom: Bandung
Frawley, W. J., Piatetsky-Shapiro, G., & Matheus, C. J. (1991). Knowledge discovery in databases: an overview. In Knowledge Discovery in Databases (pp. 213–228). Cambridge, MA: AAAI/MIT.
K.M. Jung , L.C. Thomas , M.C. So. (2015). When to rebuild or when to adjust scorecards, J. Oper. Res. Soc, 66 (10), 1656–1668
L.C. Thomas. (2009). Consumer Credit Models: Pricing, Profit and Portfolios, Oxford University Press, Oxford, UK.
L. Einav , M. Jenkins , J. Levin. (2013). The impact of credit scoring on consumer lending, RAND J. Econ, 44 (2), 249–274 .
S.C. Chen, M.Y. Huang. (2011). Constructing credit auditing and control & management model with data mining technique. Expert Systems with Applications, 38, 5359–5365
T. S. Lee, C. C. Chiu, C. J. Lu and I. F. Chen. (2002). Credit scoring using the hybrid neural discriminant technique. Expert Systems with Applications, 23, 245-254
T. S. Lee and I. F. Chen. (2005). A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines. Expert Systems with Applications, 28, 743-752
Vercellis, C. (2009). Business Intelligence; Data Mining and Optimization for Decision Making. John Wiley & Sons, Ltd., UK
Weimin Chen et al. (2012). Credit risk Evaluation by hybrid data mining technique. Systems Engineering Procedia, 3, 197 – 200
Xiong, T., Wang, S., Mayers, A., & Monga, E. (2013). Personal bankruptcy prediction by mining credit card data. Expert Systems with Applications, 40, 665–676
Z. Huang, H. Chen, C. J. Hsu and W. H. Chen. (2004). Credit rating analysis with support vector machines and neural networks: a market comparative study. Decision Support Systems, 37, 543-558.
Acock, C. A., (2005). Working With Missing Values. Journal Marriage Fam, (67), pp. 1012–28.
Osama Abu Abbas., (2008). Comparisons Between Data Clustering Algorithms. International. Arab Journal Information And Technology, (5), pp.320–5.
Published
2020-08-17
How to Cite
Vindua, R. (2020). PREDIKSI KELAYAKAN NASABAH KREDIT MENGGUNAKAN TEKNIK PENGGABUNGAN KLASTERISASI EXPECTATION-MAXIMIZATION DAN KLASIFIKASI NAÏVE BAYES. KOLANO: Journal of Multi-Disciplinary Sciences, 1(1), 1-8. Retrieved from http://e-journal.univ-nuku.ac.id/index.php/kolano/article/view/20