A COMPARATIVE CLUSTERING MODEL THAT CONSIDERS FALSE POSITIVES AND FALSE NEGATIVES IN SOME SOCIOECONOMIC APPLICATIONS
D.E. Urueta-Hinojosa. Universidad Autónoma Metropolitana-Iztapalapa, Departamento de Ingeniería Eléctrica, Ciudad de México, México. E-mail: deurueta@xanum.uam.mx
P. Lara-Velázquez. Universidad Autónoma Metropolitana-Iztapalapa, Departamento de Ingeniería Eléctrica, Ciudad de México, México. E-mail: plara@xanum.uam.mx
M.A. Gutiérrez-Andrade. Universidad Autónoma Metropolitana-Iztapalapa, Departamento de Ingeniería Eléctrica, Ciudad de México, México. E-mail: gamma@xanum.uam.mx
S.G. De-los-Cobos-Silva. Universidad Autónoma Metropolitana-Iztapalapa, Departamento de Ingeniería Eléctrica, Ciudad de México, México. E-mail: cobos@xanum.uam.mx
E.A. Rincón-García. Universidad Autónoma Metropolitana-Azcapotzalco, Departamento de Sistemas, Ciudad de México, México. E-mail: rigaeral@correo.azc.uam.mx
R.A. Mora-Gutiérrez. Universidad Autónoma Metropolitana-Azcapotzalco, Departamento de Sistemas, Ciudad de México, México. E-mail: mgra@correo.azc.uam.mx
- Fuzzy Economic Review: Volume 25, Number 2, 2020, special issue
- DOI: 10.25102/fer.2020.02.03
Abstract
Unsupervised learning enables classifier models to be built quickly and inexpensively in comparison with supervised approaches because the labeling task is eliminated. On the other hand, to assess the quality of a classifier, the only parameter to consider is usually accuracy, treating incorrect predictions like if they had the same importance when in reality the consequences of diagnosing a healthy patient as sick (Type I Error), or diagnosing a sick patient as healthy (Type II Error) are different. That is why, depending on the application, it is preferable to avoid a specific type of error, even if the accuracy decreases. The present work shows a model based on clustering methods that take into account Type I and II Errors to solve medical and business instances using three techniques: k-means, Spectral and Gauss. Based on representative and well-studied datasets for socioeconomic applications, the results show that the accuracy of a model is not a conclusive parameter and to make a decision it is necessary to focus on errors in the confusion matrix which according to each specific instance, take a different meaning and significance. Our results and analysis are discussed to determine the best model for each case study. Finally, conclusions and limitations are analyzed.
Related items
- STRATEGIC OPTIMIZATION OF ELECTRIC POWER GENERATION IN AN UNCERTAIN ENVIRONMENT
- WATER INFRASTRUCTURE FINANCING IN SPAIN: POTENTIAL FOR SECURITIZATION
- RBF centers initialization using fuzzy clustering technique for function approximation problems
- Credit card application assessment using a neuro-fuzzy classification system