Applying Machine Learning Sampling Techniques to Address Data Imbalance in a Chilean COVID-19 Symptoms and Comorbidities Dataset

Journal
Applied Sciences
Date Issued
2025-01-23
Author(s)
Pablo Ormeño-Arriagada
Gastón Márquez
David Araya
Carla Rimassa (Facultad de Medicina)
Carla Taramasco
DOI
10.3390/app15031132
Abstract
Reliably detecting COVID-19 is critical for diagnosis and disease control. However, imbalanced data in medical datasets pose significant challenges for machine learning models, leading to bias and poor generalization. The dataset obtained from the EPIVIGILA system and the Chilean Epidemiological Surveillance Process contains information on over 6,000,000 patients, but, like many current datasets, it suffers from class imbalance. To address this issue, we applied various machine learning algorithms, both with and without sampling methods, and compared them using different classification and diagnostic metrics such as precision, sensitivity, specificity, likelihood ratio positive, and diagnostic odds ratio. Our results showed that applying sampling methods to this dataset improved the metric values and contributed to models with better generalization. Effectively managing imbalanced data is crucial for reliable COVID-19 diagnosis. This study enhances the understanding of how machine learning techniques can improve diagnostic reliability and contribute to better patient outcomes.
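
The metrics named in the abstract all follow from the four cells of a binary confusion matrix. The Python sketch below is illustrative only and is not taken from the paper: the `ConfusionCounts` and `diagnostic_metrics` names are hypothetical, and the example counts are invented to show the arithmetic, not drawn from the EPIVIGILA dataset. It uses the standard definitions of precision, sensitivity, specificity, positive likelihood ratio (sensitivity divided by the false positive rate), and diagnostic odds ratio.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Cells of a binary confusion matrix (hypothetical container)."""
    tp: int  # true positives
    fp: int  # false positives
    tn: int  # true negatives
    fn: int  # false negatives


def _ratio(numerator: float, denominator: float) -> float:
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def diagnostic_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Compute the diagnostic metrics listed in the abstract."""
    sensitivity = _ratio(c.tp, c.tp + c.fn)          # true positive rate
    specificity = _ratio(c.tn, c.tn + c.fp)          # true negative rate
    precision = _ratio(c.tp, c.tp + c.fp)            # positive predictive value
    false_positive_rate = _ratio(c.fp, c.fp + c.tn)  # equals 1 - specificity
    return {
        "precision": precision,
        "sensitivity": sensitivity,
        "specificity": specificity,
        # LR+ = sensitivity / (1 - specificity)
        "lr_positive": _ratio(sensitivity, false_positive_rate),
        # DOR = (TP * TN) / (FP * FN)
        "diagnostic_odds_ratio": _ratio(c.tp * c.tn, c.fp * c.fn),
    }


# Invented counts for an imbalanced test set: 100 positives, 900 negatives.
example = ConfusionCounts(tp=80, fp=30, tn=870, fn=20)
metrics = diagnostic_metrics(example)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Unlike accuracy, sensitivity and specificity are each computed within a single class, so they are not inflated by a large majority class. That is one reason diagnostic metrics of this kind suit imbalanced data.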