University of Bahrain
Scientific Journals

Classification of Tuberculosis Based on Chest X-Ray Images for Imbalance Data using SMOTE

Show simple item record

dc.contributor.author Fadhlullah Kh.TQ , Muhammad
dc.contributor.author Wahyono
dc.date.accessioned 2024-01-09T12:22:02Z
dc.date.available 2024-01-09T12:22:02Z
dc.date.issued 2024-01-09
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/5324
dc.description.abstract This research delves into the challenge of dataset imbalance in classifying Chest X-Ray (CXR) images in the TBX11K dataset. To address this, the study employs Random Forest (RF) and XGBoost (XGB) methods, both with and without the Synthetic Minority Over-sampling Technique (SMOTE). The primary objective is to evaluate the impact of SMOTE on the performance of these models in classifying CXR images from the TBX11K dataset. This research applies SMOTE to the RF and XGB classification models to increase the number of minority class samples (TB positive) and address the imbalance with the majority class samples (TB negative). To ensure a comprehensive comparison, each model is assessed using a consistent set of evaluation metrics, including accuracy, precision, recall, and F1 score. The findings indicate that applying SMOTE to both RF and XGB models effectively mitigates class imbalance in the dataset. Specifically, the RF model without SMOTE achieves an accuracy of approximately 93.33%, while the RF model with SMOTE achieves an accuracy of 92.72%. On the other hand, the XGB model without SMOTE achieves an accuracy of 94.11%, and the XGB model with SMOTE reaches 94.33%. Although SMOTE enhances overall model performance, challenges persist in accurately predicting the minority classes ’altb’ and ’ltb.’ These challenges are attributed to the less representative features of these minority classes, which are difficult to overcome even with resampling techniques. Based on the experimental results, the XGB model with SMOTE emerges as the most optimal model for classifying TBX11K images. Despite the improved performance, further work is needed to enhance the prediction accuracy for minority classes, suggesting that additional techniques or more sophisticated models might be required to address this issue comprehensively. en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.subject Random Forest, XGBoost , machine learning, VGG16 en_US
dc.title Classification of Tuberculosis Based on Chest X-Ray Images for Imbalance Data using SMOTE en_US
dc.identifier.doi http://dx.doi.org/10.12785/ijcds/160171
dc.volume 16 en_US
dc.issue 1 en_US
dc.pagestart 981 en_US
dc.pageend 993 en_US
dc.contributor.authorcountry Indonesia en_US
dc.contributor.authorcountry Indonesia en_US
dc.contributor.authoraffiliation Master Program in Artificial Intelligence, Universitas Gadjah Mada en_US
dc.contributor.authoraffiliation Department of Computer Science and Electronics, Universitas Gadjah Mada en_US
dc.source.title International Journal of Computing and Digital Systems en_US
dc.abbreviatedsourcetitle IJCDS en_US


Files in this item

This item appears in the following Issue(s)

Show simple item record

All Journals


Advanced Search

Browse

Administrator Account