View : 1366 Download: 295

The Machine Learning-Based Dropout Early Warning System for Improving the Performance of Dropout Prediction

Title
The Machine Learning-Based Dropout Early Warning System for Improving the Performance of Dropout Prediction
Authors
Lee, SunbokChung, Jae Young
Ewha Authors
정제영이선복
SCOPUS Author ID
정제영scopusscopus; 이선복scopus
Issue Date
2019
Journal Title
APPLIED SCIENCES-BASEL
ISSN
2076-3417JCR Link
Citation
APPLIED SCIENCES-BASEL vol. 9, no. 15
Keywords
dropoutmachine learningbig dataclass-imbalanceoversamplingensemble
Publisher
MDPI
Indexed
SCIE; SCOPUS WOS
Document Type
Article
Abstract
A dropout early warning system enables schools to preemptively identify students who are at risk of dropping out of school, to promptly react to them, and eventually to help potential dropout students to continue their learning for a better future. However, the inherent class imbalance between dropout and non-dropout students could pose difficulty in building accurate predictive modeling for a dropout early warning system. The present study aimed to improve the performance of a dropout early warning system: (a) by addressing the class imbalance issue using the synthetic minority oversampling techniques (SMOTE) and the ensemble methods in machine learning; and (b) by evaluating the trained classifiers with both receiver operating characteristic (ROC) and precision-recall (PR) curves. To that end, we trained random forest, boosted decision tree, random forest with SMOTE, and boosted decision tree with SMOTE using the big data samples of the 165,715 high school students from the National Education Information System (NEIS) in South Korea. According to our ROC and PR curve analysis, boosted decision tree showed the optimal performance.
DOI
10.3390/app9153093
Appears in Collections:
사범대학 > 교육학과 > Journal papers
Files in This Item:
The Machine Learning.pdf(409.45 kB) Download
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

BROWSE