Abstract
Prediction of academic performance of health sciences students prior to being fully engaged in academic studies will identify those students who may need early intervention. Machine learning (ML), a branch of artificial intelligence, can be used to predict the academic performance of such students and the factors that continue to impact their academic performance. Objective: To use a best fit model in ML to predict the academic performance of health science students and rank the most important factors affecting their performance. Method: The academic records of 3468 students were extracted from the student information system (SIS), which included preparatory year great point average (GPA), high school GPA, Achievement Test (AT), General Aptitude Test (GAT), and cumulative GPA upon graduation. Multiple machine learning algorithms were used to develop the best fit model to predict students' performance GPA and identify factors that contributed to GP A. Results: The best performing classifier based on area under the curve (AUC) is random forest (.773) followed by naïve bayes (.758), Support Vector Machine (.686), k-nearest neighbors (.684) and decision tree (.658), the three scoring methods showed preparatory year GPA, gender, and high school GPA were the top variables predicating student cumulative GPAs. Conclusion: Random forest model can assist college administrators and faculty in health colleges to predict which students are more likely to underperform during their undergraduate studies.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2021 20th International Symposium on Distributed Computing and Applications for Business Engineering and Science, DCABES 2021 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 33-36 |
| Number of pages | 4 |
| ISBN (Electronic) | 9781665428897 |
| DOIs | |
| State | Published - 2021 |
| Event | 20th International Symposium on Distributed Computing and Applications for Business Engineering and Science, DCABES 2021 - Nanning, China Duration: 10 Dec 2021 → 12 Dec 2021 |
Publication series
| Name | Proceedings - 2021 20th International Symposium on Distributed Computing and Applications for Business Engineering and Science, DCABES 2021 |
|---|
Conference
| Conference | 20th International Symposium on Distributed Computing and Applications for Business Engineering and Science, DCABES 2021 |
|---|---|
| Country/Territory | China |
| City | Nanning |
| Period | 10/12/21 → 12/12/21 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 4 Quality Education
Keywords
- algorithms
- Classifiers
- GPA
- Machine learning
- ML
Fingerprint
Dive into the research topics of 'Machine Learning Techniques to Predict Academic Performance of Health Sciences Students'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver