Machine Learning Techniques to Predict Academic Performance of Health Sciences Students

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Prediction of academic performance of health sciences students prior to being fully engaged in academic studies will identify those students who may need early intervention. Machine learning (ML), a branch of artificial intelligence, can be used to predict the academic performance of such students and the factors that continue to impact their academic performance. Objective: To use a best fit model in ML to predict the academic performance of health science students and rank the most important factors affecting their performance. Method: The academic records of 3468 students were extracted from the student information system (SIS), which included preparatory year great point average (GPA), high school GPA, Achievement Test (AT), General Aptitude Test (GAT), and cumulative GPA upon graduation. Multiple machine learning algorithms were used to develop the best fit model to predict students' performance GPA and identify factors that contributed to GP A. Results: The best performing classifier based on area under the curve (AUC) is random forest (.773) followed by naïve bayes (.758), Support Vector Machine (.686), k-nearest neighbors (.684) and decision tree (.658), the three scoring methods showed preparatory year GPA, gender, and high school GPA were the top variables predicating student cumulative GPAs. Conclusion: Random forest model can assist college administrators and faculty in health colleges to predict which students are more likely to underperform during their undergraduate studies.

Original languageEnglish
Title of host publicationProceedings - 2021 20th International Symposium on Distributed Computing and Applications for Business Engineering and Science, DCABES 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages33-36
Number of pages4
ISBN (Electronic)9781665428897
DOIs
StatePublished - 2021
Event20th International Symposium on Distributed Computing and Applications for Business Engineering and Science, DCABES 2021 - Nanning, China
Duration: 10 Dec 202112 Dec 2021

Publication series

NameProceedings - 2021 20th International Symposium on Distributed Computing and Applications for Business Engineering and Science, DCABES 2021

Conference

Conference20th International Symposium on Distributed Computing and Applications for Business Engineering and Science, DCABES 2021
Country/TerritoryChina
CityNanning
Period10/12/2112/12/21

Keywords

  • algorithms
  • Classifiers
  • GPA
  • Machine learning
  • ML

Fingerprint

Dive into the research topics of 'Machine Learning Techniques to Predict Academic Performance of Health Sciences Students'. Together they form a unique fingerprint.

Cite this