Comparison of Machine Learning Performance Using Analytic and Holistic Coding Approaches across Constructed Response Assessments Aligned to a Science Learning Progression

التفاصيل البيبلوغرافية
العنوان:	Comparison of Machine Learning Performance Using Analytic and Holistic Coding Approaches across Constructed Response Assessments Aligned to a Science Learning Progression
اللغة:	English
المؤلفون:	Jescovitch, Lauren N. (ORCID 0000-0002-8786-1297), Scott, Emily E. (ORCID 0000-0002-4545-9910), Cerchiara, Jack A. (ORCID 0000-0002-3606-7966), Merrill, John, Urban-Lurain, Mark (ORCID 0000-0002-2243-8252), Doherty, Jennifer H. (ORCID 0000-0002-2333-1692), Haudek, Kevin C. (ORCID 0000-0003-1422-6038)
المصدر:	Journal of Science Education and Technology. Apr 2021 30(2):150-167.
الاتاحة:	Springer. Available from: Springer Nature. One New York Plaza, Suite 4600, New York, NY 10004. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-460-1700; e-mail: customerservice@springernature.com; Web site: https://link.springer.com/
Peer Reviewed:	Y
Page Count:	18
تاريخ النشر:	2021
Sponsoring Agency:	National Science Foundation (NSF), Division of Undergraduate Education (DUE)
Contract Number:	1660643 1661263
نوع الوثيقة:	Journal Articles Reports - Research
Education Level:	Higher Education Postsecondary Education
Descriptors:	Science Instruction, Coding, Artificial Intelligence, Man Machine Systems, Holistic Approach, Logical Thinking, Physiology, Learning Processes, Responses, Classification, Undergraduate Students, Scoring Rubrics, Scoring, Models
DOI:	10.1007/s10956-020-09858-0
تدمد:	1059-0145
مستخلص:	We systematically compared two coding approaches to generate training datasets for machine learning (ML): (1) a holistic approach based on learning progression levels; and (2) a dichotomous, analytic approach of multiple concepts in student reasoning, deconstructed from holistic rubrics. We evaluated four constructed response assessment items for undergraduate physiology, each targeting five levels of a developing flux learning progression in an ion context. Human-coded datasets were used to train two ML models: (1) an 8-classification algorithm ensemble implemented in the Constructed Response Classifier (CRC); and (2) a single classification algorithm implemented in LightSide Researcher's Workbench. Human coding agreement on approximately 700 student responses per item was high for both approaches with Cohen's kappas ranging from 0.75 to 0.87 on holistic scoring and from 0.78 to 0.89 on analytic composite scoring. ML model performance varied across items and rubric type. For two items, training sets from both coding approaches produced similarly accurate ML models, with differences in Cohen's kappa between machine and human scores of 0.002 and 0.041. For the other items, ML models trained with analytic coded responses and used for a composite score, achieved better performance as compared to using holistic scores for training, with increases in Cohen's kappa of 0.043 and 0.117. These items used a more complex scenario involving movement of two ions. It may be that analytic coding is beneficial to unpacking this additional complexity.
Abstractor:	As Provided
Entry Date:	2021
رقم الانضمام:	EJ1292769
قاعدة البيانات:	ERIC

الوصف
تدمد:	1059-0145
DOI:	10.1007/s10956-020-09858-0