TY - JOUR
T1 - You are what you click
T2 - using machine learning to model trace data for psychometric measurement
AU - Landers, Richard N.
AU - Auer, Elena M
AU - Mersy, Gabriel
AU - Marin, Sebastian
AU - Blaik, Jason
N1 - Publisher Copyright:
© Copyright © 2022 The Author(s). Published with license by Taylor & Francis Group, LLC.
PY - 2022
Y1 - 2022
N2 - Assessment trace data, such as mouse positions and their timing, offer interesting and provocative reflections of individual differences yet are currently underutilized by testing professionals. In this article, we present a 10-step procedure to maximize the probability that a trace data modeling project will be successful: 1) grounding the project in psychometric theory, 2) building technical infrastructure to collect trace data, 3) designing a useful developmental validation study, 4) using a holdout validation approach with collected data, 5) using exploratory analysis to conduct meaningful feature engineering, 6) identifying useful machine learning algorithms to predict a thoughtfully chosen criterion, 7) engineering a machine learning model with meaningful internal cross-validation and hyperparameter selection, 8) conducting model diagnostics to assess if the resulting model is overfitted, underfitted, or within acceptable tolerance, and 9) testing the success of the final model in meeting conceptual, technical, and psychometric goals. If deemed successful, trace data model predictions could then be engineered into decision-making systems. We present this framework within the broader view of psychometrics, exploring the challenges of developing psychometrically valid models using such complex data with much weaker trait signals than assessment developers have typically attempted to model.
AB - Assessment trace data, such as mouse positions and their timing, offer interesting and provocative reflections of individual differences yet are currently underutilized by testing professionals. In this article, we present a 10-step procedure to maximize the probability that a trace data modeling project will be successful: 1) grounding the project in psychometric theory, 2) building technical infrastructure to collect trace data, 3) designing a useful developmental validation study, 4) using a holdout validation approach with collected data, 5) using exploratory analysis to conduct meaningful feature engineering, 6) identifying useful machine learning algorithms to predict a thoughtfully chosen criterion, 7) engineering a machine learning model with meaningful internal cross-validation and hyperparameter selection, 8) conducting model diagnostics to assess if the resulting model is overfitted, underfitted, or within acceptable tolerance, and 9) testing the success of the final model in meeting conceptual, technical, and psychometric goals. If deemed successful, trace data model predictions could then be engineered into decision-making systems. We present this framework within the broader view of psychometrics, exploring the challenges of developing psychometrically valid models using such complex data with much weaker trait signals than assessment developers have typically attempted to model.
KW - Machine learning
KW - data science
KW - mousetrap
KW - psychometric
KW - trace data
UR - http://www.scopus.com/inward/record.url?scp=85142228554&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85142228554&partnerID=8YFLogxK
U2 - 10.1080/15305058.2022.2134394
DO - 10.1080/15305058.2022.2134394
M3 - Article
AN - SCOPUS:85142228554
SN - 1530-5058
VL - 22
SP - 243
EP - 263
JO - International Journal of Testing
JF - International Journal of Testing
IS - 3-4
ER -