TY - JOUR
T1 - A decision support system for cost-effective diagnosis
AU - Chi, Chih Lin
AU - Street, W. Nick
AU - Katz, David A.
PY - 2010/11/1
Y1 - 2010/11/1
N2 - Objective: Speed, cost, and accuracy are three important goals in disease diagnosis. This paper proposes a machine learning-based expert system algorithm to optimize these goals and assist diagnostic decisions in a sequential decision-making setting. Methods: The algorithm consists of three components that work together to identify the sequence of diagnostic tests that attains the treatment or no test threshold probability for a query case with adequate certainty: lazy-learning classifiers, confident diagnosis, and locally sequential feature selection (LSFS). Speed-based and cost-based objective functions can be used as criteria to select tests. Results: Results of four different datasets are consistent. All LSFS functions significantly reduce tests and costs. Average cost savings for heart disease, thyroid disease, diabetes, and hepatitis datasets are 50%, 57%, 22%, and 34%, respectively. Average test savings are 55%, 73%, 24%, and 39%, respectively. Accuracies are similar to or better than the baseline (the classifier that uses all available tests in the dataset). Conclusion: We have demonstrated a new approach that dynamically estimates and determines the optimal sequence of tests that provides the most information (or disease probability) based on a patient's available information.
AB - Objective: Speed, cost, and accuracy are three important goals in disease diagnosis. This paper proposes a machine learning-based expert system algorithm to optimize these goals and assist diagnostic decisions in a sequential decision-making setting. Methods: The algorithm consists of three components that work together to identify the sequence of diagnostic tests that attains the treatment or no test threshold probability for a query case with adequate certainty: lazy-learning classifiers, confident diagnosis, and locally sequential feature selection (LSFS). Speed-based and cost-based objective functions can be used as criteria to select tests. Results: Results of four different datasets are consistent. All LSFS functions significantly reduce tests and costs. Average cost savings for heart disease, thyroid disease, diabetes, and hepatitis datasets are 50%, 57%, 22%, and 34%, respectively. Average test savings are 55%, 73%, 24%, and 39%, respectively. Accuracies are similar to or better than the baseline (the classifier that uses all available tests in the dataset). Conclusion: We have demonstrated a new approach that dynamically estimates and determines the optimal sequence of tests that provides the most information (or disease probability) based on a patient's available information.
KW - Cost-effective diagnosis
KW - Decision support systems
KW - Feature selection
KW - Machine learning
KW - Optimization
KW - Utility-based data mining
UR - http://www.scopus.com/inward/record.url?scp=78149465325&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78149465325&partnerID=8YFLogxK
U2 - 10.1016/j.artmed.2010.08.001
DO - 10.1016/j.artmed.2010.08.001
M3 - Article
C2 - 20933375
AN - SCOPUS:78149465325
SN - 0933-3657
VL - 50
SP - 149
EP - 161
JO - Artificial Intelligence in Medicine
JF - Artificial Intelligence in Medicine
IS - 3
ER -