Claim Detection and Relationship with Writing Quality

Qian Wan, Scott Crossley, Laura Allen, Danielle McNamara

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we extracted content-based and structure-based features of text to predict human annotations for claims and non-claims in argumentative essays. We compared Logistic Regression, Bernoulli Naive Bayes, Gaussian Naive Bayes, Linear Support Vector Classification, Random Forest, and Neural Networks to train classification models. Random Forest and Neural Network classifiers yielded the most balanced identifications of claims and non-claims based on the evaluation of accuracy, precision, and recall. The Random Forest model was then used to calculate the number, percentage, and positionality of claims and non-claims in a validation corpus that included human ratings of writing quality. Correlational and regression analyses indicated that the number of claims and the average position of non-claims in text were significant indicators of essay quality in the expected direction.

Original languageEnglish (US)
Title of host publicationProceedings of the 13th International Conference on Educational Data Mining, EDM 2020
EditorsAnna N. Rafferty, Jacob Whitehill, Cristobal Romero, Violetta Cavalli-Sforza
PublisherInternational Educational Data Mining Society
Pages691-695
Number of pages5
ISBN (Electronic)9781733673617
StatePublished - 2020
Externally publishedYes
Event13th International Conference on Educational Data Mining, EDM 2020 - Virtual, Online
Duration: Jul 10 2020Jul 13 2020

Publication series

NameProceedings of the 13th International Conference on Educational Data Mining, EDM 2020

Conference

Conference13th International Conference on Educational Data Mining, EDM 2020
CityVirtual, Online
Period7/10/207/13/20

Bibliographical note

Publisher Copyright:
© 2020 Proceedings of the 13th International Conference on Educational Data Mining, EDM 2020. All rights reserved.

Keywords

  • argument mining
  • automated essay evaluation
  • claim detection
  • essay quality
  • natural language processing

Fingerprint

Dive into the research topics of 'Claim Detection and Relationship with Writing Quality'. Together they form a unique fingerprint.

Cite this