Text categorization models for identifying unproven cancer treatments on the web.

Yin Aphinyanaphongs, Constantin Aliferis

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

The nature of the internet as a non-peer-reviewed (and largely unregulated) publication medium has allowed wide-spread promotion of inaccurate and unproven medical claims in unprecedented scale. Patients with conditions that are not currently fully treatable are particularly susceptible to unproven and dangerous promises about miracle treatments. In extreme cases, fatal adverse outcomes have been documented. Most commonly, the cost is financial, psychological, and delayed application of imperfect but proven scientific modalities. To help protect patients, who may be desperately ill and thus prone to exploitation, we explored the use of machine learning techniques to identify web pages that make unproven claims. This feasibility study shows that the resulting models can identify web pages that make unproven claims in a fully automatic manner, and substantially better than previous web tools and state-of-the-art search engine technology.

Original languageEnglish (US)
Pages (from-to)968-972
Number of pages5
JournalMedinfo. MEDINFO
Volume12
Issue numberPt 2
StatePublished - Nov 8 2007

Fingerprint

Dive into the research topics of 'Text categorization models for identifying unproven cancer treatments on the web.'. Together they form a unique fingerprint.

Cite this