The tower of Babel meets web 2.0: User-generated content and its applications in a multilingual context

Brent Hecht, Darren Gergle

Research output: Chapter in Book/Report/Conference proceedingConference contribution

105 Scopus citations

Abstract

This study explores language's fragmenting effect on user-generated content by examining the diversity of knowledge representations across 25 different Wikipedia language editions. This diversity is measured at two levels: the concepts that are included in each edition and the ways in which these concepts are described. We demonstrate that the diversity present is greater than has been presumed in the literature and has a significant influence on applications that use Wikipedia as a source of world knowledge. We close by explicating how knowledge diversity can be beneficially leveraged to create "culturally- aware applications" and "hyperlingual applications".

Original languageEnglish (US)
Title of host publicationCHI 2010 - The 28th Annual CHI Conference on Human Factors in Computing Systems, Conference Proceedings
Pages291-300
Number of pages10
DOIs
StatePublished - Jul 1 2010
Event28th Annual CHI Conference on Human Factors in Computing Systems, CHI 2010 - Atlanta, GA, United States
Duration: Apr 10 2010Apr 15 2010

Publication series

NameConference on Human Factors in Computing Systems - Proceedings
Volume1

Other

Other28th Annual CHI Conference on Human Factors in Computing Systems, CHI 2010
CountryUnited States
CityAtlanta, GA
Period4/10/104/15/10

Keywords

  • explicit semantic analysis
  • hyperlingual
  • knowledge diversity
  • language
  • multilingual
  • semantic relatedness
  • wikipedia

Fingerprint Dive into the research topics of 'The tower of Babel meets web 2.0: User-generated content and its applications in a multilingual context'. Together they form a unique fingerprint.

Cite this