Time evolution of writing styles in Romanian language

Daniela Gifu, Mihai Dascalu, Stefan Trausan-Matu, Laura K. Allen

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

5 Scopus citations

Abstract

This paper presents a diachronic analysis of differences between the writing styles of journalistic texts in the Romanian language. The analysis focuses on the time evolution of the language across two adjacent regions, Bessarabia and Romania, in two major periods marked by important historical differences. Our aim is to examine these language differences based on corpora of historical and contemporary texts. To this end, we employ the ReaderBench framework to calculate a number of textual complexity indices that can be reliably used to characterize writing style. These analyses are conducted on two independent corpora for each of the two language styles, covering the following time periods: 1941-1991, when Bessarabia was separated from Romania and became a state in the Soviet Union (with few connections to, and little language influence from, Romania), and after July 1991, when Bessarabia became an independent state, the Republic of Moldavia (and many language interactions with Romania occurred). The results of our analyses highlight the lexical and cohesive textual complexity indices that best reflect the differences in writing style, ranging from sentence and paragraph structure to word entropy and cohesion, measured in terms of Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA).

Original language: English (US)
Title of host publication: Proceedings - 2016 IEEE 28th International Conference on Tools with Artificial Intelligence, ICTAI 2016
Editors: Anna Esposito, Miltos Alamaniotis, Amol Mali, Nikolaos Bourbakis
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1048-1054
Number of pages: 7
ISBN (Electronic): 9781509044597
State: Published - Jan 11 2017
Externally published: Yes
Event: 28th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2016 - San Jose, United States
Duration: Nov 6 2016 to Nov 8 2016

Publication series

Name: Proceedings - 2016 IEEE 28th International Conference on Tools with Artificial Intelligence, ICTAI 2016

Conference

Conference: 28th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2016
Country/Territory: United States
City: San Jose
Period: 11/6/16 to 11/8/16

Bibliographical note

Publisher Copyright:
© 2016 IEEE.

Keywords

  • Comparable corpora
  • Language similarity
  • Textual complexity
  • Time periods and geographic regions
  • Writing style
