Computational methods for the analysis of fiction genres

  • Cranenburgh Andreas van
  • , Allen Laura
  • , Sharoff Serge
  • , Dalen Oskam Karina van

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

This chapter presents a multimethod, multidisciplinary analysis of genre in a large dataset of 9,800 English novels, in order to deepen our understanding of aspects of fiction genres and subgenres. We specifically focus on applying well-established, interpretable methods, in order to benefit scholars from a variety of disciplines. Objects of our analysis are written texts and the linguistic features of the texts. We approach the analysis from two directions: data-driven, with topic modeling of content words, and theory-driven, with features Douglas Biber selected for his research on register, for example, in his 1988 book Variation across Speech and Writing, as well as simple readability metrics. We illustrate these methods by applying them to a corpus of fiction (novels). The texts in our corpora are English, but our methods aim to be also applicable to corpora in other languages. The research questions we try to answer with the proposed methods are whether different kinds of novels (“subgenres”) can be distinguished from each other in their use of linguistic features, and what the results of the computational methods can reveal to researchers to assist them in a renewed qualitative analysis of the texts or in phrasing new hypotheses for further research. Code and data are available at https://github.com/andreasvc/fictiongenres/" xmlns:xlink="https://www.w3.org/1999/xlink">https://github.com/andreasvc/fictiongenres/.

Original languageEnglish (US)
Title of host publicationMultidisciplinary Views on Discourse Genre
Subtitle of host publicationa Research Agenda
PublisherTaylor and Francis
Pages135-167
Number of pages33
ISBN (Electronic)9781040106211
ISBN (Print)9781032371610
DOIs
StatePublished - Jan 1 2024

Bibliographical note

Publisher Copyright:
© 2025 selection and editorial matter, Ninke Stukker, John A. Bateman, Danielle McNamara, and Wilbert Spooren; individual chapters, the contributors. All rights reserved.

Fingerprint

Dive into the research topics of 'Computational methods for the analysis of fiction genres'. Together they form a unique fingerprint.

Cite this