Grounding the Comparative Turn in Communications: A Framework for Validating Multilingual Computational Text Analysis

  • Fabienne Lind
  • , Martijn Schoonvelde
  • , Christian Baden
  • , Alona O. Dolinsky
  • , Christian Pipal
  • , Mariken A.C.G. van der Velden

Research output: Contribution to journalArticlepeer-review

Abstract

Following the progressing internationalisation of social science research and the computational turn in the field, researchers are increasingly adopting computational text analysis (CTA) methods to compare textual data across multiple cases and languages. In these settings, it is not only the mapping between construct and measures that requires validation, but also the equivalence of this mapping across languages and cases. However, although the validation requirements in multilingual analyses exceed those in monolingual studies, current research shows that validation is often insufficiently and inconsistently addressed in comparative multilingual CTA. To support more robust comparative research, this article presents a framework for validating findings obtained from multilingual textual data. The framework outlines validation strategies for four key stages of a typical multilingual CTA workflow: corpus, input data, process, and output. It directly tackles the challenge of approaching equivalence across contexts and languages in these stages and moves beyond the common practice of identifying problems only at the final stage of research.

Original languageEnglish
JournalComputational Communication Research
Volume7
Issue number1
DOIs
StatePublished - 2025

Bibliographical note

Publisher Copyright:
© The author(s).

Keywords

  • comparative research
  • computational text analysis
  • cross-lingual
  • internationalisation
  • text as data
  • validation framework

Fingerprint

Dive into the research topics of 'Grounding the Comparative Turn in Communications: A Framework for Validating Multilingual Computational Text Analysis'. Together they form a unique fingerprint.

Cite this