TY - JOUR
T1 - Grounding the Comparative Turn in Communications
T2 - A Framework for Validating Multilingual Computational Text Analysis
AU - Lind, Fabienne
AU - Schoonvelde, Martijn
AU - Baden, Christian
AU - Dolinsky, Alona O.
AU - Pipal, Christian
AU - van der Velden, Mariken A.C.G.
N1 - Publisher Copyright:
© The author(s).
PY - 2025
Y1 - 2025
N2 - Following the progressing internationalisation of social science research and the computational turn in the field, researchers are increasingly adopting computational text analysis (CTA) methods to compare textual data across multiple cases and languages. In these settings, it is not only the mapping between construct and measures that requires validation, but also the equivalence of this mapping across languages and cases. However, although the validation requirements in multilingual analyses exceed those in monolingual studies, current research shows that validation is often insufficiently and inconsistently addressed in comparative multilingual CTA. To support more robust comparative research, this article presents a framework for validating findings obtained from multilingual textual data. The framework outlines validation strategies for four key stages of a typical multilingual CTA workflow: corpus, input data, process, and output. It directly tackles the challenge of approaching equivalence across contexts and languages in these stages and moves beyond the common practice of identifying problems only at the final stage of research.
AB - Following the progressing internationalisation of social science research and the computational turn in the field, researchers are increasingly adopting computational text analysis (CTA) methods to compare textual data across multiple cases and languages. In these settings, it is not only the mapping between construct and measures that requires validation, but also the equivalence of this mapping across languages and cases. However, although the validation requirements in multilingual analyses exceed those in monolingual studies, current research shows that validation is often insufficiently and inconsistently addressed in comparative multilingual CTA. To support more robust comparative research, this article presents a framework for validating findings obtained from multilingual textual data. The framework outlines validation strategies for four key stages of a typical multilingual CTA workflow: corpus, input data, process, and output. It directly tackles the challenge of approaching equivalence across contexts and languages in these stages and moves beyond the common practice of identifying problems only at the final stage of research.
KW - comparative research
KW - computational text analysis
KW - cross-lingual
KW - internationalisation
KW - text as data
KW - validation framework
UR - https://www.scopus.com/pages/publications/105022887903
U2 - 10.5117/CCR2025.1.13.LIND
DO - 10.5117/CCR2025.1.13.LIND
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:105022887903
SN - 2665-9085
VL - 7
JO - Computational Communication Research
JF - Computational Communication Research
IS - 1
ER -