TY - JOUR
T1 - Three Gaps in Computational Text Analysis Methods for Social Sciences
T2 - A Research Agenda
AU - Baden, Christian
AU - Pipal, Christian
AU - Schoonvelde, Martijn
AU - van der Velden, Mariken A.C.G.
N1 - Publisher Copyright:
© 2021 The Author(s). Published with license by Taylor & Francis Group, LLC.
PY - 2022
Y1 - 2022
N2 - We identify three gaps that limit the utility and obstruct the progress of computational text analysis methods (CTAM) for social science research. First, we contend that CTAM development has prioritized technological over validity concerns, giving limited attention to the operationalization of social scientific measurements. Second, we identify a mismatch between CTAMs’ focus on extracting specific contents and document-level patterns, and social science researchers’ need for measuring multiple, often complex contents in the text. Third, we argue that the dominance of English language tools depresses comparative research and inclusivity toward scholarly communities examining languages other than English. We substantiate our claims by drawing upon a broad review of methodological work in the computational social sciences, as well as an inventory of leading research publications using quantitative textual analysis. Subsequently, we discuss implications of these three gaps for social scientists’ uneven uptake of CTAM, as well as the field of computational social science text research as a whole. Finally, we propose a research agenda intended to bridge the identified gaps and improve the validity, utility, and inclusiveness of CTAM.
AB - We identify three gaps that limit the utility and obstruct the progress of computational text analysis methods (CTAM) for social science research. First, we contend that CTAM development has prioritized technological over validity concerns, giving limited attention to the operationalization of social scientific measurements. Second, we identify a mismatch between CTAMs’ focus on extracting specific contents and document-level patterns, and social science researchers’ need for measuring multiple, often complex contents in the text. Third, we argue that the dominance of English language tools depresses comparative research and inclusivity toward scholarly communities examining languages other than English. We substantiate our claims by drawing upon a broad review of methodological work in the computational social sciences, as well as an inventory of leading research publications using quantitative textual analysis. Subsequently, we discuss implications of these three gaps for social scientists’ uneven uptake of CTAM, as well as the field of computational social science text research as a whole. Finally, we propose a research agenda intended to bridge the identified gaps and improve the validity, utility, and inclusiveness of CTAM.
UR - http://www.scopus.com/inward/record.url?scp=85121829668&partnerID=8YFLogxK
U2 - 10.1080/19312458.2021.2015574
DO - 10.1080/19312458.2021.2015574
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85121829668
SN - 1931-2458
VL - 16
SP - 1
EP - 18
JO - Communication Methods and Measures
JF - Communication Methods and Measures
IS - 1
ER -