TY - JOUR
T1 - Skewed distributions facilitate infants' word segmentation
AU - Wolters, Lucie
AU - Ota, Mitsuhiko
AU - Arnon, Inbal
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2025/10
Y1 - 2025/10
N2 - Infants can use statistical patterns to segment continuous speech into words, a crucial task in language acquisition. Experimental studies typically investigate this ability using artificial languages with a uniform frequency distribution, where all words occur equally often. However, words in natural language follow a highly skewed distribution conforming to a Zipfian power law, in which few words occur frequently while many occur infrequently. Prior work shows that such skewed distributions facilitate word segmentation, but the experimental evidence for this has been limited to individuals aged ten years or older, leaving unclear whether this effect arises from accumulated linguistic experience or is already present in the early stages of language learning. To address this, we conducted a word segmentation study with 7- to 9-month-old infants. Infants were exposed to a continuous speech stream containing four artificial words, presented either in a uniform or skewed frequency distribution. We found that infants exposed to the skewed distribution showed a greater looking time difference between familiar and unfamiliar words compared to those in the uniform condition. These findings suggest that skewed distributions facilitate learning during early linguistic development, highlighting the impact of such distributions on language acquisition. Moreover, these findings suggest that the widespread use of uniform distributions in lab-based studies may underestimate infants' segmentation abilities.
AB - Infants can use statistical patterns to segment continuous speech into words, a crucial task in language acquisition. Experimental studies typically investigate this ability using artificial languages with a uniform frequency distribution, where all words occur equally often. However, words in natural language follow a highly skewed distribution conforming to a Zipfian power law, in which few words occur frequently while many occur infrequently. Prior work shows that such skewed distributions facilitate word segmentation, but the experimental evidence for this has been limited to individuals aged ten years or older, leaving unclear whether this effect arises from accumulated linguistic experience or is already present in the early stages of language learning. To address this, we conducted a word segmentation study with 7- to 9-month-old infants. Infants were exposed to a continuous speech stream containing four artificial words, presented either in a uniform or skewed frequency distribution. We found that infants exposed to the skewed distribution showed a greater looking time difference between familiar and unfamiliar words compared to those in the uniform condition. These findings suggest that skewed distributions facilitate learning during early linguistic development, highlighting the impact of such distributions on language acquisition. Moreover, these findings suggest that the widespread use of uniform distributions in lab-based studies may underestimate infants' segmentation abilities.
KW - Infants
KW - Language acquisition
KW - Skewed distribution
KW - Statistical learning
KW - Word segmentation
UR - http://www.scopus.com/inward/record.url?scp=105008089996&partnerID=8YFLogxK
U2 - 10.1016/j.cognition.2025.106221
DO - 10.1016/j.cognition.2025.106221
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:105008089996
SN - 0010-0277
VL - 263
JO - Cognition
JF - Cognition
M1 - 106221
ER -