How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech

Yedetore, Aditya; Linzen, Tal; Frank, Robert; McCoy, R. Thomas

Computer Science > Computation and Language

arXiv:2301.11462 (cs)

[Submitted on 26 Jan 2023 (v1), last revised 6 Jun 2023 (this version, v2)]

Title:How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech

Authors:Aditya Yedetore, Tal Linzen, Robert Frank, R. Thomas McCoy

View PDF

Abstract:When acquiring syntax, children consistently choose hierarchical rules over competing non-hierarchical possibilities. Is this preference due to a learning bias for hierarchical structure, or due to more general biases that interact with hierarchical cues in children's linguistic input? We explore these possibilities by training LSTMs and Transformers - two types of neural networks without a hierarchical bias - on data similar in quantity and content to children's linguistic input: text from the CHILDES corpus. We then evaluate what these models have learned about English yes/no questions, a phenomenon for which hierarchical structure is crucial. We find that, though they perform well at capturing the surface statistics of child-directed speech (as measured by perplexity), both model types generalize in a way more consistent with an incorrect linear rule than the correct hierarchical rule. These results suggest that human-like generalization from text alone requires stronger biases than the general sequence-processing biases of standard neural network architectures.

Comments:	10 pages plus references and appendices; accepted to ACL
Subjects:	Computation and Language (cs.CL)
ACM classes:	J.4; I.2.7
Cite as:	arXiv:2301.11462 [cs.CL]
	(or arXiv:2301.11462v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2301.11462

Submission history

From: Aditya Yedetore [view email]
[v1] Thu, 26 Jan 2023 23:24:17 UTC (1,614 KB)
[v2] Tue, 6 Jun 2023 13:40:22 UTC (8,456 KB)

Computer Science > Computation and Language

Title:How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators