Список литературы

2313-8912

Research Result. Theoretical and Applied Linguistics

2313-8912

10.18413/2313-8912-2023-9-1-1-2

3063

NEURAL NETWORKS IN NATURAL LANGUAGE PROCESSING

<strong>Parametrizing number variation in Russian noun phrases with experimental studies and language modeling</strong>

Studenikina

Kseniia A.

Studenikina

Kseniia A.

xeanst@gmail.com

Lomonosov Moscow State University, Russia

2023

9100

The study deals with the problem of agreement with coordinated constructions and number form alternation. The target, agreeing with two conjoined singular nouns, copies either plural or singular number feature. This paper focuses on the syntax of Russian noun phrases with coordinated modifiers which demonstrate the number variation of the agreement controller. If two conjoined singular adjectives have split interpretations, both singular and plural nouns are acceptable. In contrast to previous studies relying on introspection and corpus, we parametrize the number variation based on the results of the self-paced acceptability experiments (Likert scale 1-7). We compare the data about human perception with the language probabilities predicted by a neural model for text generation ruGPT-3. Two case studies were conducted to analyze morphological and syntactic factors parametrizing variation. The first study considers the impact of noun morphology on number alternation. The second study examines the effect of the premodifier attributive agreement. Both offline acceptability scores and online reading time demonstrate that the observed morphological and syntactic factors should be considered while parametrizing number variation in Russian noun phrases with coordinated modifiers. The ruGPT-3 language model, trained on a vast collection of Russian texts, manages to predict the correct probability for highly acceptable and highly unacceptable sentences, but it fails to assign accurate probability values to the cases of variation.

CoordinationAgreement variationRussianExperimental syntaxLanguage modelingAcceptabilityPerplexity

This work was supported by Non-commercial Foundation for Support of Science and Education «INTELLECT».

Список литературы

Aaronson, D. and Scarborough, H. S. (1976). Performance theories for sentence coding: Some quantitative evidence, Journal of Experimental Psychology: Human perception and performance, 2 (1), 56–70. https://doi.org/10.1037/0096-1523.2.1.56 (In English)

Barros, M. and Vicente, L. (2011). Right node raising requires both ellipsis and multidomination, University of Pennsylvania working papers in linguistics, 17 (1), 1–9. https://doi.org/10.22099/jill.2022.41795.1275 (In English)

Belova, D., Demina, J., Gerasimova, A., Lyutikova, E., Morgunova, E., Petelin, D., Studenikina, K. and Voznesenskaya, A. (2021). Russkije ostrova v svete eksperimental’nyh dannyh [Russian islands in the light of experimental data], E. Lyutikova, A. Gerasimova (eds.), Buki Vedi, Moscow, Russia. (In Russian)

Brunato, D., Chesi, C., Dell’Orletta, F., Montemagni, S., Venturi, G. and Zamparelli, R. (2020). AcCompl-it @ EVALITA2020: Overview of the Acceptability & Complexity Evaluation Task for Italian, International Workshop on Evaluation of Natural Language and Speech Tools for Italian, 1–8. (In English)

Corbett, G. G. (1979). The agreement hierarchy, Journal of linguistics, 15 (2), 203-224. https://doi.org/10.1017/S0022226700016352 (In English)

Dou, Y., Forbes, M., Koncel-Kedziorski, R., Smith, N. A. and Choi, Y. (2022). Is GPT-3 Text Indistinguishable from Human Text? Scarecrow: A Framework for Scrutinizing Machine Text, Proceedings of the 60th Annual Meeting of the Association for Computational, Association for Computational Linguistics, Dublin, Ireland, 7250–7274. http://dx.doi.org/10.18653/v1/2022.acl-long.501 (In English)

Gerasimova, A. (2018). Rassoglasovanie po rodu v russkom yazyke (eksperimental'noe issledovanie) [Mixed agreement patterns in Russian (experimental study)], Izvestiia Rossiiskoi akademii nauk. Seriia literatury i iazyka, 77 (1), 65–71. (In Russian)

Gold, J. W. (2021). Locus of Gender Resolution: on Goal or on Probe? Workshop on Agreement in Multivaluation Construction (AMC 2021). (In English)

Gries, S. T. (2021). Statistics for Linguistics with R: A Practical Introduction, De Gruyter Mouton, Berlin/Boston, Germany/USA. https://doi.org/10.1515/9783110718256 (In English)

Grosz, P. G. (2015). Movement and agreement in right-node-raising constructions, Syntax, 18 (1), 1–38. https://doi.org/10.1111/synt.12024 (In English)

Harizanov, B. and Gribanova, V. (2015). How across-the-board movement interacts with nominal concord in Bulgarian, Proceedings from the Annual Meeting of the Chicago Linguistics Society 49, University of Chicago, IL, Chicago Linguistics Society, 115–129. (In English)

Hartmann, K. (2000). Right node raising and gapping: Interface conditions on prosodic deletion, John Benjamins, Amsterdam, Netherlands. https://doi.org/10.1075/z.106 (In English)

Kodzasov, S. (1987). Chislo v sochinitel'nyx konstrukcijax [Number in coordinated structures], in Kodzasov, S., Laufer, N. and Savina, E. (eds.), Modelirovanije jazykovoj dejatel’nosti v intellektual’nyx sistemax, Moscow, Russia, 204–219. (In Russian)

Lau, J. H., Armendariz, C., Lappin, S., Purver, M. and Shu, C. (2020). How furiously can colorless green ideas sleep? Sentence acceptability in context, Transactions of the Association for Computational Linguistics, 8, 296–310. https://doi.org/10.1162/tacl_a_00315 (In English)

Lau, J. H., Clark, A. and Lappin, S. (2017). Grammaticality, acceptability, and probability: A probabilistic view of linguistic knowledge, Cognitive science, 41 (5), 1202–1241. https://doi.org/10.1111/cogs.12414 (In English)

Likert, R. (1932). A technique for the measurement of attitudes, Archives of Psychology, 140, 1–55. (In English)

Schütze, C. and Sprouse, J. (2014). Judgment data, in Sharma, D. and Podesva, R. (eds.), Research methods in linguistics, Cambridge: Cambridge University Press, 27–50. (In English)

Shen, Z. (2018). Feature arithmetic in the nominal domain, Ph.D. Thesis, University of Connecticut, Storrs, CT. (In English)

Sprouse, J., Schütze, C. T. and Almeida, D. (2013). A comparison of informal and formal acceptability judgments using a random sample from Linguistic Inquiry 2001–2010, Lingua, 134, 219–248. https://doi.org/10.1016/j.lingua.2013.07.002 (In English)

Sprouse, J., Yankama, B., Indurkhya S., Fong, S. and Berwick, R. C. (2018). Colorless green ideas do sleep furiously: gradient acceptability and the nature of the grammar, The Linguistic Review, 35 (3), 575–599. https://doi.org/10.1515/tlr-2018-0005 (In English)

Talmy, L. (2018). Introspection as a Methodology in Linguistics, Ten lectures on cognitive semantics, Brill, Leiden, 218–262. https://doi.org/10.1163/9789004349575_007 (In English)

Warstadt, A., Singh, A. and Bowman, S. R. (2019). Neural network acceptability judgments, Transactions of the Association for Computational Linguistics, 7, 625–641. https://doi.org/10.1162/tacl_a_00290 (In English)

Wilder, C. (1997). Some properties of ellipsis in coordination, in Alexiadou, A. and Hall, T. A. (eds.), Studies in universal grammar and typological variation, Amsterdam, 59–108. https://doi.org/10.1075/la.13.04wil (In English)

Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A. and Rush, A. M. (2020). Transformers: State-of-the-art natural language processing, Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations, Association for Computational Linguistics, 38–45. http://dx.doi.org/10.18653/v1/2020.emnlp-demos.6 (In English)