DOI: 10.18413/2313-8912-2023-9-1-1-2

Parametrizing number variation in Russian noun phrases with experimental studies and language modeling

Kseniia A. Studenikina (Lomonosov Moscow State University, Russia)

The study deals with the problem of agreement with coordinated constructions and number form alternation. The target, agreeing with two conjoined singular nouns, copies either plural or singular number feature. This paper focuses on the syntax of Russian noun phrases with coordinated modifiers which demonstrate the number variation of the agreement controller. If two conjoined singular adjectives have split interpretations, both singular and plural nouns are acceptable. In contrast to previous studies relying on introspection and corpus, we parametrize the number variation based on the results of the self-paced acceptability experiments (Likert scale 1-7). We compare the data about human perception with the language probabilities predicted by a neural model for text generation ruGPT-3. Two case studies were conducted to analyze morphological and syntactic factors parametrizing variation. The first study considers the impact of noun morphology on number alternation. The second study examines the effect of the premodifier attributive agreement. Both offline acceptability scores and online reading time demonstrate that the observed morphological and syntactic factors should be considered while parametrizing number variation in Russian noun phrases with coordinated modifiers. The ruGPT-3 language model, trained on a vast collection of Russian texts, manages to predict the correct probability for highly acceptable and highly unacceptable sentences, but it fails to assign accurate probability values to the cases of variation.

Keywords: Coordination, Agreement variation, Russian, Experimental syntax, Language modeling, Acceptability, Perplexity.

Figures

Number of views: 1810 (view statistics)

Количество скачиваний: 4229

Full text (HTML)Full text (PDF)To articles list

Information for citation:

Studenikina, K. A. (2023). Parametrizing number variation in Russian noun phrases with experimental studies and language modeling, Research Result. Theoretical and Applied Linguistics, 9 (1), 192-205. DOI: 10.18413/2313-8912-2023-9-1-1-2

User comments
Reference lists
Thanks

While nobody left any comments to this publication.
You can be first.

Aaronson, D. and Scarborough, H. S. (1976). Performance theories for sentence coding: Some quantitative evidence, Journal of Experimental Psychology: Human perception and performance, 2 (1), 56–70. https://doi.org/10.1037/0096-1523.2.1.56(In English)

Barros, M. and Vicente, L. (2011). Right node raising requires both ellipsis and multidomination, University of Pennsylvania working papers in linguistics, 17 (1), 1–9. https://doi.org/10.22099/jill.2022.41795.1275(In English)

Belova, D., Demina, J., Gerasimova, A., Lyutikova, E., Morgunova, E., Petelin, D., Studenikina, K. and Voznesenskaya, A. (2021). Russkije ostrova v svete eksperimental’nyh dannyh [Russian islands in the light of experimental data], E. Lyutikova, A. Gerasimova (eds.), Buki Vedi, Moscow, Russia. (In Russian)

Brunato, D., Chesi, C., Dell’Orletta, F., Montemagni, S., Venturi, G. and Zamparelli, R. (2020). AcCompl-it @ EVALITA2020: Overview of the Acceptability & Complexity Evaluation Task for Italian, International Workshop on Evaluation of Natural Language and Speech Tools for Italian, 1–8. (In English)

Corbett, G. G. (1979). The agreement hierarchy, Journal of linguistics, 15 (2), 203-224. https://doi.org/10.1017/S0022226700016352(In English)

Dou, Y., Forbes, M., Koncel-Kedziorski, R., Smith, N. A. and Choi, Y. (2022). Is GPT-3 Text Indistinguishable from Human Text? Scarecrow: A Framework for Scrutinizing Machine Text, Proceedings of the 60th Annual Meeting of the Association for Computational, Association for Computational Linguistics, Dublin, Ireland, 7250–7274. http://dx.doi.org/10.18653/v1/2022.acl-long.501(In English)

Gerasimova, A. (2018). Rassoglasovanie po rodu v russkom yazyke (eksperimental'noe issledovanie) [Mixed agreement patterns in Russian (experimental study)], Izvestiia Rossiiskoi akademii nauk. Seriia literatury i iazyka, 77 (1), 65–71. (In Russian)

Gold, J. W. (2021). Locus of Gender Resolution: on Goal or on Probe? Workshop on Agreement in Multivaluation Construction (AMC 2021). (In English)

Gries, S. T. (2021). Statistics for Linguistics with R: A Practical Introduction, De Gruyter Mouton, Berlin/Boston, Germany/USA. https://doi.org/10.1515/9783110718256(In English)

Grosz, P. G. (2015). Movement and agreement in right-node-raising constructions, Syntax, 18 (1), 1–38. https://doi.org/10.1111/synt.12024(In English)

Harizanov, B. and Gribanova, V. (2015). How across-the-board movement interacts with nominal concord in Bulgarian, Proceedings from the Annual Meeting of the Chicago Linguistics Society 49, University of Chicago, IL, Chicago Linguistics Society, 115–129. (In English)

Hartmann, K. (2000). Right node raising and gapping: Interface conditions on prosodic deletion, John Benjamins, Amsterdam, Netherlands. https://doi.org/10.1075/z.106(In English)

Kodzasov, S. (1987). Chislo v sochinitel'nyx konstrukcijax [Number in coordinated structures], in Kodzasov, S., Laufer, N. and Savina, E. (eds.), Modelirovanije jazykovoj dejatel’nosti v intellektual’nyx sistemax, Moscow, Russia, 204–219. (In Russian)

Lau, J. H., Armendariz, C., Lappin, S., Purver, M. and Shu, C. (2020). How furiously can colorless green ideas sleep? Sentence acceptability in context, Transactions of the Association for Computational Linguistics, 8, 296–310. https://doi.org/10.1162/tacl_a_00315(In English)

Lau, J. H., Clark, A. and Lappin, S. (2017). Grammaticality, acceptability, and probability: A probabilistic view of linguistic knowledge, Cognitive science, 41 (5), 1202–1241. https://doi.org/10.1111/cogs.12414(In English)

Likert, R. (1932). A technique for the measurement of attitudes, Archives of Psychology, 140, 1–55. (In English)

Schütze, C. and Sprouse, J. (2014). Judgment data, in Sharma, D. and Podesva, R. (eds.), Research methods in linguistics, Cambridge: Cambridge University Press, 27–50. (In English)

Shen, Z. (2018). Feature arithmetic in the nominal domain, Ph.D. Thesis, University of Connecticut, Storrs, CT. (In English)

Sprouse, J., Schütze, C. T. and Almeida, D. (2013). A comparison of informal and formal acceptability judgments using a random sample from Linguistic Inquiry 2001–2010, Lingua, 134, 219–248. https://doi.org/10.1016/j.lingua.2013.07.002(In English)

Sprouse, J., Yankama, B., Indurkhya S., Fong, S. and Berwick, R. C. (2018). Colorless green ideas do sleep furiously: gradient acceptability and the nature of the grammar, The Linguistic Review, 35 (3), 575–599. https://doi.org/10.1515/tlr-2018-0005(In English)

Talmy, L. (2018). Introspection as a Methodology in Linguistics, Ten lectures on cognitive semantics, Brill, Leiden, 218–262. https://doi.org/10.1163/9789004349575_007 (In English)

Warstadt, A., Singh, A. and Bowman, S. R. (2019). Neural network acceptability judgments, Transactions of the Association for Computational Linguistics, 7, 625–641. https://doi.org/10.1162/tacl_a_00290(In English)

Wilder, C. (1997). Some properties of ellipsis in coordination, in Alexiadou, A. and Hall, T. A. (eds.), Studies in universal grammar and typological variation, Amsterdam, 59–108. https://doi.org/10.1075/la.13.04wil (In English)

Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A. and Rush, A. M. (2020). Transformers: State-of-the-art natural language processing, Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations, Association for Computational Linguistics, 38–45. http://dx.doi.org/10.18653/v1/2020.emnlp-demos.6(In English)

All journals

Send article

Research Result. Theoretical and Applied Linguistics is included in the scientific database of the RINTs (license agreement No. 765-12/2014 dated 08.12.2014).

Журнал включен в перечень рецензируемых научных изданий, рекомендуемых ВАК

The journal is indexed by the following scientific databases and platforms

Research Result. Research Result. Theoretical and Applied Linguistics (ISSN 2313-8912)

The journal materials and website are licensed under Creative Commons «Attribution» 4.0 International.

The Founder: Federal State Autonomous Educational Institution of Higher Education "Belgorod National Research University"The Founder’s address: 85 Pobedy Street, Belgorod, the Belgorod region, 308015, Russia

The Publisher: Federal State Autonomous Educational Institution of HigherEducation "Belgorod National Research University" The Founder’s address:85 Pobedy Street, Belgorod, the Belgorod region, 308015, Russia

Editors Office: chief editor Olga Dekhnich, e-mail: RR_Linguistics@bsuedu.ru, phone: (4722) 301254.

Registered by the Federal Service for Supervision of Communications, Information Technology and Mass Media (Roskomnadzor)

Certificate

Charter of the editorial board of the mass media "Research Result. Theoretical and Applied Linguistics"

Order No. 636-OD dated 30.06.2023 "On approval of the Charters of the editorial boards of the mass media of scientific journals of Belgorod State National Research University"

Order No. 1097-OD from 15.11.2023 "On approval of the Regulations for the publication of scientific journals of Belgorod State National Research University"

Have questions?
You can write to us:

✉ Executive Secretary

✉ Site administration

✉ Content manager