<?xml version='1.0' encoding='utf-8'?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<article article-type="research-article" dtd-version="1.2" xml:lang="ru" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><front><journal-meta><journal-id journal-id-type="issn">2313-8912</journal-id><journal-title-group><journal-title>Research Result. Theoretical and Applied Linguistics</journal-title></journal-title-group><issn pub-type="epub">2313-8912</issn></journal-meta><article-meta><article-id pub-id-type="doi">10.18413/2313-8912-2024-10-4-0-8</article-id><article-id pub-id-type="publisher-id">3679</article-id><article-categories><subj-group subj-group-type="heading"><subject>Human Language Behaviour in Machine-Generated Environments</subject></subj-group></article-categories><title-group><article-title>&lt;strong&gt;Investigating between-word pause duration&amp;nbsp;in Russian typed texts using mixture modeling based on keystroke data&lt;/strong&gt;</article-title><trans-title-group xml:lang="en"><trans-title>&lt;strong&gt;Investigating between-word pause duration&amp;nbsp;in Russian typed texts using mixture modeling based on keystroke data&lt;/strong&gt;</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Litvinova</surname><given-names>Tatiana A.</given-names></name><name xml:lang="en"><surname>Litvinova</surname><given-names>Tatiana A.</given-names></name></name-alternatives><email>centr_rus_yaz@mail.ru</email><xref ref-type="aff" rid="aff1" /></contrib><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Molchanova</surname><given-names>Viktoriya A.</given-names></name><name xml:lang="en"><surname>Molchanova</surname><given-names>Viktoriya A.</given-names></name></name-alternatives><email>zva0604@yandex.ru</email><xref ref-type="aff" rid="aff2" /></contrib></contrib-group><aff id="aff2"><institution>Voronezh State Pedagogical University, Voronezh, Russia</institution></aff><aff id="aff1"><institution>Voronezh State Pedagogical University, Russia</institution></aff><pub-date pub-type="epub"><year>2024</year></pub-date><volume>10</volume><issue>4</issue><fpage>0</fpage><lpage>0</lpage><self-uri content-type="pdf" xlink:href="/media/linguistics/2024/4/Research_Result_4-42-182-201.pdf" /><abstract xml:lang="ru"><p>Keystroke logging is an objective and scalable methodology that has become the gold standard in writing research for modeling writing processes. A particularly significant aspect of this analysis is the examination of features such as pause duration, as pauses are regarded as indicators of underlying cognitive processes. Traditionally, arbitrary pause thresholds that are universally applied to all writers have been established to differentiate between cognitive and non-cognitive pauses. However, this approach presents considerable limitations and fails to account for the complexity and individual variability inherent in the cognitive processes involved in text production. Furthermore, different scholars employ varying approaches to the calculation of between-word pauses. This study is the first to analyze keystroke logs of Russian typed texts utilizing Gaussian mixture models (GMM) to cluster pause duration values at between-word boundaries. By employing keystroke logs collected from 50 university students who described the views from their home windows, we conducted a cluster analysis of pause duration values before words, after words, and between words separately. It was determined that the distribution of pauses between words cannot be characterised by a single distribution. For the majority of participants, two-component distribution provided the best fit for all three types of pauses. Additionally, we observed a high degree of individual variability in the mixing proportions of different components. This paper underscores the necessity of avoiding the imposition of fixed thresholds in pause analysis that are universally applicable to all writers and advocates for individualized and holistic approach to studying the writing process.



</p></abstract><trans-abstract xml:lang="en"><p>Keystroke logging is an objective and scalable methodology that has become the gold standard in writing research for modeling writing processes. A particularly significant aspect of this analysis is the examination of features such as pause duration, as pauses are regarded as indicators of underlying cognitive processes. Traditionally, arbitrary pause thresholds that are universally applied to all writers have been established to differentiate between cognitive and non-cognitive pauses. However, this approach presents considerable limitations and fails to account for the complexity and individual variability inherent in the cognitive processes involved in text production. Furthermore, different scholars employ varying approaches to the calculation of between-word pauses. This study is the first to analyze keystroke logs of Russian typed texts utilizing Gaussian mixture models (GMM) to cluster pause duration values at between-word boundaries. By employing keystroke logs collected from 50 university students who described the views from their home windows, we conducted a cluster analysis of pause duration values before words, after words, and between words separately. It was determined that the distribution of pauses between words cannot be characterised by a single distribution. For the majority of participants, two-component distribution provided the best fit for all three types of pauses. Additionally, we observed a high degree of individual variability in the mixing proportions of different components. This paper underscores the necessity of avoiding the imposition of fixed thresholds in pause analysis that are universally applicable to all writers and advocates for individualized and holistic approach to studying the writing process.



</p></trans-abstract><kwd-group xml:lang="ru"><kwd>Keystroke logging</kwd><kwd>Keystroke analysis</kwd><kwd>Pause</kwd><kwd>Pause duration</kwd><kwd>Writing research</kwd><kwd>Writing processes</kwd><kwd>Mixture modelling</kwd></kwd-group><kwd-group xml:lang="en"><kwd>Keystroke logging</kwd><kwd>Keystroke analysis</kwd><kwd>Pause</kwd><kwd>Pause duration</kwd><kwd>Writing research</kwd><kwd>Writing processes</kwd><kwd>Mixture modelling</kwd></kwd-group></article-meta></front><back><ack><p>The authors acknowledge the support of the Ministry of Education of the Russian Federation (the research was supported by the Ministry of Education of the Russian Federation within the framework of the State Assignment in the field of science, topic number QRPK-2024-0011).</p></ack><ref-list><title>Список литературы</title><ref id="B1"><mixed-citation>Alamargot,&amp;nbsp;D., Dansac,&amp;nbsp;C., Chesnet,&amp;nbsp;D. and Fayol,&amp;nbsp;M. (2007). Parallel processing before and after pauses: A combined analysis of graphomotor and eye movements during procedural text production, in Torrance,&amp;nbsp;M., Waes,&amp;nbsp;L.&amp;nbsp;van and Galbraith,&amp;nbsp;D. (eds.). Writing and cognition: Research and applications, Elsevier Science, 13&amp;ndash;29. (In English)</mixed-citation></ref><ref id="B2"><mixed-citation>Alves,&amp;nbsp;R.&amp;nbsp;A. and Limpo,&amp;nbsp;T. (2015). Progress in written language bursts, pauses, transcription, and written composition across schooling, Scientific Studies of Reading, 19&amp;nbsp;(5), 374&amp;ndash;391. https://doi.org/10.1080/10888438.2015.1059838 (In English)</mixed-citation></ref><ref id="B3"><mixed-citation>Baaijen, V.&amp;nbsp;M. and Galbraith,&amp;nbsp;D. (2018). Discovery through writing: Relationships with writing processes and text quality, Cognition and Instruction, 36&amp;nbsp;(3), 199&amp;ndash;223. https://doi.org/10.1080/07370008.2018.1456431 (In English)</mixed-citation></ref><ref id="B4"><mixed-citation>Baaijen,&amp;nbsp;V.&amp;nbsp;M., Galbraith,&amp;nbsp;D. and de&amp;nbsp;Glopper,&amp;nbsp;K. (2012). Keystroke analysis: Reflections on procedures and measures, Written Communication, 29&amp;nbsp;(3), 246&amp;ndash;277. https://doi.org/10.1177/0741088312451108 (In English)</mixed-citation></ref><ref id="B5"><mixed-citation>Barkaoui,&amp;nbsp;K. (2019). What Can L2 Writers&amp;rsquo; Pausing Behavior Tell Us About Their L2 Writing Processes?, Studies in Second Language Acquisition, 41&amp;nbsp;(3), 529&amp;ndash;554. DOI:&amp;nbsp;10.1017/S027226311900010X (In English)</mixed-citation></ref><ref id="B6"><mixed-citation>Beauvais,&amp;nbsp;C., Olive,&amp;nbsp;T. and Passerault,&amp;nbsp;J.-M. (2011). Why are some texts good and others not? Relationship between text quality and management of the writing process, Journal of Educational Psychology, 103&amp;nbsp;(2), 415&amp;ndash;428. https://doi.org/10.1037/a0022545 (In English)</mixed-citation></ref><ref id="B7"><mixed-citation>Chukharev-Hudilainen,&amp;nbsp;E. (2014). Pauses in spontaneous written communication: A keystroke logging study, Journal of Writing Research, 6&amp;nbsp;(1), 61&amp;ndash;84. DOI: 10.17239/jowr-2014.06.01.3 (In English)</mixed-citation></ref><ref id="B8"><mixed-citation>Chukharev-Hudilainen,&amp;nbsp;E., Saricaoglu,&amp;nbsp;A., Torrance,&amp;nbsp;M. and Feng,&amp;nbsp;H.-H. (2019). Combined Deployable Keystroke Logging and Eyetracking for Investigating L2 Writing Fluency. Studies in Second Language Acquisition. 41(3), 583&amp;ndash;604. https://doi.org/10.1017/S027226311900007X (In English)</mixed-citation></ref><ref id="B9"><mixed-citation>Chukharev-Hudilainen,&amp;nbsp;E. (2011). Local Discourse Structure of Chat Dialogues: Evidence from Keystroke Logging, Proceedings of the 15th Workshop on the Semantics and Pragmatics of Dialogue &amp;ndash; Full Papers, Los Angeles, California, 104&amp;ndash;111. (In English)</mixed-citation></ref><ref id="B10"><mixed-citation>Cislaru,&amp;nbsp;G., Feltgen,&amp;nbsp;Q., Khoury,&amp;nbsp;E., Delorme,&amp;nbsp;R. and Bucci,&amp;nbsp;M.P. (2024). Language Processing Units Are Not Equivalent to Sentences: Evidence from Writing Tasks, Typical and Dyslexic Children, Languages, 9&amp;nbsp;(5), 155. https://doi.org/10.3390/languages9050155 (In English)</mixed-citation></ref><ref id="B11"><mixed-citation>Escorcia,&amp;nbsp;D., Passerault,&amp;nbsp;J.-M., Ros,&amp;nbsp;C. and Pylouster,&amp;nbsp;J. (2017). Profiling writers: Analysis of writing dynamics among college students, Metacognition and Learning, 12&amp;nbsp;(2), 233&amp;ndash;273. https://doi.org/10.1007/s11409-016-9166-6 (In English)</mixed-citation></ref><ref id="B12"><mixed-citation>Fraley,&amp;nbsp;C., Raftery,&amp;nbsp;A.&amp;nbsp;E., Scrucca,&amp;nbsp;L., Murphy,&amp;nbsp;T.&amp;nbsp;B. and Fop,&amp;nbsp;M. (2020). Package &amp;ldquo;mclust&amp;rdquo; Title Gaussian Mixture Modelling for Model-Based Clustering, Classification, and Density Estimation, available at: https://mclust-org.github.io/mclust/ (accessed on 05.11.2024) (In English)</mixed-citation></ref><ref id="B13"><mixed-citation>Galbraith,&amp;nbsp;D. and Baaijen,&amp;nbsp;V.&amp;nbsp;M. (2018). The work of writing: Raiding the inarticulate, Educational Psychologist, 53&amp;nbsp;(4), 238&amp;ndash;257. https://doi.org/10.1080/00461520.2018.1505515 (In English)</mixed-citation></ref><ref id="B14"><mixed-citation>Galbraith,&amp;nbsp;D. and Baaijen,&amp;nbsp;V.&amp;nbsp;M. (2019). Aligning keystrokes with cognitive processes in writing, in Lindgren,&amp;nbsp;E.&amp;nbsp;and Sullivan, K.&amp;nbsp;P.&amp;nbsp;H. (eds.) Observing Writing: Insights from Keystroke Logging and Handwriting, Brill, 38, 306&amp;ndash;325. https://doi.org/10.1163/9789004392526_015 (In English)</mixed-citation></ref><ref id="B15"><mixed-citation>Garc&amp;eacute;s-Manzanera,&amp;nbsp;A. (2024). Language bursts and text quality in digital writing by young EFL learners, Journal of New Approaches in Educational Research, 13. https://doi.org/10.1007/s44322-024-00012-x (In English)</mixed-citation></ref><ref id="B16"><mixed-citation>Guo,&amp;nbsp;H., Deane,&amp;nbsp;P.&amp;nbsp;D., van&amp;nbsp;Rijn,&amp;nbsp;P.&amp;nbsp;W., Zhang,&amp;nbsp;M. and Bennett,&amp;nbsp;R.&amp;nbsp;E. (2018). Modeling basic writing processes from keystroke logs, Journal of Educational Measurement, 55&amp;nbsp;(2), 194&amp;ndash;216. https://doi.org/10.1111/jedm.12172 (In English)</mixed-citation></ref><ref id="B17"><mixed-citation>Hall,&amp;nbsp;S., Baaijen,&amp;nbsp;V.&amp;nbsp;M. and Galbraith,&amp;nbsp;D. (2024). Constructing theoretically informed measures of pause duration in experimentally manipulated writing, Reading and Writing, 37, 329&amp;ndash;357. https://doi.org/10.1007/s11145-022-10284-4 (In English)</mixed-citation></ref><ref id="B18"><mixed-citation>Kass,&amp;nbsp;R.&amp;nbsp;E. and Raftery,&amp;nbsp;A.&amp;nbsp;E. (1995). Bayes factors, Journal of the American statistical association, 90&amp;nbsp;(430), 773&amp;ndash;795. (In English)</mixed-citation></ref><ref id="B19"><mixed-citation>Kibrik, A. A., Korotaev, N. A. and Podlesskaya, V. I. (2020). Russian spoken discourse: Local structure and prosody. In Izre&amp;rsquo;el,&amp;nbsp;S., Mello, H., Panunzi, A. and Raso,&amp;nbsp;T. (Eds.). In search of basic units of spoken language: A corpus-driven approach, John Benjamins, Amsterdam, 367&amp;ndash;382. DOI: 10.1075/scl.94.01kib (In English)</mixed-citation></ref><ref id="B20"><mixed-citation>Leijten,&amp;nbsp;M. and Van&amp;nbsp;Waes,&amp;nbsp;L. (2013). Keystroke logging in writing research: Using inputlog to analyze and visualize writing processes, Written Communication, 30&amp;nbsp;(3), 358&amp;ndash;392. https://doi.org/10.1177/0741088313491692 (In English)</mixed-citation></ref><ref id="B21"><mixed-citation>Leijten,&amp;nbsp;M. and Van&amp;nbsp;Waes,&amp;nbsp;L. (2006). Inputlog: New perspectives on the logging of online writing, in Sullivan,&amp;nbsp;K.&amp;nbsp;P.&amp;nbsp;H. and Lindgren,&amp;nbsp;E. (Eds.). Computer keystroke logging and writing: Methods and applications, Elsevier, Oxford, 73&amp;ndash;93. (In English)</mixed-citation></ref><ref id="B22"><mixed-citation>Limpo,&amp;nbsp;T. and Alves,&amp;nbsp;R.&amp;nbsp;A. (2017). Written language bursts mediate the relationship between transcription skills and writing performance, Written Communication, 34&amp;nbsp;(3), 306&amp;ndash;332. https://doi.org/10.1177/0741088317714234 (In English)</mixed-citation></ref><ref id="B23"><mixed-citation>Little,&amp;nbsp;D.&amp;nbsp;R., Oehmen,&amp;nbsp;R., Dunn,&amp;nbsp;J.&amp;nbsp;C., Hird,&amp;nbsp;K. and Kirsner,&amp;nbsp;K. (2012). Fluency Profiling System: An automated system for analyzing the temporal properties of speech,&amp;nbsp;Behavior Research Methods, 45, 191&amp;ndash;202. https://doi.org/10.3758/s13428-012-0222-0 (In English)</mixed-citation></ref><ref id="B24"><mixed-citation>McLachlan,&amp;nbsp;G.&amp;nbsp;J. and Peel,&amp;nbsp;D. (2000). Finite mixture models, Wiley, New York. (In English)</mixed-citation></ref><ref id="B25"><mixed-citation>Medimorec,&amp;nbsp;S. and Risko,&amp;nbsp;E.&amp;nbsp;F. (2017). Pauses in written composition: on the importance of where writers pause, Reading and Writing, 30, 1267&amp;ndash;1285. https://doi.org/10.1007/s11145-017-9723-7 (In English)</mixed-citation></ref><ref id="B26"><mixed-citation>Muth&amp;eacute;n,&amp;nbsp;B. and Asparouhov,&amp;nbsp;T. (2009). Multilevel regression mixture analysis, Journal of the Royal Statistical Society Series A, Royal Statistical Society, 172&amp;nbsp;(3), 639&amp;ndash;657. (In English)</mixed-citation></ref><ref id="B27"><mixed-citation>Mohsen,&amp;nbsp;M.&amp;nbsp;A. and Qassem,&amp;nbsp;M. (2020). Analyses of L2 Learners&amp;rsquo; Text Writing Strategy: Process-Oriented Perspective, Journal of Psycholinguistic Research, 49, 435&amp;ndash;451. https://doi.org/10.1007/s10936-020-09693-9 (In English)</mixed-citation></ref><ref id="B28"><mixed-citation>Roeser,&amp;nbsp;J., Torrance,&amp;nbsp;M. and Baguley,&amp;nbsp;T. (2019). Advance planning in written and spoken sentence production, Journal of Experimental Psychology: Learning, Memory, and Cognition, 45&amp;nbsp;(11), 1983&amp;ndash;2009. DOI: 10.1037/xlm0000685 &amp;nbsp;(In English)</mixed-citation></ref><ref id="B29"><mixed-citation>Spelman-Miller,&amp;nbsp;K. (2006). The pausological study of written language production, in Sullivan,&amp;nbsp;K.&amp;nbsp;P.&amp;nbsp;H. and Lindgren,&amp;nbsp;E. (eds.). Computer keystroke logging: Methods and applications, Elsevier, 11&amp;ndash;30. (In English)</mixed-citation></ref><ref id="B30"><mixed-citation>Usoof,&amp;nbsp;H.&amp;nbsp;A., Leblay,&amp;nbsp;C. and Caporossi,&amp;nbsp;G. (2020). GenoGraphiX-Log version 2.0 user guide. Les Cahiers du GERAD, available at: https://ggxlog.net/download/User%20Guide_20230130.pdf (accessed on 05/11/2024) (In English)</mixed-citation></ref><ref id="B31"><mixed-citation>Valenzuela,&amp;nbsp;&amp;Aacute;. and Castillo,&amp;nbsp;R.&amp;nbsp;D. (2023). The effect of communicative purpose and reading medium on pauses during different phases of the textualization process. Read Writ. 36, 881&amp;ndash;908. https://doi.org/10.1007/s11145-022-10309-y (In English)</mixed-citation></ref><ref id="B32"><mixed-citation>Van&amp;nbsp;Waes,&amp;nbsp;L., Leijten,&amp;nbsp;M., Roeser,&amp;nbsp;J., Olive,&amp;nbsp;T. and Grabowski,&amp;nbsp;J. (2021). Measuring and assessing typing skills in writing research, Journal of Writing Research, 13&amp;nbsp;(1), 107&amp;ndash;153. DOI: 10.17239/jowr-2021.13.01.04 (In English)</mixed-citation></ref><ref id="B33"><mixed-citation>Vandermeulen,&amp;nbsp;N., Lindgren,&amp;nbsp;E., Waldmann,&amp;nbsp;Ch. and Levlin,&amp;nbsp;M. (2024). Getting a grip on the writing process: (Effective) approaches to write argumentative and narrative texts in L1 and L2, Journal of Second Language Writing, 65. https://doi.org/10.1016/j.jslw.2024.101113 (In English)</mixed-citation></ref><ref id="B34"><mixed-citation>Wengelin,&amp;nbsp;&amp;Aring;. (2006). Examining Pauses in Writing: Theory, Methods and Empirical Data, in Computer Keystroke Logging and Writing: Methods and Applications, Brill, Leiden, The Netherlands, 107&amp;ndash;130. DOI: 10.1163/9780080460932_008 (In English)</mixed-citation></ref><ref id="B35"><mixed-citation>Zhang,&amp;nbsp;M., Bennett,&amp;nbsp;R.&amp;nbsp;E., Deane,&amp;nbsp;P. and van&amp;nbsp;Rijn,&amp;nbsp;P.&amp;nbsp;W. (2019). Are there gender differences in how students write their essays? An analysis of writing processes, Educational Measurement: Issues and Practice, 38&amp;nbsp;(2), 14&amp;ndash;26. https://doi.org/10.1111/emip.12249 In English)</mixed-citation></ref></ref-list></back></article>