<?xml version='1.0' encoding='utf-8'?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<article article-type="research-article" dtd-version="1.2" xml:lang="ru" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><front><journal-meta><journal-id journal-id-type="issn">2313-8912</journal-id><journal-title-group><journal-title>Research Result. Theoretical and Applied Linguistics</journal-title></journal-title-group><issn pub-type="epub">2313-8912</issn></journal-meta><article-meta><article-id pub-id-type="doi">10.18413/2313-8912-2024-10-2-0-2</article-id><article-id pub-id-type="publisher-id">3497</article-id><article-categories><subj-group subj-group-type="heading"><subject>APPLIED LINGUISTICS</subject></subj-group></article-categories><title-group><article-title>&lt;strong&gt;Machine translation in hindsight&lt;/strong&gt;</article-title><trans-title-group xml:lang="en"><trans-title>&lt;strong&gt;Machine translation in hindsight&lt;/strong&gt;</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Dyomochkina</surname><given-names>Viktoriya V.</given-names></name><name xml:lang="en"><surname>Dyomochkina</surname><given-names>Viktoriya V.</given-names></name></name-alternatives><email>dyomochkina.viktoria@yandex.ru</email><xref ref-type="aff" rid="aff1" /></contrib><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Gruzdev</surname><given-names>Dmitry Yu.</given-names></name><name xml:lang="en"><surname>Gruzdev</surname><given-names>Dmitry Yu.</given-names></name></name-alternatives><email>gru@inbox.ru</email><xref ref-type="aff" rid="aff1" /></contrib><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Lukyanova</surname><given-names>Elena V.</given-names></name><name xml:lang="en"><surname>Lukyanova</surname><given-names>Elena V.</given-names></name></name-alternatives><email>lukyanova@bsu.edu.ru</email><xref ref-type="aff" rid="aff2" /></contrib></contrib-group><aff id="aff2"><institution>Belgorod State National Research University</institution></aff><aff id="aff1"><institution>Military University, Russia</institution></aff><pub-date pub-type="epub"><year>2024</year></pub-date><volume>10</volume><issue>2</issue><fpage>0</fpage><lpage>0</lpage><self-uri content-type="pdf" xlink:href="/media/linguistics/2024/2/2024-02_июнь_Том_10_2-21-45.pdf" /><abstract xml:lang="ru"><p>The paper expands on the analysis of key projects adoring the machine translation (MT) hall of fame and their role in addressing practical tasks. The most successful initiatives suggest that the fledgling MT was contingent on the level of entropy, a.k.a. random nature of natural languages: the lower the indicator, the higher the predictability of the text, and by implication the efficiency of the system. It accounts for the success of the first Georgetown-IBM experiment and Canada&amp;rsquo;s METEO-1. The letter grew into a full-fledged system that for almost a quarter of the 20th century, provided English-French-English translations of weather bulletins, boasting high language predictability. Although, in between them the 1964 ALPAC report sowed a seed of doubt of the MT validity, it never aimed at killing the research area at all. On the contrary, it highlighted technics and applications, where the technology had demonstrated promising results, including raw MT, post-edited MT, and M-AT. The authors note a cyclic nature of the development of MT-powered methods and technologies. Today&amp;rsquo;s combination of resources and the way they are used are different very little from those employed in the past century. What makes them stand apart is the maturity of MT technologies, which made it through rule-based, direct, corpus-based, and knowledge-based translation to SMT and eventually to NMT. It has been established that the improved performance comes at a cost of more elaborate and larger data sets, tagged, marked up and annotated for automated use in language models. Taking advantage of these as well as artificial intelligence (AI), the authors venture into modeling basic text processing scenarios in a bilingual environment. This results in recommendations as to future paths for the improvement of MT technologies in the hands of professional translators by fine-tuning language models individually and pursuing post-editing (PEMT) and pre-editing practices paving the way for more complex transformations and lower equivalence levels.



</p></abstract><trans-abstract xml:lang="en"><p>The paper expands on the analysis of key projects adoring the machine translation (MT) hall of fame and their role in addressing practical tasks. The most successful initiatives suggest that the fledgling MT was contingent on the level of entropy, a.k.a. random nature of natural languages: the lower the indicator, the higher the predictability of the text, and by implication the efficiency of the system. It accounts for the success of the first Georgetown-IBM experiment and Canada&amp;rsquo;s METEO-1. The letter grew into a full-fledged system that for almost a quarter of the 20th century, provided English-French-English translations of weather bulletins, boasting high language predictability. Although, in between them the 1964 ALPAC report sowed a seed of doubt of the MT validity, it never aimed at killing the research area at all. On the contrary, it highlighted technics and applications, where the technology had demonstrated promising results, including raw MT, post-edited MT, and M-AT. The authors note a cyclic nature of the development of MT-powered methods and technologies. Today&amp;rsquo;s combination of resources and the way they are used are different very little from those employed in the past century. What makes them stand apart is the maturity of MT technologies, which made it through rule-based, direct, corpus-based, and knowledge-based translation to SMT and eventually to NMT. It has been established that the improved performance comes at a cost of more elaborate and larger data sets, tagged, marked up and annotated for automated use in language models. Taking advantage of these as well as artificial intelligence (AI), the authors venture into modeling basic text processing scenarios in a bilingual environment. This results in recommendations as to future paths for the improvement of MT technologies in the hands of professional translators by fine-tuning language models individually and pursuing post-editing (PEMT) and pre-editing practices paving the way for more complex transformations and lower equivalence levels.



</p></trans-abstract><kwd-group xml:lang="ru"><kwd>Machine translation</kwd><kwd>Tagging</kwd><kwd>Annotation</kwd><kwd>MT quality</kwd><kwd>Entropy</kwd><kwd>PEMT</kwd><kwd>CAT</kwd></kwd-group><kwd-group xml:lang="en"><kwd>Machine translation</kwd><kwd>Tagging</kwd><kwd>Annotation</kwd><kwd>MT quality</kwd><kwd>Entropy</kwd><kwd>PEMT</kwd><kwd>CAT</kwd></kwd-group></article-meta></front><back><ref-list><title>Список литературы</title><ref id="B1"><mixed-citation>Bharati,&amp;nbsp;A., Chaitanya,&amp;nbsp;V., Kulkarni,&amp;nbsp;A. and Sangal,&amp;nbsp;R. (2003). Anusaaraka: Machine Translation in Stages, ArXiv, cs.CL/0306130. (In&amp;nbsp;English)</mixed-citation></ref><ref id="B2"><mixed-citation>Brown,&amp;nbsp;P.&amp;nbsp;F., Della Pietra,&amp;nbsp;S.&amp;nbsp;A., Della Pietra,&amp;nbsp;V.&amp;nbsp;J. and Mercer,&amp;nbsp;R.&amp;nbsp;L. (1993). The Mathematics of Statistical Machine Translation: Parameter Estimation, Computational Linguistics, 19&amp;nbsp;(2), 263&amp;ndash;311. (In&amp;nbsp;English)</mixed-citation></ref><ref id="B3"><mixed-citation>Costa-jussa,&amp;nbsp;M., Escolano,&amp;nbsp;C. and Fonollosa,&amp;nbsp;J. (2017). Byte-based Neural Machine Translation, Proceedings of the First Workshop on Subword and Character Level Models in NLP, 154-158. DOI:&amp;nbsp;10.18653/v1/W17-4123 (In&amp;nbsp;English)</mixed-citation></ref><ref id="B4"><mixed-citation>Daems,&amp;nbsp;J., Vandepitte,&amp;nbsp;S., Hartsuiker,&amp;nbsp;R. and Macken,&amp;nbsp;L. (2017). Translation Methods and Experience: A Comparative Analysis of Human Translation and Post-editing with Students and Professional Translators, META, 62&amp;nbsp;(2), 245&amp;ndash;270. https://doi.org/10.7202/1041023ar (In&amp;nbsp;English)</mixed-citation></ref><ref id="B5"><mixed-citation>Fumani,&amp;nbsp;F.&amp;nbsp;Q. and Reza,&amp;nbsp;M. (2007). Ambiguity in Machine Translation, Ketabdari Va Etella&amp;#39;resaani, 9, 21-38. (In&amp;nbsp;English)</mixed-citation></ref><ref id="B6"><mixed-citation>Gashaw,&amp;nbsp;I. and Shashirekha,&amp;nbsp;H.&amp;nbsp;L. (2019). Amharic-Arabic Neural Machine Translation, Computer Science &amp;amp; Information Technology (CS &amp;amp; IT), 55-68. https://doi.org/10.5121/csit.2019.91606 (In&amp;nbsp;English)</mixed-citation></ref><ref id="B7"><mixed-citation>Gruzdev,&amp;nbsp;D.&amp;nbsp;Yu. and Kodzhebash,&amp;nbsp;D.&amp;nbsp;O. (2023). POS-powered queries for neat and lean concordances in ad-hoc corpora analysis, Teoreticheskaya i prikladnaya lingvistika [Theoretical and Applied Linguistics], 9&amp;nbsp;(4), 35‒48. https://doi.org/10.22250/24107190_2023_9_4_35 (In&amp;nbsp;English)</mixed-citation></ref><ref id="B8"><mixed-citation>Hurskainen,&amp;nbsp;A. (2018). Can machine translation assist in Bible translation?, SALAMA &amp;ndash; Swahili Language Manager: Technical reports on LT, 62, Helsinki. (In&amp;nbsp;English)</mixed-citation></ref><ref id="B9"><mixed-citation>Hutchins,&amp;nbsp;J. (2004a). Two precursors of machine translation: Artsrouni and Trojanskij, International Journal of Translation, 16&amp;nbsp;(1), 11-31. (In&amp;nbsp;English)</mixed-citation></ref><ref id="B10"><mixed-citation>Hutchins,&amp;nbsp;J. (2004b). The first public demonstration of machine translation: the Georgetown-IBM system, 7th January 1954. (In&amp;nbsp;English)</mixed-citation></ref><ref id="B11"><mixed-citation>Volkart,&amp;nbsp;L. and Bouillon,&amp;nbsp;P. (2022). Studying Post-Editese in a Professional Context: A Pilot Study, Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, Ghent, Belgium. European Association for Machine Translation, 71&amp;ndash;79 (In&amp;nbsp;English)</mixed-citation></ref><ref id="B12"><mixed-citation>L&amp;oacute;pez,&amp;nbsp;V.&amp;nbsp;F., Corchado,&amp;nbsp;J.&amp;nbsp;M., De&amp;nbsp;Paz,&amp;nbsp;J.&amp;nbsp;F., Rodr&amp;iacute;guez,&amp;nbsp;S. and Bajo,&amp;nbsp;J. (2010). A SomAgent statistical machine translation, Applied Soft Computing, 11&amp;nbsp;(2). https://doi.org/10.1016/j.asoc.2010.08.018 (In English)</mixed-citation></ref><ref id="B13"><mixed-citation>Nelyubin,&amp;nbsp;L.&amp;nbsp;L. (1975). Informacionno-statisticheskie i inzhenerno-lingvisticheskie osobennosti yazyka i teksta v usloviyah speckommunikacii [Information-statistical and engineering-linguistic features of language and text in conditions of special communication], D.&amp;nbsp;Sc.&amp;nbsp;Thesis, Applied linguistics, Leningrad. (In&amp;nbsp;Russian)</mixed-citation></ref><ref id="B14"><mixed-citation>Nelyubin,&amp;nbsp;L.&amp;nbsp;L. (1983). Translation and applied linguistics, Vysshaya Shkola Publishers, Moscow, USSR. (In&amp;nbsp;Russian)</mixed-citation></ref><ref id="B15"><mixed-citation>N&amp;eacute;v&amp;eacute;ol&amp;nbsp;A., Dogan&amp;nbsp;R. and lu&amp;nbsp;Zh. (2010). Semi-automatic semantic annotation of PubMed queries: A study on quality, efficiency, satisfaction, Journal of biomedical informatics, 44. 310-8. DOI:&amp;nbsp;10.1016/j.jbi.2010.11.001. (In English)</mixed-citation></ref><ref id="B16"><mixed-citation>Nirenburg,&amp;nbsp;S. (ed.) (1993). Progress in machine translation, IOS Press, Amsterdam, Netherlands. (In&amp;nbsp;English)</mixed-citation></ref><ref id="B17"><mixed-citation>O&amp;#39;Brien,&amp;nbsp;S. Balling,&amp;nbsp;L.&amp;nbsp;W., Carl,&amp;nbsp;M., Simard,&amp;nbsp;M. and Specia,&amp;nbsp;L. (eds.) (2014). Post-editing of Machine Translation: Processes and Applications, Cambridge Scholars Publishing. (In&amp;nbsp;English)</mixed-citation></ref><ref id="B18"><mixed-citation>Oladosu,&amp;nbsp;J., Esan,&amp;nbsp;A., Adeyanju,&amp;nbsp;I., Adegoke,&amp;nbsp;B., Olaniyan,&amp;nbsp;O. and Omodunbi,&amp;nbsp;B. (2016). Approaches to Machine Translation: A Review, FUOYE Journal of Engineering and Technology, 1&amp;nbsp;(1), 120&amp;ndash;126. https://doi.org/10.46792/fuoyejet.v1i1.26 (In&amp;nbsp;English)</mixed-citation></ref><ref id="B19"><mixed-citation>Ornstein,&amp;nbsp;J. (1955). Mechanical Translation: New Challenge to Communication, Science, 22&amp;nbsp;(3173), 745-748. DOI:&amp;nbsp;10.1126/science.122.3173.745 (In&amp;nbsp;English)</mixed-citation></ref><ref id="B20"><mixed-citation>Quach,&amp;nbsp;K. (2016). Google&amp;#39;s neural network learns to translate languages it hasn&amp;#39;t been trained on: First time machine translation has used true transfer learning, The&amp;nbsp;Register, available at: https://www.theregister.com/2016/11/17/googles_neural_net_translates_languages_not_trained_on/ (Accessed 10 June 2024). (In&amp;nbsp;English)</mixed-citation></ref><ref id="B21"><mixed-citation>Quintana,&amp;nbsp;R.&amp;nbsp;C. and Castilho,&amp;nbsp;S. (2022). A review of the Integration of Machine Translation in CAT tools, Proceedings of the International Conference &amp;ldquo;New Trends in Translation and Technology NeTT 2022&amp;rdquo;, Rhodes Island, Greece, 214&amp;ndash;221 (In&amp;nbsp;English)</mixed-citation></ref><ref id="B22"><mixed-citation>Tripathi,&amp;nbsp;S. and Kansal,&amp;nbsp;V. (2020). Machine Translation Evaluation: Unveiling the Role of Dense Sentence Vector Embedding for Morphologically Rich Language, International Journal of Pattern Recognition and Artificial Intelligence, 34&amp;nbsp;(1), 2059001. https://doi.org/10.1142/S0218001420590016 (In&amp;nbsp;English)</mixed-citation></ref><ref id="B23"><mixed-citation>Schuster,&amp;nbsp;M., Johnson,&amp;nbsp;M. and Thorat,&amp;nbsp;N. (2016). Zero-Shot Translation with Google&amp;rsquo;s Multilingual Neural Machine Translation System, AI Blog, available at: https://ai.googleblog.com/2016/11/zero-shot-translation-with-googles.html (Accessed 10 June 2024). (In&amp;nbsp;English)</mixed-citation></ref><ref id="B24"><mixed-citation>Schuster M. Zero-Shot Translation with Google&amp;rsquo;s Multilingual Neural Machine</mixed-citation></ref><ref id="B25"><mixed-citation>Suxia&amp;nbsp;Lei and You&amp;nbsp;Li. (2023). English Machine translation System Based on Neural Network Algorithm, Procedia Computer Science, 228, 409-420. https://doi.org/10.1016/j.procs.2023.11.047 (In&amp;nbsp;English)</mixed-citation></ref><ref id="B26"><mixed-citation>Tomasello,&amp;nbsp;L. (2020). Neural Machine Translation and Artificial Intelligence: What Is Left for the Human Translator?, Master&amp;rsquo;s degree theses, University of Padua, Italy. (In&amp;nbsp;English)</mixed-citation></ref><ref id="B27"><mixed-citation>Weaver,&amp;nbsp;W. (1949). The Mathematics of Communication, Scientific American, 181&amp;nbsp;(1), 11-15. (In&amp;nbsp;English)</mixed-citation></ref></ref-list></back></article>