<?xml version='1.0' encoding='utf-8'?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<article article-type="research-article" dtd-version="1.2" xml:lang="ru" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><front><journal-meta><journal-id journal-id-type="issn">2313-8912</journal-id><journal-title-group><journal-title>Научный результат. Вопросы теоретической и прикладной лингвистики</journal-title></journal-title-group><issn pub-type="epub">2313-8912</issn></journal-meta><article-meta><article-id pub-id-type="doi">10.18413/2313-8912-2024-10-3-0-6</article-id><article-id pub-id-type="publisher-id">3545</article-id><article-categories><subj-group subj-group-type="heading"><subject>ПРИКЛАДНАЯ ЛИНГВИСТИКА</subject></subj-group></article-categories><title-group><article-title>ASAWEC: towards a corpus of Arab scholars’ academic written English</article-title><trans-title-group xml:lang="en"><trans-title>ASAWEC: towards a corpus of Arab scholars’ academic written English</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Sanosi</surname><given-names>Abdulaziz B</given-names></name><name xml:lang="en"><surname>Sanosi</surname><given-names>Abdulaziz B</given-names></name></name-alternatives><email>a.assanosi@psau.edu.sa</email><xref ref-type="aff" rid="aff1" /></contrib><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Mohammed</surname><given-names>Abuelgasim Sabah Elsaid</given-names></name><name xml:lang="en"><surname>Mohammed</surname><given-names>Abuelgasim Sabah Elsaid</given-names></name></name-alternatives><email>a.ibrahim@psau.edu.sa</email><xref ref-type="aff" rid="aff1" /></contrib></contrib-group><aff id="aff1"><institution>College of Science and Humanities, Prince Sattam bin Abdulaziz University, Hawtat Bani Tamim, Saudi Arabia</institution></aff><pub-date 
pub-type="epub"><year>2024</year></pub-date><volume>10</volume><issue>3</issue><fpage>0</fpage><lpage>0</lpage><self-uri content-type="pdf" xlink:href="/media/linguistics/2024/3/ВТиПЛ_2024_3_116-134.pdf" /><abstract xml:lang="ru"><p>Linguistic corpora have been widely used in recent years. Various types of linguistic analysis of both spoken and written discourse are being conducted using the corpus linguistics approach. Among these, academic writing has received considerable attention. Corpus linguistics has provided insights into the academic writing of both native and non-native English language learners and writers in general. Nevertheless, relatively few studies have investigated this topic in the Arab EFL setting. Consequently, there is a relative paucity of corpora of academic written English by Arab speakers. To address this gap, we compiled the Arab Scholars’ Academic Written English Corpus (ASAWEC), a specialized corpus of Arab scholars’ academic written English. We collected the corpus texts according to specific criteria and then normalized and cleaned the data. The texts were then tokenized and tagged, and the corpus underwent initial tests, which yielded insightful findings on Arab scholars’ academic written English, such as low lexical diversity and the use of various discourse techniques. The present paper introduces the corpus, provides details on its compilation, presents initial results and statistics, and discusses potential limitations and future perspectives for updating the corpus. It is envisaged that this project will encourage the use of the ASAWEC and help launch similar initiatives to advance research in Arab corpus linguistics.</p></abstract><trans-abstract xml:lang="en"><p>Linguistic corpora have been widely used in recent years. 
Various types of linguistic analysis of both spoken and written discourse are being conducted using the corpus linguistics approach. Among these, academic writing has received considerable attention. Corpus linguistics has provided insights into the academic writing of both native and non-native English language learners and writers in general. Nevertheless, relatively few studies have investigated this topic in the Arab EFL setting. Consequently, there is a relative paucity of corpora of academic written English by Arab speakers. To address this gap, we compiled the Arab Scholars’ Academic Written English Corpus (ASAWEC), a specialized corpus of Arab scholars’ academic written English. We collected the corpus texts according to specific criteria and then normalized and cleaned the data. The texts were then tokenized and tagged, and the corpus underwent initial tests, which yielded insightful findings on Arab scholars’ academic written English, such as low lexical diversity and the use of various discourse techniques. The present paper introduces the corpus, provides details on its compilation, presents initial results and statistics, and discusses potential limitations and future perspectives for updating the corpus. It is envisaged that this project will encourage the use of the ASAWEC and help launch similar initiatives to advance research in Arab corpus linguistics.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>Specialized corpus</kwd><kwd>Corpus compiling</kwd><kwd>Academic writing</kwd><kwd>Corpus linguistics</kwd><kwd>L2 writing</kwd></kwd-group><kwd-group xml:lang="en"><kwd>Specialized corpus</kwd><kwd>Corpus compiling</kwd><kwd>Academic writing</kwd><kwd>Corpus linguistics</kwd><kwd>L2 writing</kwd></kwd-group></article-meta></front><back /></article>