M. N. Kozhina’s Functional Stylistics of Scientific Text and Current Corpus-Based Studies in Detecting Artificially Generated Content

Authors

DOI:

https://doi.org/10.17072/2073-6681-2025-4-81-90

Keywords:

functional stylistics; scientific text; corpus-based studies; frequency; ChatGPT-4; artificially generated content.

Abstract

The paper highlights the significance of the scholarly work of the outstanding Russian scholar M. N. Kozhina, Professor at Perm State University, and her contribution to linguistics in general and to contemporary stylistics in particular. It traces the connections between current computer- and corpus-based statistical studies and those of the Perm school of stylistics, founded by M. N. Kozhina. It also presents recent corpus-based statistical results on detecting artificially generated texts and demonstrates the continuity and effectiveness of quantitative studies of language, speech, and communication, as well as their significance for the development of science in general. Statistical data show a steady increase in the use of the freely available artificial intelligence tool ChatGPT-4 in scientific publications, beginning almost immediately after its release on November 30, 2022. The largest and fastest growth in artificially generated content has been observed in computer science publications: up to 17.5%. This was established through a systematic large-scale statistical comparison of more than 950,900 papers published in English in leading scientific journals across various academic fields between January 2020 and February 2024, i.e. before and after the release of ChatGPT-4. In abstracts published in computer science journals over the last 14 years (2010–2024), four words show a disproportionately high frequency of use: realm, intricate, showcasing, and pivotal. Notably, a sudden surge in the use of these words occurred in 2023, about five months after ChatGPT-4 became freely available, whereas from 2010 to 2022 their use was consistently low. The paper outlines the challenges posed by the use of ChatGPT-4, especially in scientific communication, and the principles for addressing them.
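The word-frequency comparison described in the abstract can be illustrated with a minimal sketch. It assumes a corpus supplied as (year, abstract text) pairs and counts how often the four marker words occur per fixed number of tokens in each year. The function name `marker_rate_by_year`, the tokenizer, and the toy sentences are illustrative assumptions, not the procedure or data of the studies the paper reviews.

```python
import re
from collections import defaultdict

# The four marker words named in the abstract.
MARKERS = ("realm", "intricate", "showcasing", "pivotal")

# Simplistic tokenizer (an assumption): lowercase alphabetic words,
# optionally with an internal apostrophe.
TOKEN_RE = re.compile(r"[a-z]+(?:'[a-z]+)?")


def marker_rate_by_year(abstracts, markers=MARKERS, per=1_000_000):
    """Return {year: marker-word occurrences per `per` tokens}.

    `abstracts` is an iterable of (year, text) pairs.
    """
    marker_set = set(markers)
    hits = defaultdict(int)    # marker occurrences per year
    totals = defaultdict(int)  # all tokens per year
    for year, text in abstracts:
        tokens = TOKEN_RE.findall(text.lower())
        totals[year] += len(tokens)
        hits[year] += sum(1 for token in tokens if token in marker_set)
    # Skip years with no tokens to avoid division by zero.
    return {y: per * hits[y] / totals[y] for y in sorted(totals) if totals[y]}


# Toy data, invented purely for illustration.
toy_corpus = [
    (2021, "We propose a method for graph matching."),
    (2023, "We explore the intricate realm of graph matching, "
           "showcasing a pivotal method."),
]
rates = marker_rate_by_year(toy_corpus, per=1000)
for year, rate in rates.items():
    print(year, round(rate, 1))
```

Normalizing by the total token count per year keeps the comparison fair when the number of abstracts differs between years; a sudden rise in the rate for a given year is the kind of signal the abstract describes.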

Author Biography

Nadezhda K. Riabtseva, Institute of Linguistics of the Russian Academy of Sciences

Leading Researcher in the Department of Applied Linguistics

References

Arutyunova N. D. Stil’ Dostoevskogo v ramke russkoy kartiny mira [Dostoevsky’s style within Russian worldview]. Poetika. Stilistika. Yazyk i kul’tura. Pamyati Tat’yany Grigor’evny Vinokur [Poetics. Stylistics. Language and Culture. In Memory of Tatiana Grigorievna Vinokur]. Ed. by N. N. Rozanova. Moscow, Nauka Publ., 1996, pp. 61-90. (In Russ.)

Bazhenova E. A. Stilistika teksta M. N. Kozhinoy [Text stylistics of M. N. Kozhina]. Vektory razvitiya sovremennoy stilistiki. Stilistika i rechevedenie [Vectors of Development of Modern Stylistics. Stylistics and Speech Studies]. Ed. by E. P. Kuchina. Perm State University Press, 2013, pp. 74-81. (In Russ.)

Belyaeva T. R. Chastotnost’ i distributsiya edinits obshchenauchnoy (akademicheskoy) leksiki kak markery distsiplinarnoy prinadlezhnosti diskursa [Frequency and distribution of the units of general scientific (academic) lexicon as the markers of disciplinary affiliation of a discourse], Litera, 2021, issue 6, pp. 164-175. doi 10.25136/2409-8698.2021.6.35902. (In Russ.)

Gayda St. V chest’ Margarity Nikolaevny Kozhinoy. Panegirik [In honor of Margarita Nikolaevna Kozhina. Encomium]. Stereotipnost’ i tvorchestvo v tekste [Stereotypes and Creativity in Text: an interuniversity collection of scientific papers]. Ed. by M. P. Kotyurova. Perm, 2010, issue 14, pp. 6-11. (In Russ.)

Glinkina L. A. Chastotnost’ kak znachimyy registr leksikografii i frazeografii [Frequency as an important characteristic of lexicography and phraseography]. Problemy istorii, filologii, kul’tury [Journal of Historical, Philological and Cultural Studies], 2011, issue 3 (33), pp. 7-11. (In Russ.)

Danilevskaya N. V. Dinamika formirovaniya znaniya v nauchnom diskurse (Funktsional'no-stilisticheskiy aspekt) [Dynamics of knowledge formation in scientific discourse (Functional and stylistic aspect)]. Vestnik TGPU. Seriya: Gumanitarnye nauki (Filologiya) [Tomsk State Pedagogical University Bulletin. Series: Humanities (Philology)], 2005, issue 3 (47), pp. 14-17. (In Russ.)

Zasorina L. N. Chastotnyy slovar' russkogo yazyka [Frequency Dictionary of the Russian Language]. Ed. by L. N. Zasorina. Moscow, Russian Language Publ., 1977. 936 p. (In Russ.)

Kashchuk S. M. Teoriya i praktika integratsii GPT-podobnykh yazykovykh modeley v sistemu inoyazychnogo obrazovaniya [Theory and practice of integrating GPT-like language models into the foreign language education system]. Yazyk i deystvitel'nost'. Nauchnye chteniya na kafedre romanskikh yazykov im. V.G. Gaka [Language and Reality. Scientific Readings at the Department of Romance Languages named after V. G. Gak: Proceedings of the IX International Conference (Moscow, March 25–29, 2024)]. Moscow, Sputnik + Publ., 2024, vol. 9, pp. 382-386. (In Russ.)

Kozhina M. N. Diskursnyy analiz i funktsional'naya stilistika s rechevedcheskikh pozitsiy [Discourse analysis and functional stylistics from a speech studies perspective]. Tekst – Diskurs – Stil' [Text – Discourse – Style: a collection of scientific articles]. St. Petersburg, Nauka Publ., 2004, pp. 9-33. (In Russ.)

Kozhina M. N. Obshchaya kharakteristika tekstovykh kategoriy v funktsional'no-stilevom aspekte [General characteristics of text categories in the functional-stylistic aspect]. Ocherki istorii nauchnogo stilya russkogo literaturnogo yazyka XVIII–XX vv. [Essays on the History of Scientific Style of the Russian Literary Language in the 18th–20th Centuries]. Ed. by M. N. Kozhina. Perm, Perm State University Press, 1998, vol. II., pt. 2. Kategorii nauchnogo teksta: Funktsional'no-stilisticheskiy aspekt [Categories of scientific text: Functional and stylistic aspects], pp. 3-15. (In Russ.)

Kozhina M. N. O spetsifike khudozhestvennoy i nauchnoy rechi v aspekte funktsional'noy stilistiki [On the specifics of artistic and scientific speech in the aspect of functional stylistics]. Perm, Perm State University Press, 1966. 213 p. (In Russ.)

Kozhina M. N. Rechevedenie. Teoriya funktsional'noy stilistiki: Izbrannye trudy [Speech Studies. Theory of Functional Stylistics: Selected works]. 3rd stereot. ed. Moscow, Flinta Publ., 2020. 623 p. (In Russ.)

Kozhina M. N. Sootnoshenie stilistiki teksta so smezhnymi distsiplinami [Correlation of text stylistics with related disciplines]. Ocherki istorii nauchnogo stilya russkogo literaturnogo yazyka XVIII–XX vv. [Essays on the History of Scientific Style of the Russian Literary Language in the 18th–20th Centuries]. Ed. by M. N. Kozhina. Perm, Perm State University Press, 1996, vol. II, pt. 1. Stilistika nauchnogo teksta (obshchie parametry) [Stylistics of scientific text (general parameters)], pp. 11-29. (In Russ.)

Stilisticheskiy entsiklopedicheskiy slovar' russkogo yazyka [Stylistic Encyclopedic Dictionary of the Russian Language]. Ed. by M. N. Kozhina. Moscow, Flinta Publ., 2003. 696 p. (In Russ.)

Kozlovskaya N. V. GPT i CHAT-GPT: kak eto po-russki? (Ob elektronnykh slovarnykh stat'yakh v neologicheskom resurse neolex.iling.spb.ru) [GPT and CHAT-GPT: How is it in Russian? (On electronic dictionary entries in the neological resource neolex.iling.spb.ru)]. Terra Linguistica, 2023, vol. 14, issue 4, pp. 67-78. doi 10.18721/JHSS.14405. (In Russ.)

Riabtseva N. K. Kognitivnye issledovaniya diskursa i sovremennaya ʽkomp'yuternaya stilistikaʼ [Cognitive discourse studies and contemporary ʽcomputer stylisticsʼ]. Kognitivnye issledovaniya yazyka i diskursa [Cognitive Studies of Language and Discourse: Proceedings of the All-Russian scientific conference. Moscow, 30-31 October 2025]. Issue 3(64): Pt. 1. Ed. by O. K. Iriskhanova. Moscow, Tambov, 2025, pp. 112-119. (In Russ.)

Salimovskiy V. A. Vklad M. N. Kozhinoy v razvitie lingvisticheskoy stilistiki i stanovlenie rechevedeniya [The contribution of M. N. Kozhina to the development of linguistic stylistics and the formation of speech studies]. Vektory razvitiya sovremennoy stilistiki. Stilistika i rechevedenie [Vectors of Development of Modern Stylistics. Stylistics and Speech Studies]. Perm, Perm State University Press, 2013, pp. 7-32. (In Russ.)

Shaykevich A. Ya., Andryushchenko V. M., Rebetskaya N. A. Distributivno-statisticheskiy analiz yazyka russkoy prozy 1850–1870-kh gg. [Distributional and Statistical Analysis of the Language of Russian Prose of the 1850s–1870s]. Moscow, LRC Publishing House, 2016. 849 p. (In Russ.)

Shtayn K. E. Kul'turnoe dostoyanie Rossii: Permskaya nauchnaya shkola funktsional'noy stilistiki [Cultural heritage of Russia: Perm Scientific School of Functional Stylistics]. Stereotipnost' i tvorchestvo v tekste [Stereotypes and Creativity in Text: an interuniversity collection of scientific papers]. Ed. by M. P. Kotyurova. Perm, Perm State University Press, 2004, issue 7, pp. 6-57. (In Russ.)

Cantor M. Nearly 50 news websites are ‘AI-generated’, a study says. Would I be able to tell? 2023. Available at: https://www.theguardian.com/technology/2023/may/08/ai-generated-news-websites-study (accessed 27 June 2025). (In Eng.)

Divjak D. Frequency in Language: Memory, Attention and Learning. Cambridge, Cambridge University Press, 2019. 328 p. doi 10.1017/9781316084410. (In Eng.)

Kelly S. M. ChatGPT creator pulls AI detection tool due to ‘low rate of accuracy’. CNN Business. 2023, 25 July. Available at: https://www.cnn.com/2023/07/25/tech/openai-ai-detection-tool/index.html (accessed 28 June 2025). (In Eng.)

Liang W. et al. GPT detectors are biased against non-native English writers. arXiv:2304.02819v3 [cs.CL]. 2023a, 10 Jul. doi 10.48550/arXiv.2304.02819. (In Eng.)

Liang W. et al. Can large language models provide useful feedback on research papers? A large-scale empirical analysis. arXiv:2310.01783, 2023b. doi 10.48550/arXiv.2310.01783. (In Eng.)

Liang W. et al. Monitoring AI-modified content at scale: A case study on the impact of ChatGPT on AI conference peer reviews. International Conference on Machine Learning (ICML), 2024a. doi 10.48550/arXiv.2403.07183. (In Eng.)

Liang W. et al. Mapping the increasing use of LLMs in scientific papers. First Conference on Language Modeling COLM-2024. (Published as a conference paper at COLM-2024), 2024b, pp. 1-27. doi 10.48550/arXiv.2404.01268. (In Eng.)

Tracking AI-enabled misinformation: 713 ‘Unreliable AI-generated news’ websites (and counting), plus the top false narratives generated by artificial intelligence tools. NewsGuard, 2023. Available at: https://www.newsguardtech.com/special-reports/ai-tracking-center/ (accessed 28 June 2025). (In Eng.)

van de Poel K., Gasiorek J. Using AI to expand the ʽToolboxʼ for EAP writing instruction: Student experiences and perceptions of ChatGPT’s instructional potential. AILA Review, 2024. doi 10.1075/aila.24029.van. (In Eng.)

Walters W. H., Wilder E. I. Fabrication and errors in the bibliographic citations generated by ChatGPT. Scientific Reports, 2023, vol. 13, art. 14045. doi 10.1038/s41598-023-41032-5. (In Eng.)

Yang W., Chen J., Lin Y., Wen J. DeepCritic: Deliberate critique with large language models. arXiv:2505.00662v1 [cs.CL]. May 2025. Available at: https://arxiv.org/pdf/2505.00662 (accessed 28 June 2025). (In Eng.)

Published

2025-12-29

How to Cite

Riabtseva N. K. (2025). M. N. Kozhina’s Functional Stylistics of Scientific Text and Current Corpus-Based Studies in Detecting Artificially Generated Content. Perm University Herald. Russian and Foreign Philology, 17(4). https://doi.org/10.17072/2073-6681-2025-4-81-90

Issue

Section

LANGUAGE, CULTURE, AND SOCIETY