AI vs AI: Scientists Develop Neural Networks to Detect Generated Text Insertions
A research team, including Alexander Shirnin from HSE University, has developed two systems designed to detect AI-generated insertions in scientific texts. The AIpom system combines two types of models: a decoder and an encoder. The Papilusion system detects text that has been modified with synonyms or summarised by neural networks, and it relies on a single type of model: encoders. In the future, these systems will assist in verifying the originality and credibility of scientific publications. Articles describing the Papilusion and AIpom systems have been published in the ACL Anthology Digital Archive.
As language models like ChatGPT and GigaChat become more popular and widely used, it becomes increasingly challenging to distinguish original human-written text from AI-generated content. Artificial intelligence is already being used to write scientific publications and graduation papers. Therefore, it is crucial to develop tools capable of identifying AI-generated insertions in texts. A research team, including scientists from HSE University, presented their solutions at the SemEval 2024 and DAGPap24 international scientific competitions.
The AIpom model was used to identify the boundaries between original and generated fragments in scientific papers. The ratio of machine-generated text to the author's text varied from paper to paper. To train the models, the organisers provided texts on the same topic. However, during the verification stage, the topics changed, making the task more challenging.
'Models perform well on familiar topics, but their performance declines when presented with new topics,' says Alexander Shirnin, co-author of the paper and Research Assistant at the Laboratory for Models and Methods of Computational Pragmatics, HSE Faculty of Computer Science. 'It's like a student who, having learned how to solve one type of problem, struggles to solve a problem on an unfamiliar topic or from a different subject as easily or accurately.'
To improve the system's performance, the researchers combined two models: a decoder and an encoder. In the first stage, a decoder neural network received an instruction together with the source text and output the text fragment it presumed to be AI-generated. Next, the point in the original text where the model predicted the generated fragment to begin was marked with a special <BREAK> token. The encoder then processed the text marked up in the first stage and refined the decoder's predictions. To do this, it classified each token (the smallest unit of text, such as a word or part of a word) as either written by a human or generated by AI. This approach improved accuracy compared to systems that used only one type of model: AIpom ranked second at the SemEval-2024 competition.
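The marking step between the two stages can be illustrated with a minimal sketch. It is not the AIpom code: the function names, the whitespace tokenisation, and the "human"/"ai" labels are assumptions made for illustration, and the trained decoder and encoder are not reproduced. The sketch only shows how a predicted fragment could be turned into a <BREAK> marker and into the initial per-token labels that an encoder would then refine.

```python
BREAK = "<BREAK>"


def mark_boundary(source_text: str, predicted_fragment: str) -> str:
    """Insert the BREAK marker where the predicted fragment starts.

    Hypothetical helper: `predicted_fragment` stands in for the decoder's
    output. If it cannot be located, the text is returned unmarked.
    """
    fragment = predicted_fragment.strip()
    start = source_text.find(fragment) if fragment else -1
    if start == -1:
        return source_text
    before, after = source_text[:start], source_text[start:]
    return f"{before.rstrip()} {BREAK} {after}".strip()


def baseline_labels(marked_text: str) -> list[tuple[str, str]]:
    """Label whitespace-separated words using only the marker position.

    Assumption: everything after the marker is treated as generated. A real
    encoder would refine these labels token by token.
    """
    labels = []
    seen_break = False
    for word in marked_text.split():
        if word == BREAK:
            seen_break = True
            continue
        labels.append((word, "ai" if seen_break else "human"))
    return labels


text = "We measured the samples twice. In addition, the results show a robust trend."
fragment = "In addition, the results show a robust trend."
marked = mark_boundary(text, fragment)
labels = baseline_labels(marked)
```

In this toy example the marker lands before "In addition", so the first five words are labelled as human-written and the rest as generated.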
The Papilusion model also distinguished between human-written and generated text. Using Papilusion, sections of the text were classified into four categories: written by a human, modified with synonyms, generated, or summarised by a model. The task was to accurately identify each category. The number of categories and the length of insertions in the texts varied.
In this case, the developers used three models, all of the same type: encoders. They were trained to predict one of the four categories for each token in the text, with each model trained independently of the others. When a model made an error, a penalty was applied, and the model was then retrained with its lower layers frozen, so that only the upper layers were updated.
'Each model has a different number of layers, depending on its architecture. When training a model, we can leave the first ten or so layers unchanged and adjust only the parameters in the last two layers. This is done to prevent losing important data embedded in the first layers during training,' explains Alexander Shirnin. 'It can be compared to an athlete who makes an error in the movement of their hand. We only need to explain this part to them, rather than resetting their entire learning and retraining them, as they might forget how to move correctly overall. The same logic applies here. The method is not universal and may not work with all models, but in our case, it was effective.'
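The idea of keeping the lower layers fixed can be sketched as a rule that decides, per parameter, whether it is updated. This is a hedged illustration rather than the Papilusion training code: the parameter names follow an `encoder.layer.N.` pattern used by some BERT-style implementations, and the 12-layer model, the `classifier.` prefix, and the choice to keep the embeddings frozen are all assumptions.

```python
import re

# Assumed naming scheme for transformer blocks: "encoder.layer.<index>."
LAYER_PATTERN = re.compile(r"encoder\.layer\.(\d+)\.")


def is_trainable(param_name: str, num_layers: int, trainable_top: int) -> bool:
    """Return True if the parameter should be updated during fine-tuning.

    Only the last `trainable_top` encoder layers and the (assumed)
    classification head stay trainable; everything else is frozen.
    """
    match = LAYER_PATTERN.search(param_name)
    if match is None:
        # Not part of an encoder block: embeddings stay frozen,
        # the hypothetical "classifier." head is trained.
        return param_name.startswith("classifier.")
    layer_index = int(match.group(1))
    return layer_index >= num_layers - trainable_top


param_names = [
    "embeddings.word_embeddings.weight",
    "encoder.layer.0.attention.self.query.weight",
    "encoder.layer.9.output.dense.weight",
    "encoder.layer.10.attention.self.query.weight",
    "encoder.layer.11.output.dense.weight",
    "classifier.weight",
]
# Example: a 12-layer encoder where only the last two layers are adjusted.
plan = {name: is_trainable(name, num_layers=12, trainable_top=2) for name in param_names}
```

In a framework such as PyTorch, a plan like this would typically be applied by setting `requires_grad` to `False` on each frozen parameter before training.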
The three encoders independently determined the category for each token. The system's final prediction for a token was the category that received the most points across the three models. Papilusion ranked sixth out of 30 in the competition.
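One simple way to combine three independent predictions is a per-token majority vote, sketched below. The article does not specify how points were awarded, so this assumes one point per model per token; the category names and the tie-breaking rule (the earliest model's choice wins) are also assumptions.

```python
from collections import Counter

# Assumed label names for the four categories described in the article.
CATEGORIES = ("human", "synonym", "generated", "summarised")


def majority_vote(predictions: list[list[str]]) -> list[str]:
    """Combine per-token labels from several models, one point per model.

    `predictions[i][j]` is the label model i assigned to token j.
    """
    if not predictions:
        return []
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all models must label the same number of tokens")
    final = []
    for token_votes in zip(*predictions):
        counts = Counter(token_votes)
        # most_common keeps first-seen order among equal counts,
        # so a three-way tie goes to the first model's label.
        final.append(counts.most_common(1)[0][0])
    return final


model_a = ["human", "human", "generated", "synonym"]
model_b = ["human", "generated", "generated", "summarised"]
model_c = ["human", "generated", "summarised", "human"]
combined = majority_vote([model_a, model_b, model_c])
```

For the fourth token all three models disagree, so the result falls back to the first model's label; a weighted scheme based on model confidence would be an alternative design.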
According to the researchers, current AI detection models perform reasonably well but still have limitations. Primarily, they struggle to process data beyond what they were trained on, and overall, there is a lack of diverse data to train the models effectively.
'To obtain more data, we need to focus on collecting it. Both companies and laboratories have been doing this. Specifically for this type of task, it is necessary to collect datasets that include texts modified using multiple AI models and modification methods,' the researcher comments. 'Instead of continuing a text using just one model, more realistic scenarios should be created, such as asking the model to add to the text, rewrite the beginning for better coherence, remove parts of it, or generate a portion of the text in a new style using a different prompt. Of course, it is also important to collect data in different languages and on a variety of topics.'