Classifying AI-Generated Text in Low-Resource Languages like Arabic
   
Authors: Al Minshidawi, Ohood; Vahabie, Abdol-Hossein
Source: AUT Journal of Modeling and Simulation, 2025, Volume 57, Issue 1, pp. 113-124
Abstract: AI-generated texts (AIGTs) are written content produced by artificial intelligence systems using technologies such as natural language processing and machine learning. The rise of AIGT has introduced new challenges for content authenticity, trustworthiness, and information integrity across digital platforms. In low-resource languages such as Arabic, AIGT detection is especially challenging because of their more complex structural features. Accurately distinguishing AI-generated from human-written text is essential to combat misinformation, preserve credibility in communication, and improve content moderation systems. In this study, we propose a novel framework for AIGT detection on the AutoTweet dataset, an annotated corpus of Arabic tweets. To the best of our knowledge, this is the first work to leverage large language models (LLMs) for AIGT detection in Arabic, addressing a critical gap in low-resource natural language processing. We introduce a dynamic few-shot prompting technique, powered by a retrieval-based judge prompter module, which selects semantically and stylistically relevant support examples to enhance the contextual understanding of LLMs. We conduct a comprehensive evaluation across multiple LLMs, including Mistral-7B, Llama-3.1-8B, and ALLaM-7B-Instruct-preview, under zero-shot, few-shot, and fine-tuning scenarios. Our best results were achieved using Mistral-7B with QLoRA fine-tuning and dynamic few-shot prompting, reaching an accuracy of 88.69% and an F1-score of 88.35%. These findings demonstrate the feasibility of adapting LLMs for AIGT detection in Arabic and highlight the effectiveness of context-aware prompting in low-resource settings, paving the way for future progress in text classification.
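The dynamic few-shot idea in the abstract (retrieve the labeled examples most relevant to each input, then place them in the prompt) can be illustrated with a minimal sketch. This is not the authors' judge prompter module: the abstract does not say how similarity is computed, so character trigram cosine similarity is used here as a simple stand-in, and the function names, prompt wording, labels, and example tweets are all hypothetical.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Count overlapping character n-grams (a crude stand-in for an embedding)."""
    padded = f" {text.strip()} "
    return Counter(padded[i:i + n] for i in range(max(len(padded) - n + 1, 1)))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_support(query, pool, k=2):
    """Return the k labeled examples from the pool most similar to the query."""
    query_vec = char_ngrams(query)
    ranked = sorted(
        pool,
        key=lambda ex: cosine(query_vec, char_ngrams(ex["text"])),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, pool, k=2):
    """Assemble a few-shot classification prompt around the retrieved examples."""
    lines = ["Classify each tweet as 'human' or 'ai'.", ""]
    for ex in select_support(query, pool, k):
        lines += [f"Tweet: {ex['text']}", f"Label: {ex['label']}", ""]
    lines += [f"Tweet: {query}", "Label:"]
    return "\n".join(lines)


# Invented examples for illustration only; they are not taken from the dataset.
pool = [
    {"text": "الطقس جميل اليوم في بغداد", "label": "human"},
    {"text": "يعد الذكاء الاصطناعي من أهم التقنيات الحديثة", "label": "ai"},
    {"text": "مباراة رائعة الليلة", "label": "human"},
]
query = "الطقس جميل اليوم"
prompt = build_prompt(query, pool, k=2)
print(prompt)
```

Because the support examples are chosen per input rather than fixed in advance, each tweet is classified against the labeled examples closest to it. A sentence-embedding model could replace `char_ngrams` without changing the rest of the sketch.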
Keywords: Arabic text detection, AI-generated text, zero-shot learning, few-shot learning, supervised fine-tuning
Address: University of Tehran, College of Alborz, Computer Engineering Department, Iran; University of Tehran, College of Alborz, School of Electrical and Computer Engineering, College of Engineering, Computer Engineering Department, Iran
Email: h.vahabie@ut.ac.ir
 
     
   