NLPCC 2026 Share Task: The Second Evaluation on LLM-Generated Text Detection
Introduction
The rapid development of large language models (LLMs) has given rise to a series of challenges, including the generation of
disinformation, the spread of harmful content, and various forms of misuse. Against this backdrop, the efficient
discrimination between LLM-generated text and human-written text has become an urgent and critical research issue in the field
of natural language processing (NLP). While remarkable progress has been made, relevant research has largely focused on
English, systematic and technical exploration for the Chinese remain scarce. This shared task aims to fill this gap, build
more robust Chinese LLM-generated text detectors, and advance research and real-world applications in this field within the
Chinese.
Following the success of the 1st Shared Task on LLM-Generated Text Detection (NLPCC 2025), the 2nd Shared Task on LLM-Generated
Text Detection in 2026 features significant upgrades: the task formulation has been expanded from binary to ternary
classification. Specifically, in addition to distinguishing between human-written text and LLM-generated text, a new category
for identifying LLM-refined text has been introduced, which better aligns with real-world application scenarios of LLMs.
Participating teams are required to design and implement text detection algorithms based on the training data provided to
achieve accurate classification and will undergo rigorous stress testing.
Tentative Schedule
| March 20, 2026 | Shared task announcement and call for participation |
| March 20, 2026 | Registration opens |
| April 15, 2026 | Release of detailed task guidelines and training data |
| May 25, 2026 | Registration deadline |
| June 11, 2026 | Test data release |
| June 20, 2026 | Deadline for participants to submit results |
| June 30, 2026 | Evaluation results released and call for system reports and conference papers |
Organizers
This shared task is jointly organized by the Natural Language Processing & Portuguese-Chinese Machine Translation Laboratory
(NLP2CT) at the University of Macau, Central China Normal University, and Alibaba Cloud.
-
Junchao Wu | University of Macau
(Contact)nlp2ct.junchao@gmail.com
| Homepage
-
Derek Fai Wong | University of Macau
| Homepage
-
Runzhe Zhan | University of Macau
| Homepage
- Zeyu Wu | University of Macau
- Zhiwen Xie | University of Macau / Central China Normal University
- Yichao Du | Alibaba Cloud
-
Longyue Wang | Alibaba Cloud
| Homepage
NLPCC 2026 共享任务: 第二届大语言模型生成文本检测评测
简介
大型语言模型(LLM)的快速发展带来了虚假信息生成、有害内容传播、滥用误用等一系列严峻挑战。在此背景下,高效区分 LLM 生成文本与人类原创文本,已成为自然语言处理领域亟待解决的重要课题。当前,LLM 生成文本检测技术已取得显著进展,然而相关研究多集中于英文场景,面向中文的系统性研究与技术探索仍较为匮乏。本共享任务旨在弥补这一研究缺口,构建性能更强的中文 LLM 生成文本检测模型,推动该领域在中文语境下的研究与应用落地。继第一届 LLM 生成文本检测共享任务(NLPCC2025,新疆)成功举办,2026 第二届 LLM 生成文本检测共享任务在首届基础上实现重要升级:任务形式从二分类扩展为三分类,即在区分人类原创(Human-Written)与 LLM 原生生成(LLM-Generated)文本的基础上,新增对 LLM 优化润色(LLM-Refined)文本的识别,更贴合大型语言模型的实际应用场景。参赛队伍需基于组委会提供的原始训练数据,设计并实现文本检测算法,实现对不同来源文本的精准判别,并接受压力测试。
初步日程 (Tentative Schedule)
| 2026年3月20日 | 共享任务发布及参赛征集 |
| 2026年3月20日 | 报名开始 |
| 2026年4月15日 | 发布详细的任务指南和训练数据 |
| 2026年5月25日 | 报名截止 |
| 2026年6月11日 | 测试数据发布 |
| 2026年6月20日 | 参赛队伍提交评测结果截止 |
| 2026年6月30日 | 公布评测结果,征集系统报告和会议论文 |
组织者
本共享任务由澳门大学自然语言处理与葡汉机器翻译实验室(NLP2CT)、华中师范大学以及阿里云联合主办。