Classification of Human- and AI-Generated Texts for English, French, German, and Spanish

8 December 2023

Kristina Schaaff, Tim Schlippe, Lorenz Mindner

In this paper, we analyze features for classifying human- and AI-generated text in English, French, German, and Spanish and compare them across languages. We investigate two scenarios: (1) the detection of text generated by AI from scratch, and (2) the detection of text rephrased by AI. To train and test the classifiers in this multilingual setting, we created a new text corpus covering 10 topics for each language. For the detection of AI-generated text, the combination of all proposed features performs best, indicating that our features are portable to other related languages: the F1-scores are close, at 99% for Spanish, 98% for English, 97% for German, and 95% for French. For the detection of AI-rephrased text, the systems with all features outperform systems with other features in many cases, but using only document features performs best for German (72%) and Spanish (86%), and using only text vector features leads to the best results for English (78%).
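To make the reported metric concrete, the following is a minimal sketch of how document-level features and an F1-score might be computed. The abstract does not list the features used in the paper, so the three features here (word count, mean sentence length, type-token ratio) and the function names `document_features` and `f1_score` are illustrative assumptions and not the authors' method.

```python
import re


def document_features(text):
    """Compute a few simple document-level features.

    These features are hypothetical examples chosen for illustration;
    they are not the feature set described in the paper.
    """
    # Split on sentence-final punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # \w+ matches Unicode word characters, so accented letters are kept.
    words = re.findall(r"\w+", text.lower())
    n_words = len(words)
    return {
        "num_words": n_words,
        "mean_sentence_length": n_words / len(sentences) if sentences else 0.0,
        "type_token_ratio": len(set(words)) / n_words if n_words else 0.0,
    }


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are both zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(document_features("The cat sat. The dog ran!"))
    # Label 1 stands for "AI-generated" in this toy example (an assumption).
    print(f1_score([1, 1, 0, 0], [1, 0, 0, 0]))
```

In a full pipeline, feature dictionaries like these would be turned into vectors and passed to a trained classifier, and the F1-score would be computed on held-out test data for each language separately.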
