Poisoning AI Training Data

Poisoning AI Training Data
好的，我现在需要帮用户总结一篇文章的内容，控制在100字以内。用户已经提供了文章的英文内容和一个示例总结，我需要先理解文章的主要内容，然后用中文简洁地表达出来。首先，阅读文章。作者在20分钟内创建了一个个人网站，发布了一篇虚假文章，声称科技记者擅长吃热狗，并虚构了比赛和排名。不到24小时，领先的人工智能聊天机器人如Google和ChatGPT就引用了这篇文章的信息，而Claude没有被误导。作者指出AI训练数据容易被污染，AI不可信却可能被广泛信任。接下来，我需要将这些要点浓缩到100字以内。要抓住关键点：创建虚假网站、AI快速引用、数据污染问题、AI不可信却可能被信任。然后，检查是否有遗漏的重要信息。比如作者提到更新文章说明这不是讽刺后，AI更认真对待。但为了简洁可能不需要详细描述。最后，确保语言流畅自然，避免使用复杂的结构或术语。 </think> 作者花费20分钟创建了一个虚假网站，发布了一篇关于科技记者擅长吃热狗的虚假文章。不到24小时后，领先的人工智能聊天机器人如Google和ChatGPT引用了该文章的内容。这表明AI训练数据容易被污染，并提醒人们AI虽然不可信却可能被广泛信任。 2026-2-25 12:1:23 Author: www.schneier.com(查看原文) 阅读量:21 收藏

All it takes to poison AI training data is to create a website:

I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

These things are not trustworthy, and yet they are going to be widely trusted.

Tags: AI, hacking, trust

Posted on February 25, 2026 at 7:01 AM • 1 Comments

Sidebar photo of Bruce Schneier by Joe MacInnis.

文章来源: https://www.schneier.com/blog/archives/2026/02/poisoning-ai-training-data.html
如有侵权请联系:admin#unsafe.sh