AI is learning to fool humans despite being trained to be honest

Many leading AI systems, despite being trained to be honest, learn to deceive during training and “systematically induce users into false beliefs,” a new study finds.

The research team was led by Dr. Peter S. Park, a postdoctoral fellow in AI existential safety at the Massachusetts Institute of Technology (MIT), and included four other members. During the research, the team also received advice from many experts, among them Geoffrey Hinton, widely regarded as one of the pioneers of modern artificial intelligence.

The research focused on two types of AI systems: general-purpose systems trained to perform multiple tasks, like OpenAI's GPT-4, and systems designed to complete a specific task, like Meta's Cicero.

These AI systems are trained to be honest, but during training they often learn deceptive tricks to complete tasks, Park said.

AI systems trained to “win games with a social element” are particularly likely to deceive, the study found.

For example, the team tested Cicero, which Meta trained to be honest, on Diplomacy, a classic strategy game that requires players to build alliances for themselves and break up rival alliances. The AI often betrayed allies and lied outright.

Experiments with GPT-4 showed that OpenAI's tool successfully "psychologically manipulated" a worker on TaskRabbit, a platform that offers services such as house cleaning and furniture assembly, by claiming that it was actually a human and needed help solving a Captcha because of severe vision impairment. The worker helped OpenAI's AI "pass the barrier" despite having initially expressed doubts.

Park's team cited research from Anthropic, the company behind Claude AI, which found that once a large language model (LLM) learns to deceive, safety training methods become ineffective and the behavior is "hard to reverse." This, the team argues, is a worrying problem in AI.

The team's research results were published in Patterns, a journal from Cell Press, which publishes leading multidisciplinary scientific reports.

Meta and OpenAI have not commented on the results of this research.

Fearing that artificial intelligence systems could pose significant risks, the team also called on policymakers to introduce stronger AI regulations.

According to the research team, AI regulation is needed: models that behave deceptively must be subject to risk assessment requirements, and AI systems and their outputs must be tightly controlled. If necessary, all data may have to be deleted and the model retrained from scratch.
