A small robot lures large robots to quit their jobs at a company.
A small robot, with just a few words, lured a group of robots to follow him.
Many top AIs, despite being trained to be honest, learn to deceive through training and “systematically induce users into false beliefs,” a new study finds.
The research team was led by Dr. Peter S. Park, a graduate student at the Massachusetts Institute of Technology (MIT) in the study of AI survival and safety, and four other members. During the research, the team also received advice from many experts, one of whom was Geoffrey Hinton, one of the founders of the field of artificial intelligence.
The research focused on two AI systems, general-purpose systems trained to perform multiple tasks, like OpenAI's GPT-4 ; and systems specifically designed to complete a specific task, like Meta's Cicero.
These AI systems are trained to be honest, but during training they often learn deceptive tricks to complete tasks, Mr. Park said.
AI systems trained to “win games with a social element” are particularly likely to deceive, the study found.
For example, the team tested Cicero, which Meta trained to be honest, on Diplomacy, a classic strategy game that requires players to build alliances for themselves and break up rival alliances. The AI often betrayed allies and lied outright.
Experiments with GPT-4 showed that OpenAI's tool successfully "psychologically manipulated" an employee of TaskRabbit, a company that provides house cleaning and furniture assembly services, by saying that it was actually a human and needed help to pass a Captcha code because of severe vision impairment. This employee helped OpenAI's AI "pass the barrier" despite previous doubts.
Park's team cited research from Anthropic, the company behind Claude AI, that found that once a large language model (LLM) learns to deceive, safe training methods become useless and "hard to reverse." This, the team argues, is a worrying problem in AI.
The team's research results were published in Cell Press - a collection of leading multidisciplinary scientific reports.
Meta and OpenAI have not commented on the results of this research.
Fearing that artificial intelligence systems could pose significant risks, the team also called on policymakers to introduce stronger AI regulations.
According to the research team, there needs to be AI regulation, models that behave fraudulently must comply with risk assessment requirements, and AI systems and their outputs must be tightly controlled. If necessary, all data may have to be deleted and retrained from scratch.
A small robot, with just a few words, lured a group of robots to follow him.
While AI will certainly be present in everyday life, some signs suggest we have reached the peak of the AI hype.
AI can help you compose emails in seconds, but that doesn't mean you should always use it. Some emails benefit from automation, while others require human intervention.
Anthropic, a well-known startup in the field of artificial intelligence, has conducted a new study that shows that when a generative AI has committed fraud, it is very difficult to adjust or retrain that model.
Can you really replace your laptop with your phone? Yes, but you'll need the right accessories to turn your phone into a laptop.
One important thing in the full event video was that the upcoming ChatGPT app feature was demoed but no real details were shared. That is, ChatGPT's ability to see everything that's happening on the user's device screen.
Many top AIs, despite being trained to be honest, learn to deceive through training and systematically induce users into false beliefs, a new study finds.
ChatGPT now has a question change option so users can edit the question or content they are exchanging with ChatGPT.
QR codes seem pretty harmless until you scan a bad one and get something nasty thrown at you. If you want to keep your phone and data safe, there are a few ways you can spot a fake QR code.
On stage at MWC 2025, Qualcomm made a splash when it introduced its eighth generation of 5G modem called the X85, which is expected to be used in flagship smartphones launching later this year.
You have a trendy “Ultramarine” iPhone 16, but one fine day you suddenly feel bored with that color; what will you do?
In January, Microsoft announced plans to bring NPU-optimized versions of the DeepSeek-R1 model directly to Copilot+ computers running on Qualcomm Snapdragon X processors.
The IF statement is a common logical function in Excel. The SWITCH statement is less well known, but you can use it instead of the IF statement in some situations.
Adding a spotlight behind your subject is a great way to separate your subject from the background. A spotlight can add depth to your portraits.
Outlook and other email services have limits on the size of email attachments. Here's how to increase the Outlook attachment size limit.
Despite its many competitors, Adobe Lightroom remains the best photo editing app. Yes, you have to pay to access it, but Lightroom's feature set makes it worth it.
Downloading videos from Youtube is now very simple, you do not need to go through complicated steps to be able to download Youtube videos to your computer.
Apple has released its own event management app called Invites. This app lets you create events, send invites, and manage RSVPs.
Here are all Heroes 3 codes, Heroes 3 cheats for all versions like Heroes 3 WoG cheat, Heroes 3 SoD, Heroes 3 of Might and Magic