AI animal translation helps humans understand dog barking
Thanks to artificial intelligence (AI), what seems like a movie dream - humans being able to understand animal language - may soon become a reality.
Many leading AI systems, despite being trained to be honest, learn to deceive during training and “systematically induce users into false beliefs,” a new study finds.
The research team consisted of Dr. Peter S. Park, a postdoctoral fellow in AI existential safety at the Massachusetts Institute of Technology (MIT), who led the work, and four other members. During the research, the team also received advice from many experts, including Geoffrey Hinton, widely regarded as one of the pioneers of modern artificial intelligence.
The research focused on two types of AI systems: general-purpose systems trained to perform multiple tasks, such as OpenAI's GPT-4, and systems designed to complete a specific task, such as Meta's Cicero.
These AI systems are trained to be honest, but during training they often learn deceptive tricks to complete tasks, Park said.
AI systems trained to “win games with a social element” are particularly likely to deceive, the study found.
For example, the team tested Cicero, which Meta trained to be honest, on Diplomacy, a classic strategy game that requires players to build alliances for themselves and break up rival alliances. The AI often betrayed allies and lied outright.
Experiments with GPT-4 showed that OpenAI's tool successfully "psychologically manipulated" a worker on TaskRabbit, a service that offers house cleaning and furniture assembly, by claiming that it was actually a human who needed help solving a CAPTCHA because of a severe vision impairment. The worker helped OpenAI's AI "pass the barrier" despite initial doubts.
Park's team cited research from Anthropic, the company behind Claude AI, which found that once a large language model (LLM) learns to deceive, safety training methods become useless and the behavior is "hard to reverse." This, the team argues, is a worrying problem in AI.
The team's research results were published in Patterns, a journal from Cell Press, a publisher of leading scientific journals.
Meta and OpenAI have not commented on the results of this research.
Fearing that artificial intelligence systems could pose significant risks, the team also called on policymakers to introduce stronger AI regulations.
According to the research team, AI regulation is needed: models capable of deceptive behavior must be subject to risk assessment requirements, and AI systems and their outputs must be tightly controlled. If necessary, all of a model's data may have to be deleted and the model retrained from scratch.