Amazon Announces Nova Sonic Speech Model, Claims Performance Surpasses OpenAI and Google

Amazon today introduced Nova Sonic, a speech-to-speech model that lets developers build applications that converse in human-like voices in real time. Amazon claims the new speech model offers industry-leading price performance and low latency.

Typically, building a voice-enabled application requires developers to chain several models together:

  • A speech recognition model to convert audio to text.
  • A large language model (LLM) to understand the request and generate a response.
  • A text-to-speech model to convert the response back to audio.

This approach is not only complex but also often loses important acoustic context, such as tone, prosody, and speaking style.
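
The three-stage pipeline described above can be sketched roughly as follows. This is only an illustration: the stage functions (`recognize_speech`, `generate_reply`, `synthesize_speech`) and the `AudioClip` type are hypothetical stand-ins, not any vendor's API, and the "audio" is just encoded text so the example runs on its own. The point is that tone is dropped as soon as audio becomes plain text.

```python
from dataclasses import dataclass


@dataclass
class AudioClip:
    samples: bytes  # stand-in for raw audio (here: UTF-8 text, for illustration)
    tone: str       # acoustic context carried by the audio, e.g. "frustrated"


def recognize_speech(clip: AudioClip) -> str:
    """Hypothetical speech recognition stage: returns only the words."""
    return clip.samples.decode("utf-8")


def generate_reply(transcript: str) -> str:
    """Hypothetical LLM stage: sees text only, never the speaker's tone."""
    return f"You said: {transcript}"


def synthesize_speech(text: str) -> AudioClip:
    """Hypothetical text-to-speech stage: falls back to a default tone."""
    return AudioClip(samples=text.encode("utf-8"), tone="neutral")


def cascaded_pipeline(clip: AudioClip) -> AudioClip:
    # Each stage only receives the previous stage's output,
    # so clip.tone is lost at the first text boundary.
    transcript = recognize_speech(clip)
    reply = generate_reply(transcript)
    return synthesize_speech(reply)


if __name__ == "__main__":
    question = AudioClip(samples=b"where is my order", tone="frustrated")
    answer = cascaded_pipeline(question)
    print(answer.samples.decode("utf-8"), "| tone:", answer.tone)
```

In this sketch the caller's frustrated tone never reaches the later stages, which is the kind of gap a single speech-to-speech model is meant to close.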

Nova Sonic addresses this challenge by integrating speech understanding and generation into a single model. This unified approach helps the model capture the tone and style of the audio input, producing more natural dialogue. It also lets the model judge when to respond and handle interruptions (barge-ins) more gracefully.

Nova Sonic supports both male and female voices in a variety of English accents, including American and British. Developers can access the model through Amazon Bedrock using a bidirectional streaming API, which supports function calling. The model also includes built-in safeguards such as content moderation and watermarking.

In related news, OpenAI last month announced a new generation of speech-to-text models, gpt-4o-transcribe and gpt-4o-mini-transcribe, with significant improvements in word error rate, language recognition, and accuracy over the previous Whisper models.
