Voice Revolution: OpenAI’s New AI Model Promises Emotive and Natural-Sounding Speech with Just a 15-Second Sample

4 min read

Happenings

Divisions

Programs

Happenings

Divisions

Programs

Real-life Black Mirror: OpenAI introduces an AI bot that can mimic any voice after hearing a 15-second clip

OpenAI has introduced a new artificial intelligence model, named Voice Engine. This model is able to replicate any voice after listening to a 15-second clip and guarantees to produce "natural-sounding speech" with expressive and life-like voices.

OpenAI, a well-known AI research institution, has recently revealed Voice Engine, a revolutionary voice imitation technology. This advanced system can mimic any speaker by analyzing a short 15-second audio clip.

The company that created ChatGPT has unveiled a new model that promises to deliver speech that sounds natural, along with realistic and emotive voices. This new model is an enhancement of OpenAI's existing text-to-speech API and has been under development since 2022. The firm has already incorporated a version of this toolkit to drive predefined voices in their current text-to-speech API and Read Aloud function, demonstrating examples on their official blog that closely mirror real voices.

OpenAI imagines a multitude of useful uses for Voice Engine, such as helping with reading, translating languages, and aiding people who have trouble speaking. However, they are also aware of the possible abuse of this technology.

The threat of deepfake manipulation is indeed quite significant, raising worries about privacy and ethical issues. Therefore, OpenAI insists that Voice Engine is not prepared for broad application, emphasizing the necessity to tackle serious privacy issues prior to initiating a comprehensive launch.

Recognizing the considerable dangers linked with this technology, especially in a year of elections, OpenAI emphasizes its dedication to seeking input from diverse contributors across government, media, entertainment, education, and civil society. All those who have agreed to test the preview have also committed to abide by OpenAI's usage guidelines, which forbid the imitation of individuals without their clear permission or legal approval.

Additionally, those who use the Voice Engine are required to inform their listeners that the voices they are hearing are created by artificial intelligence. OpenAI has added further protection methods, such as watermarking for identifying the source of the audio and active supervision of system use. When it's officially launched, there will be a list of prohibited voices to identify and stop the usage of AI-generated voices that sound similar to well-known individuals.

The introduction of OpenAI's Sora and Voice Engine coincides with a highly disputed election in the US. Understandably, there's a lot of worry among political experts and tech commentators about the potential abuse of these fresh AI tools.

When it comes to cost and accessibility, OpenAI has stayed quiet. However, possible pricing information implies that Voice Engine could be cheaper than other options in the market. Predictions suggest a price of $15 for every million characters, which is about 162,500 words, making Voice Engine an economical choice for creating audiobooks.

Furthermore, OpenAI subtly suggests a high-definition version that would be twice as expensive, though precise details about its features are still not revealed.

Alongside this declaration, OpenAI has established a substantial collaboration with Microsoft to create an AI-powered supercomputer named "Stargate," with estimated expenses amounting to $100 billion, according to The Information. These latest ventures highlight OpenAI's ongoing dedication to leading progress in artificial intelligence and cooperative innovation with top industry figures.

(Incorporating information from various sources)

Search for us on YouTube

Highlighted Shows

Relevant News

Microsoft and OpenAI collaborate on a $100 billion Stargate AI supercomputer initiative

Japan and the US plan to enhance their partnership in AI and semiconductor sectors

According to the former President of Google China, OpenAI could be the first private firm to reach a $1 trillion valuation

Amazon boosts its investment in Anthropic by $2.75 billion, bringing the total planned investment to $4 billion

Microsoft and OpenAI team up on a $100 billion Stargate AI supercomputer project

Japan and the US are set to deepen their cooperation in AI and semiconductors

The former head of Google China predicts OpenAI may become the first private organization to achieve a $1 trillion valuation

Amazon increases its contribution to Anthropic by an additional $2.75 billion, with the total investment estimated to reach $4 billion

Check us out on YouTube

Top-rated Programs

Associated Reports

Microsoft and OpenAI join forces for a $100 billion Stargate AI supercomputer initiative

Japan and the US aim to boost joint efforts in AI and semiconductors

Google China's former President suggests OpenAI could be the first private firm in the world to reach a $1 trillion valuation

Amazon contributes an additional $2.75 billion towards Anthropic, bringing the total planned investment to $4 billion

Microsoft and OpenAI are collaborating on the $100 billion Stargate AI supercomputer project

Japan and the US are planning to enhance their cooperation in AI and semiconductors

According to the former President of Google China, OpenAI might be the first private company globally to achieve a valuation of $1 trillion

Amazon puts in another $2.75 billion into Anthropic, raising the total planned investment to $4 billion

Available on YouTube.

Firstpost holds all rights and is protected by copyright © 2024.

You May Also Like

More From Author

+ There are no comments

Add yours