Breaking News

RAS Technology Holdings Limited (ASX:RTH) Insider, Gary Crispe, Unloads 1.5 Million Shares Live Stream of the 2024 PGA Championship: How to Watch on TV, Channel Details, Round 3 Featuring Scottie Scheffler, and Schedule Health inspection scores for Wichita Falls restaurants from May 6-11 The Global Dominance of American Stock Markets Over the Past 120 Years: A Comparison with Other Major Economies Bentley, Britain’s efficient performer, secures first place in debut race at Samarkand World Cup for World Triathlon

OpenAI has unveiled a new technology platform known as Voice Engine, which is set to revolutionize the field of speech synthesis. The tool allows for the creation of a synthetic voice from a brief 15-second audio sample of an individual, enabling the reading of texts in the original language or other languages. In order to assess its potential applications and necessary security measures, OpenAI has granted limited access to this technology, partnering with various companies across diverse sectors.

Some of these partners include Age of Learning, HeyGen, Dimagi, Livox, and Lifespan Health System. Through these collaborations, the practical uses for the technology have been explored, such as creating pre-recorded speech content and providing real-time personalized responses for students using GPT-4.

The development of Voice Engine was led by Jeff Harris from OpenAI’s product team and began in late 2022. The platform utilizes licensed and publicly available data to power the text-to-speech API’s pre-built voices and ChatGPT’s Read Aloud feature. Access to Voice Engine will be restricted to around ten developers at first, reflecting OpenAI’s caution in introducing this revolutionary technology.

The field of text-to-audio generation is rapidly evolving with companies like Podcastle and ElevenLabs leading the way with their innovative solutions. However, this growing interest is met with ethical and security concerns regarding the misuse of this technology. For instance, recent US Federal Communications Commission ban on automated calls using cloned AI voices without consent highlights these concerns.

To address these issues, OpenAI has implemented strict usage policies for its partners. These policies include prohibiting impersonation without consent, obtaining explicit informed consent from original speakers, committing not to allow users to create their own entries and including watermarks on all audio clips generated for traceability purposes.

Furthermore, OpenAI suggests preventative measures such as eliminating voice authentication for accessing bank accounts or implementing safeguards on people’s voices used in AI systems as well as increasing education on deepfake technology development.

In summary

Leave a Reply