OpenAI announces ChatGPT will soon ‘see, hear, and speak’

ChatGPT will soon offer new features that allow users to engage with it through images and voice recognition, according to an announcement from OpenAI on Sept. 25.

OpenAI announced that users will be able to interact with ChatGPT using voice commands, enabling a more personalized user experience. The company said that the feature is powered by a text-to-speech model that can generate audio from text and a short sample of speech, with the voices created by professional voice actors. It said that speech recognition is handled by Whisper, its open-source speech recognition system.

The voice features are expected to support a wider range of uses, such as reading bedtime stories, creating recipes, composing speeches, reciting poems, explaining common phrases, or even resolving “dinner table debates.”

OpenAI added that users will soon be able to provide images to ChatGPT (or select certain parts of images) for interpretation and response.

OpenAI acknowledges risks

OpenAI acknowledged the risk of fraud and impersonation and said that, accordingly, it is limiting voice features to its voice chat platform. It emphasized that it uses professional voice actors, not user voices, for output audio. OpenAI added that certain other groups are permitted to use voice capabilities for other purposes; Spotify, for example, is translating participating podcasts into new languages in each host’s original voice.

The company noted that image recognition carries privacy risks and said that, in response, it has limited ChatGPT’s ability to make statements about people. It noted that ChatGPT “is not always accurate” but said that general descriptions of images can be useful, citing its earlier work with Be My Eyes, an app for blind and low-vision people.

OpenAI said that it will introduce voice and image features to ChatGPT Plus and Enterprise users over the next two weeks. It said that voice features will be available on iOS and Android on an opt-in basis, and that image features will be available on all platforms.

The post OpenAI announces ChatGPT will soon ‘see, hear, and speak’ appeared first on CryptoSlate.

