OpenAI has added speech recognition, picture recognition and text voicing to ChatGPT

By: Bohdan Kaminskyi | 25.09.2023, 17:30

OpenAI

OpenAI has announced a major update to the ChatGPT chatbot, which searches through images, speech recognition and text dubbing.

Here's What We Know

Speech recognition allows you to ask a question to a chatbot using your voice. For this purpose, ChatGPT uses Whisper, an open source model that OpenAI developed.

The text-to-speech conversion is the responsibility of the new model, which the company says can generate a "human-like voice" from just a few seconds of speech samples. There are currently five voice variants available to choose from.

Image search allows you to take a picture of an item of interest and send it to ChatGPT. The chatbot will try to understand the request and respond accordingly.

Use your voice to engage in a back-and-forth conversation with ChatGPT. Speak with it on the go, request a bedtime story, or settle a dinner table debate.

Sound on ???? pic.twitter.com/3tuWzX0wtS
- OpenAI (@OpenAI) September 25, 2023

You can also use a drawing tool within the app to point to a specific part of a picture.

OpenAI recognises the potential risks of the new features. The company does not allow ChatGPT to process people's photos or answer questions about them.

The new features will be available to ChatGPT Plus Enterprise subscribers within two weeks. Later, the company will open up access to them to anyone who wants to use them.

Source: OpenAI

Artificial Intelligence