ChatGPT gains multimodal capabilities to better assist users

Share post:

ChatGPT has gained multimodal capabilities, allowing it to receive and respond to image and voice inputs. This new feature will make ChatGPT even more helpful in a variety of tasks, such as solving math problems, identifying objects, and providing recipes.

To use the image input feature, users simply need to snap a picture of what they are looking at and add the question they’d like an answer to. ChatGPT will then analyze the image and provide a response. For example, users could use this feature to identify the name of a plant, look up the nutritional information of a food item, or get help solving a math problem.

The voice input and output feature gives ChatGPT the same functionality as a voice assistant. Users can now ask ChatGPT to perform tasks or answer questions simply by speaking. ChatGPT will then process the request and respond verbally.

The sources for this piece include an article in ZDNET.

Featured Tech Jobs


Related articles

Robot startup uses ChatGPT to enhance its communications and reasoning skills

Humanoid robot startup Figure has secured a significant $675 million investment from a group of high-profile investors, including...

Lawsuit requires Pegasus spyware to provide code used to spy on WhatsApp users

NSO Group, the developer behind the sophisticated Pegasus spyware, has been ordered by a US court to provide...

OpenAI claims New York Times manipulated ChatGPT “fabricate data”

OpenAI has challenged the New York Times' copyright lawsuit, asserting the newspaper manipulated ChatGPT to fabricate evidence. The...

Companies experiment with four day work week enabled by AI

The dream of a four-day workweek is becoming a reality for some, thanks to AI's integration into the...

Become a member

New, Relevant Tech Stories. Our article selection is done by industry professionals. Our writers summarize them to give you the key takeaways