Revolutionary ChatGPT Features: Voice and Vision Unleashed
Written on
Chapter 1: Introduction to New Features
OpenAI has made a significant advancement with the release of voice and vision capabilities in ChatGPT. This means users can now engage in voice conversations or share images for interactive discussions!
This update transforms ChatGPT into a more versatile assistant, allowing us to take pictures and ask questions about them, ultimately enhancing our interactions with the AI. Although these features will roll out to ChatGPT Plus subscribers over the next two weeks, let’s explore what they can do right away.
Section 1.1: Engaging in Voice Conversations
With the new voice feature, ChatGPT can function like a voice assistant, akin to Siri. Simply press a button to initiate a dialogue with it. You can request bedtime stories, inquire about various topics, or seek advice. The best part? You can keep the conversation flowing by asking follow-up questions after receiving a response.
If you are among the fortunate ChatGPT Plus users with this feature, here’s how to enable it in the mobile app:
- Go to Settings
- Select New Features
- Opt into voice conversation
- Tap the headphone icon below
OpenAI utilizes its Whisper speech recognition technology to convert your spoken input into text. While it excels at transcribing English, it may struggle with languages that utilize non-Roman scripts. Users can select from five distinct AI voices, which closely mimic human tones. My personal favorite is Cove, reminiscent of narrators from previous audiobooks I enjoyed.
Additionally, Spotify is collaborating with OpenAI to pilot a Voice Translation feature, enabling podcasters to translate their content into multiple languages, all while retaining their unique vocal characteristics. Remarkable, isn't it?
Section 1.2: Conversing About Images
ChatGPT is also equipped to analyze images, allowing you to discuss content presented in photographs. This functionality is beneficial for everyday challenges, such as troubleshooting a broken appliance, planning meals based on visual ingredient lists, or interpreting data from graphs.
To illustrate this, check out the following demonstration from OpenAI.
For those with access to these features, new icons will appear when using ChatGPT.
To engage, simply tap the photo icon to capture or upload an image. You can even draw on the image to provide guidance for your assistant!
Chapter 2: The Future of AI Interaction
These new capabilities are set to elevate DIY projects and everyday tasks significantly. As we embrace AI, remember to subscribe to my newsletter for more updates.
If you enjoyed this article and wish to support my writing, consider donating via PayPal or subscribing to my Substack for exclusive content at just $6 a month.
AI Girl | Diana Dovgopol | Substack
Explore Midjourney, ChatGPT, and other innovative AI tools. Discover hidden insights to simplify your life. Click to read more at AI...
theaigirl.substack.com