WhatsApp Integrates AI Voice Interaction Feature for Hands-Free Chatting

  • Evelyn Young
  • Aug 14, 2024

WhatsApp is reportedly developing a feature that lets users hold hands-free voice conversations with Meta AI, the chatbot integrated into the platform. Earlier reports suggested that WhatsApp planned to let users send voice notes to the AI, a one-way form of voice communication. The latest information indicates that the AI will also be able to respond by voice. The voice mode may include multiple voice options, although specifics about how they differ remain unclear.

As reported by WABetaInfo, a platform monitoring WhatsApp features, the voice mode functionality for Meta AI has been detected in the beta release of WhatsApp for Android, namely version 2.24.17.16. Similarly, this feature has also been found in the beta release for iOS, version 24.16.10.70.

The feature is not yet accessible in the current beta version of the app, likely due to ongoing development work by the company. Consequently, participants in the Google Play Beta program will not have the opportunity to test the Meta AI voice mode at this time. Screenshots shared by WABetaInfo showcase a new voice icon in the form of an audio waveform next to the text input area in the Meta AI chat interface. When tapped, it appears to bring up a bottom sheet labeled with the name of Meta AI, featuring a circular design formed by various bubbles in the center. Additionally, at the bottom of the sheet, the text “Hi, how can I help” is displayed alongside an audio waveform icon, indicating that the AI is ready to listen.

Further screenshots suggest that the voice mode may offer as many as ten different voices for users to select from. While details about the distinctions between these voices are not yet clear, there is a possibility that they could feature varied accents, energy levels, or tonal qualities. It seems unlikely that these voices would accommodate multiple languages.

Additionally, there is an option to enable captions and transcriptions, presumably through speech-to-text, which likely transcribes the entire conversation into text for the user's future reference. It remains uncertain when this feature will be made available to the general public.
