xAI unveils vision feature for Grok's voice mode on iOS, rivaling ChatGPT Vision
xAI has introduced a new vision feature for Grok's voice mode on iOS, mirroring the functionality of ChatGPT Vision. This feature allows users to use their iPhone's camera to capture visual input, which Grok can then analyze and describe through voice responses. While full visual analysis is still in development, the current update emphasizes camera access and basic descriptive capabilities.
The vision feature is integrated with Grok's voice mode, which includes interaction styles such as “unhinged,” “romantic,” and “genius,” providing a personalized user experience. However, the voice mode does not support custom instructions, limiting user control over responses.
Currently being tested on iOS, users can experience the feature by updating to the latest Grok app version. No timeline has been announced for the complete rollout of the vision analysis capabilities or an Android update.


