Audio Input
Record voice messages directly in the chat interface and have them transcribed to text automatically.
Use the microphone button in the chat input to record a voice message. agao transcribes your speech into a text message automatically, so you can talk to AI models instead of typing.
Prerequisites
Audio input is available only when two requirements are met: a transcription model is configured in your agao instance, and your user group has permission to use voice recording. When both are met, a microphone button appears in the chat input area.
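The two requirements above amount to a simple combined check. The following is a minimal sketch of that logic, not agao's actual implementation: the `InstanceConfig` and `UserGroup` shapes, the `audio_input` permission name, and the `isAudioInputAvailable` function are all hypothetical names chosen for illustration.

```typescript
// Hypothetical shapes: agao's real configuration and permission names may differ.
interface InstanceConfig {
  // Identifier of the configured speech-to-text model, or null when none is set.
  transcriptionModel: string | null;
}

interface UserGroup {
  permissions: string[];
}

// Assumed permission name, used only for this illustration.
const AUDIO_INPUT_PERMISSION = "audio_input";

// Returns true only when a transcription model is configured AND the
// user's group holds the voice recording permission.
function isAudioInputAvailable(config: InstanceConfig, group: UserGroup): boolean {
  const hasModel =
    config.transcriptionModel !== null && config.transcriptionModel.trim() !== "";
  const hasPermission = group.permissions.indexOf(AUDIO_INPUT_PERMISSION) !== -1;
  return hasModel && hasPermission;
}
```

Under these assumptions, the microphone button would be shown only when the function returns `true`. If you do not see the button, either requirement may be missing, so contact your agao administrator.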
Browser Permissions
When you first attempt to use audio input, your browser will request permission to access your microphone. You must grant this permission for the feature to function. This is a standard browser security measure that ensures websites can only access your microphone with your explicit consent.
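In browsers, a web page asks for microphone access by calling `navigator.mediaDevices.getUserMedia({ audio: true })`, and the returned promise is rejected with a named error if access cannot be granted. The sketch below shows one way such error names could be turned into guidance for the user. The `describeMicrophoneError` function and its message texts are illustrative assumptions, not agao's actual wording or behavior.

```typescript
// Outcomes this sketch distinguishes. The labels are hypothetical.
type MicrophoneProblem = "permission-denied" | "no-microphone" | "unknown";

// Maps the `name` of the error raised by getUserMedia to a coarse category.
// "NotAllowedError" and "NotFoundError" are standard error names for this call.
function classifyMicrophoneError(errorName: string): MicrophoneProblem {
  switch (errorName) {
    case "NotAllowedError":
      return "permission-denied"; // the user or a browser policy blocked access
    case "NotFoundError":
      return "no-microphone"; // no audio input device is available
    default:
      return "unknown";
  }
}

// Illustrative help texts only.
function describeMicrophoneError(errorName: string): string {
  switch (classifyMicrophoneError(errorName)) {
    case "permission-denied":
      return "Microphone access was blocked. Allow it in your browser's site settings and try again.";
    case "no-microphone":
      return "No microphone was found. Connect one and try again.";
    default:
      return "The microphone could not be started.";
  }
}
```

In a browser, a page would typically call `describeMicrophoneError` with the error's `name` in the rejection handler of the `getUserMedia` call. If you previously denied access, you usually need to change the permission in your browser's site settings before the prompt appears again.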
Recording Process
Click the microphone button in the chat input field to start recording. Speak naturally while agao captures your audio in real time. When you have finished your message, click the stop button to end the recording.
Transcription and Delivery
Once you stop recording, agao automatically transcribes your audio using the configured speech-to-text model. The transcribed text appears as a regular chat message and is sent to the AI model for processing. You interact by voice, while the conversation itself stays in the standard text format.
The transcription process typically takes just a few seconds, and the resulting message integrates naturally into your conversation flow. Audio input messages are treated identically to typed messages in terms of AI processing and response generation.
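The flow from recording to delivery can be pictured as a small state machine: idle, recording, transcribing, and back to idle once the message is sent. The sketch below models only that sequence. The state names, event names, and the `step` function are hypothetical and are not taken from agao's code.

```typescript
// Hypothetical states of the chat input during voice entry.
type InputState = "idle" | "recording" | "transcribing";

// Hypothetical events: two button clicks and the arrival of the transcript.
type InputEvent =
  | { type: "micClicked" }
  | { type: "stopClicked" }
  | { type: "transcribed"; text: string };

interface StepResult {
  state: InputState;
  // Text to send as a regular chat message, or null when nothing is sent yet.
  messageToSend: string | null;
}

// Pure transition function: returns the next state for a given event.
// Events that do not apply in the current state leave it unchanged.
function step(state: InputState, event: InputEvent): StepResult {
  if (state === "idle" && event.type === "micClicked") {
    return { state: "recording", messageToSend: null };
  }
  if (state === "recording" && event.type === "stopClicked") {
    return { state: "transcribing", messageToSend: null };
  }
  if (state === "transcribing" && event.type === "transcribed") {
    // The transcript is handed on exactly like typed text.
    return { state: "idle", messageToSend: event.text };
  }
  return { state, messageToSend: null };
}
```

In this model a message is produced only after the transcript arrives, which matches the order described above: stop the recording first, then wait for the transcription to appear as a message.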