Voice Message Bot Integration
This guide explains how to integrate voice message functionality into your chat bot using OpenAI for transcription and response generation.
Workflow Overview
- User uploads a voice/audio message in the chat
- System detects the audio file upload and triggers voice processing workflow
- Audio is transcribed to text using OpenAI's Whisper API
- Transcribed text is sent to OpenAI's Chat Completion API for response
- Bot responds to the user's voice message with text
Implementation Steps
- When an audio file is uploaded, the system automatically detects it:
- The audio is transcribed and processed:
Quick Setup
Download and import these pre-configured components:
- REST API Configuration - Configure your
OPENAI_API_KEY
in both the "Transcription" and "ChatCompletion" API calls - Bot Configuration - When importing, select the REST API you just imported
Technical Requirements
- OpenAI API access for Whisper (transcription) and GPT (completion)
- Supported audio formats: mp3, wav, ogg, m4a (max 25MB)