Voice Message Bot Integration

This guide explains how to integrate voice message functionality into your chat bot using OpenAI for transcription and response generation.

Workflow Overview

User uploads a voice/audio message in the chat
System detects the audio file upload and triggers voice processing workflow
Audio is transcribed to text using OpenAI's Whisper API
Transcribed text is sent to OpenAI's Chat Completion API for response
Bot responds to the user's voice message with text

Implementation Steps

Voice message triggers workflow

When an audio file is uploaded, the system automatically detects it:

Audio file upload detection

Important Trigger Configuration Note: In the trigger setup, you'll see two conditional triggers. For the first trigger that checks for audio files, the "If conditions are NOT met" field must be left empty. For the final trigger in the chain, you should select "default for unknown message" to handle all other message types.

The audio is transcribed and processed:

Text transcription result

Quick Setup

Download and import these pre-configured components:

REST API Configuration - Configure your OPENAI_API_KEY in both the "Transcription" and "ChatCompletion" API calls
Bot Configuration - When importing, select the REST API you just imported

Technical Requirements

OpenAI API access for Whisper (transcription) and GPT (completion)
Supported audio formats: mp3, wav, ogg, m4a (max 25MB)

Voice Message Bot Integration

Workflow Overview​

Implementation Steps​

Quick Setup​

Technical Requirements​

Workflow Overview

Implementation Steps

Quick Setup

Technical Requirements