Amanvir Parhar b0acd79ed9 Create LICENSE		2025-01-15 17:49:53 -08:00

Weebo

A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.

Works on Apple Silicon.

Features

Continuous speech recognition using Whisper MLX
Natural language responses via Llama 3.2 (run through Ollama)
Real-time text-to-speech synthesis with Kokoro-82M
Support for different voices
Streaming response generation
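
To illustrate how streaming generation and real-time synthesis can fit together, here is a minimal sketch that cuts a stream of text chunks into whole sentences, so each sentence could be handed to the TTS model while the rest of the reply is still being generated. The function name `sentences_from_stream` and the punctuation-based splitting rule are assumptions made for this example; they are not taken from `main.py`.

```python
import re
from typing import Iterable, Iterator

# Assumed boundary rule: '.', '!' or '?' followed by whitespace ends a sentence.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def sentences_from_stream(chunks: Iterable[str]) -> Iterator[str]:
    """Yield complete sentences as soon as they appear in a stream of text chunks.

    Hypothetical helper for illustration only.
    """
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        parts = _SENTENCE_END.split(buffer)
        # Every part except the last one is a finished sentence.
        for sentence in parts[:-1]:
            if sentence.strip():
                yield sentence.strip()
        # The last part may still be growing, so keep it in the buffer.
        buffer = parts[-1]
    # Flush whatever remains once the stream ends.
    if buffer.strip():
        yield buffer.strip()


if __name__ == "__main__":
    fake_stream = ["Hel", "lo there. How ", "are you? I'm", " fine"]
    for sentence in sentences_from_stream(fake_stream):
        print(sentence)
```

In a real pipeline the chunks would come from the language model's streaming output rather than a hard-coded list, and each yielded sentence would be queued for synthesis.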

Setup

Download required models:

kokoro-v0_19.onnx (TTS model)
Pull the llama3.2 model using Ollama (ollama pull llama3.2)
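
Before the first run, it may help to confirm that both prerequisites are in place. The sketch below is a hypothetical pre-flight check, not part of the project: it assumes the TTS model file sits in the current working directory and that the Ollama command-line tool is installed as `ollama`. It does not verify that llama3.2 itself has already been pulled.

```python
import shutil
from pathlib import Path
from typing import List


def missing_requirements(model_path: str = "kokoro-v0_19.onnx",
                         ollama_binary: str = "ollama") -> List[str]:
    """Return a list of human-readable problems; an empty list means all good.

    Hypothetical helper. The default model location (current directory)
    is an assumption, not something the project documents.
    """
    problems = []
    # The TTS model must exist as a regular file.
    if not Path(model_path).is_file():
        problems.append(f"TTS model not found: {model_path}")
    # shutil.which returns None when the executable is not on PATH.
    if shutil.which(ollama_binary) is None:
        problems.append(f"'{ollama_binary}' not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in missing_requirements():
        print(problem)
```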

Usage

Run the chatbot:

python main.py

The program starts listening for voice input as soon as it launches. Speak naturally, then pause briefly; the bot replies with synthesized speech once it detects the pause. Press Ctrl+C to stop.
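
The "brief pause" behaviour can be pictured as energy-based silence detection. The following is a simplified sketch under stated assumptions, not the detection logic used by the project: audio arrives as frames of float samples in the range -1.0 to 1.0, a frame counts as silent when its RMS level falls below a threshold, and an utterance ends after a fixed number of consecutive silent frames. The names `rms` and `utterances` and both default values are hypothetical.

```python
import math
from typing import Iterable, Iterator, List, Sequence


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square level of one audio frame (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def utterances(frames: Iterable[Sequence[float]],
               silence_threshold: float = 0.01,
               max_silent_frames: int = 3) -> Iterator[List[Sequence[float]]]:
    """Group voiced frames into utterances separated by pauses.

    Hypothetical sketch: the threshold and frame count are illustrative
    values, and silent frames are discarded rather than kept.
    """
    current: List[Sequence[float]] = []
    silent_run = 0
    for frame in frames:
        if rms(frame) >= silence_threshold:
            # Voiced frame: keep it and reset the pause counter.
            current.append(frame)
            silent_run = 0
        elif current:
            # Silence after speech: count it toward the end-of-utterance pause.
            silent_run += 1
            if silent_run >= max_silent_frames:
                yield current
                current = []
                silent_run = 0
    # Emit any speech still buffered when the input ends.
    if current:
        yield current
```

Each yielded utterance would then be passed to the speech recognizer. A longer `max_silent_frames` tolerates hesitations mid-sentence at the cost of a slower response.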