Type: Software
Status: POC
Tech Stack: Python, OpenAI Assistants API, Pyglet, PyAudio, SpeechRecognition, tkinter
Problem Statement
AI assistants communicate almost exclusively via text. For exhibitions, events, and interactive installations, a visual presence is missing — a ‚face‘ that reacts to questions and shows human-like animations. Existing avatar solutions require elaborate 3D engines, professional motion capture, or cloud rendering. There is no lightweight solution that provides an animated, speaking AI avatar with real-time interaction on standard hardware.
Description
An animated AI avatar system with real-time voice output and robotic audio effects. The avatar reacts with synchronized eye and mouth animation to OpenAI Assistants responses. Supports multi-monitor (avatar + Matrix-style terminal), push-to-talk speech input, conversation history with thread persistence, an extensible plugin system, and configurable audio/visual parameters.
Use Case
An animated AI avatar on the screen that answers questions while moving its mouth and eyes.
Link: https://github.com/rawk7000/SpeakingAI (private repo)