Wi-Fi
BOOT button wake-up and interrupt, supporting both click and long-press triggers
Offline voice wake-up ESP-SR
Streaming voice dialogue (WebSocket or UDP protocol)
Support for 5 languages: Mandarin, Cantonese, English, Japanese, Korean SenseVoice
Voice print recognition to identify who's calling AI's name 3D Speaker
Large model TTS (Volcengine or CosyVoice)
Large Language Model (Qwen2.5 72B or Doubao API)
Configurable prompts and voice tones (custom characters)
Short-term memory with self-summary after each conversation round
LCD display showing signal strength or conversation content
Support for displaying emoji images on LCD