
Michael YuanSecond State
Voice agents, conversational and real-time AI systems capable of listening, reasoning, speaking and actions, are rapidly becoming one of the most practical forms of AI agents. However, most existing voice AI solutions are closed, cloud-dependent, very expensive, and difficult to customize.
In this workshop, we explore using WebAssembly to run lightweight and portable AI inference servers for the full voice pipeline — speech-to-text, LLM reasoning, and text-to-speechn — directly on your own machines, supporting a large variety of AI models, OSes, CPU, GPU, NPU and consumer hardware. The goal is to enable private, controllable, and portable voice AI deployment.
Participants will walk through the architecture of a modern voice AI agent and learn how each component can be powered by Wasm-based cross-platform AI inference.
## By the end of this session, you will:
Please bring your own laptop. We will provide an open-source voice device that connects to the AI services on your laptop.
Early Bird
Conference Ticket WASM I/O 26
Until December 4th
Barcelona
Mar • 19- 20 • 2026
2-Day Conference
AXA Convention Center
Standard
After 4th Dec
Until February 19th
Barcelona
Mar • 19- 20 • 2026
2-Day Conference
AXA Convention Center
Late Bird
After 19th Feb
Barcelona
Mar • 19- 20 • 2026
2-Day Conference
AXA Convention Center