Kwindla Hultman Kramer - How to build the world's fastest voice bot

Video Available!

How to build the world's fastest voice AI bot:

  • Self-host speech-to-text, LLM inference, and text-to-speech all together in the same container/cluster.
  • Route audio over the internet using WebRTC and edge networking.
  • Configure timings for voice activity detection, phrase endpointing, and other parts of the pipeline to optimize for latency. (There are trade-offs to doing this!)

Here's a LLama 3 voice bot that has voice-to-voice response times of ~500ms.

We used @DeepgramAI 's STT and TTS for this bot, and everything is hosted on @cerebriumai 's serverless GPU infrastructure.

https://x.com/kwindla/status/1806129490411900940

Kwindla Hultman Kramer

https://machine-theory.com/

Kwindla Hultman Kramer
Kwindla Hultman KramerCEO

Buy Tickets

We have now sold out of Early Bird tickets; General Admission has also sold out.
Please join us online for the free livestream.

Buy Tickets SOLD OUT!