Day 2 - How to Build a Voice AI Agent with n8n Webhook and ElevenLabs API

If you have been asking questions like these:


- How do I create a voice webhook using n8n and ElevenLabs?

- What's the best way to integrate speech-to-text and text-to-speech in a workflow?

- How can I build a conversational AI voice agent without writing code?

- How do I set up webhook nodes in n8n to handle voice data?

- What are the steps to connect ElevenLabs API with n8n for voice automation?

- How do I create real-time voice workflows using AI and webhooks?


Then this lecture is for you!



In this hands-on lecture, you'll build a complete voice-enabled AI workflow using n8n, ElevenLabs, and speech-to-text technology. You'll start by configuring the ElevenLabs text-to-speech API to generate audio responses from AI agents, testing different voice profiles to find the right tone for your voice assistant. Next, you'll set up a Webhook node in n8n to create an endpoint that receives audio input, then connect it to the ElevenLabs speech-to-text service to transcribe voice messages into text. You'll learn how to route the transcribed text through an AI agent for conversational responses, then convert those responses back to speech using the ElevenLabs API.

The lecture covers essential webhook configuration, including switching the HTTP method from GET to POST, handling binary audio files, and using the Respond to Webhook node to return audio data to the caller. You'll also work with a simple HTML interface that records voice input in the browser and sends it to your n8n webhook endpoint. By the end, you'll understand how to orchestrate asynchronous voice workflows, troubleshoot common integration issues, and create scalable conversational AI agents with n8n workflow automation, all without writing a single line of code.
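
The lecture itself needs no code, but when troubleshooting it can help to call the webhook from a script rather than the browser page. The Python sketch below shows one way to do that: it POSTs a recorded audio file as multipart form data and saves whatever audio the workflow returns. The webhook path `voice-agent`, the form field name `audio`, the file names, and the assumption that the reply is MP3 are all hypothetical placeholders, not values taken from the lecture. Replace them with whatever your own workflow uses.

```python
import urllib.request
import uuid
from typing import Tuple

# Hypothetical values: use the test URL shown on your own Webhook node
# and the binary field name your workflow expects.
WEBHOOK_URL = "http://localhost:5678/webhook-test/voice-agent"
AUDIO_FIELD = "audio"


def build_multipart(field: str, filename: str, data: bytes,
                    content_type: str = "audio/webm") -> Tuple[bytes, str]:
    """Encode one binary file as a multipart/form-data body.

    Returns the body bytes and the matching Content-Type header value.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def build_request(url: str, audio: bytes) -> urllib.request.Request:
    """Prepare the POST request; the Webhook node must be set to POST."""
    body, ctype = build_multipart(AUDIO_FIELD, "question.webm", audio)
    return urllib.request.Request(
        url, data=body, headers={"Content-Type": ctype}, method="POST"
    )


def ask_voice_agent(url: str, audio: bytes, out_path: str = "reply.mp3") -> str:
    """Send the recording and save the audio returned by the workflow.

    Assumes the workflow replies with raw audio bytes (MP3 here).
    """
    with urllib.request.urlopen(build_request(url, audio), timeout=60) as resp:
        reply = resp.read()
    with open(out_path, "wb") as fh:
        fh.write(reply)
    return out_path
```

With the workflow listening for a test event, reading a recording into bytes and calling `ask_voice_agent(WEBHOOK_URL, audio_bytes)` should write the spoken reply to `reply.mp3`. If the call fails, check first that the Webhook node's HTTP method is POST and that the form field name matches the binary property the workflow reads.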