OpenAI Voice Agent for Auto-Repair Estimates

Answer after-hours calls, capture vehicle details, and provide instant repair estimates for auto shops — all via natural voice conversation.

Tags: openai, voice-agent, auto-repair, twilio, typescript, nextjs, stt, tts, confidence-router

The problem

Small auto-repair shops miss revenue from after-hours callers who need estimates but can’t reach anyone. Manual intake by voicemail is slow, error-prone, and rarely converts to a booked job.

Intro

This recipe builds a telephone-based voice agent that answers after-hours calls for small auto-repair shops. When a customer calls, the agent uses STT (OpenAI Whisper) to transcribe their speech, runs a finite-state machine to collect vehicle details (make, model, year, symptom), queries OpenAI’s gpt-4o-mini for a repair cost estimate, and reads the result back via TTS (Deepgram Aura). A confidence-based router classifies caller intent — get an estimate, schedule an appointment, or speak to a human — and routes the conversation accordingly.
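The routing and slot-collection logic described above can be sketched in plain TypeScript. This is a minimal illustration under stated assumptions, not the recipe's implementation: the names `routeIntent` and `advance`, the 0.7 threshold, and the prompt wording are all hypothetical, and the intent scores are assumed to come from an upstream classifier.

```typescript
// Illustrative sketch only: every name and constant here is hypothetical.
type Intent = "estimate" | "schedule" | "human";

interface IntentScore {
  intent: Intent;
  confidence: number; // assumed range 0..1, produced by an upstream classifier
}

// Pick the highest-scoring intent; fall back to a human when confidence is low.
function routeIntent(scores: IntentScore[], threshold: number = 0.7): Intent {
  let best: IntentScore | undefined;
  for (const score of scores) {
    if (best === undefined || score.confidence > best.confidence) best = score;
  }
  return best !== undefined && best.confidence >= threshold ? best.intent : "human";
}

// The estimate flow collects these four fields in order.
const SLOTS = ["make", "model", "year", "symptom"] as const;
type Slot = (typeof SLOTS)[number];
type State = Slot | "done";
type Vehicle = Partial<Record<Slot, string>>;

const PROMPTS: Record<State, string> = {
  make: "What is the make of your vehicle?",
  model: "What model is it?",
  year: "What year is it?",
  symptom: "What problem are you noticing?",
  done: "Thanks, preparing your estimate.",
};

// The next state is the first slot that is still empty.
function nextState(vehicle: Vehicle): State {
  return SLOTS.find((slot) => !vehicle[slot]) ?? "done";
}

// Store the caller's answer for the current slot, then move on.
// An empty transcript leaves the state unchanged so the prompt repeats.
function advance(state: State, vehicle: Vehicle, utterance: string) {
  const answer = utterance.trim();
  const updated: Vehicle =
    state === "done" || answer === "" ? vehicle : { ...vehicle, [state]: answer };
  const next = nextState(updated);
  return { state: next, vehicle: updated, prompt: PROMPTS[next] };
}
```

Defaulting to "human" whenever no intent clears the threshold is a deliberate choice in this sketch: a misrouted caller costs more than an unnecessary handoff. Keeping `advance` a pure function (state in, state out) also makes the flow easy to unit-test without any audio in the loop.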

The project is scaffolded as a Next.js application. It uses Express internally for the HTTP server and WebSocket endpoint that integrates with Twilio Media Streams. Session state is kept in an in-memory session manager from the @reaatech/voice-agent-core package — no external database required.
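To make the idea of in-memory session state concrete, here is a rough sketch of a per-call store with expiry. It is a hypothetical stand-in and does not reflect the actual API of @reaatech/voice-agent-core; the class name, method names, and TTL behaviour are all assumptions.

```typescript
// Hypothetical stand-in: NOT the @reaatech/voice-agent-core API.
interface Entry<T> {
  data: T;
  expiresAt: number; // epoch milliseconds
}

class InMemorySessionStore<T> {
  private readonly entries = new Map<string, Entry<T>>();
  private readonly ttlMs: number;
  private readonly now: () => number;

  // The clock is injectable so expiry can be tested without waiting.
  constructor(ttlMs: number, now: () => number = Date.now) {
    this.ttlMs = ttlMs;
    this.now = now;
  }

  // Create or overwrite a session and restart its expiry clock.
  set(callId: string, data: T): void {
    this.entries.set(callId, { data, expiresAt: this.now() + this.ttlMs });
  }

  // Return the session data, or undefined if it is missing or expired.
  get(callId: string): T | undefined {
    const entry = this.entries.get(callId);
    if (entry === undefined) return undefined;
    if (entry.expiresAt <= this.now()) {
      this.entries.delete(callId);
      return undefined;
    }
    return entry.data;
  }

  // Drop a session when the call ends; returns true if one existed.
  end(callId: string): boolean {
    return this.entries.delete(callId);
  }

  get size(): number {
    return this.entries.size;
  }
}
```

The trade-off of any in-memory approach is that sessions vanish on restart and are not shared across processes. That is usually acceptable for short phone calls handled by a single server, which is presumably why no external database is required here.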

Prerequisites

Node.js >= 22 with pnpm@10
A Twilio phone number with Voice and Media Streams enabled
An OpenAI API key (for Whisper STT + gpt-4o-mini estimate generation)
A Deepgram API key (for Deepgram Aura TTS)
A Langfuse account (for observability — public key, secret key, and host URL)

Assumed knowledge

You should be comfortable with TypeScript, Next.js route handlers, and basic WebSocket concepts. No deep Twilio experience is needed — the Twilio wiring is handled by the @reaatech/voice-agent-telephony package.

Step 1: Scaffold the project and install dependencies

Create a new directory and initialize the project. The package.json pins every dependency to an exact version — no ^ or ~ ranges.
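One way to enforce the exact-version rule is a small check over the parsed manifest. The sketch below is an assumption about how such a check could look, not part of the recipe; `findUnpinned` and `PackageManifest` are hypothetical names, and the function takes an already-parsed object rather than reading package.json from disk.

```typescript
// Matches an exact semver such as "1.2.3", "1.2.3-beta.1", or "1.2.3+build.5".
// Ranges ("^1.2.3", "~1.2.3", ">=1"), dist-tags ("latest"), and URLs do not match.
const EXACT_VERSION = /^\d+\.\d+\.\d+(?:-[0-9A-Za-z.-]+)?(?:\+[0-9A-Za-z.-]+)?$/;

interface PackageManifest {
  dependencies?: Record<string, string>;
  devDependencies?: Record<string, string>;
}

// Return "name@range" for every dependency that is not pinned exactly.
function findUnpinned(manifest: PackageManifest): string[] {
  const all = { ...manifest.dependencies, ...manifest.devDependencies };
  return Object.entries(all)
    .filter(([, range]) => !EXACT_VERSION.test(range))
    .map(([name, range]) => `${name}@${range}`)
    .sort();
}

// Placeholder package names and versions, for illustration only.
const offenders = findUnpinned({
  dependencies: { "example-a": "1.4.2", "example-b": "^2.0.0" },
  devDependencies: { "example-c": "~0.9.1" },
});
console.log(offenders);
```

Run against a parsed package.json in CI, an empty result means every entry is pinned. With pnpm, installing with `--save-exact` writes exact versions in the first place.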

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

173 kB · 67 tests · 100.0% coverage · vitest passing

SHA-256: fadea294bc66ee77d4b099584ebb8bd6580195b3b4c2f6d48a0d3053c6a55453

Step 2: Configure environment variables with Zod

Step 3: Define shared types

Step 4: Initialize the Langfuse client

Step 5: Build the voice engine

Step 6: Wire the telephony handler

Step 7: Create the intent router

Step 8: Build the estimate collection FSM

Step 9: Create the estimate composer with OpenAI

Step 10: Build the call orchestrator

Step 11: Set up the Express + WebSocket server

Step 12: Create the entry point with graceful shutdown

Step 13: Write and run the tests

Next steps