Skip to content
reaatechREAATECH

reaatech/voice-agent-kit

0Last commit: Jun 4, 2026GitHub →

These packages give you the full pipeline to turn microphone or phone audio into an AI agent response and back — speech-to-text, MCP tool calls, text-to-speech, and telephony or WebRTC transport, all orchestrated with per-stage latency budgets. You'd adopt them to build a production voice agent without writing the audio plumbing, provider switching, or session management yourself. The most distinctive thing is that the agent logic lives entirely in an external MCP server, so the pipeline is a pure transport layer that can be swapped between Twilio, WebRTC, or a local simulator without changing the agent.

Packages

8 packages

@reaatech/create-voice-agent

v0.1.0
A scaffolding CLI that generates a complete voice-agent-kit project with pipeline configuration, STT/TTS provider setup, telephony or WebRTC transport, an MCP client, and a ready-to-run server.
status
published
published
13 days ago

@reaatech/voice-agent-core

v0.1.0
A Zod-validated configuration system and pipeline orchestrator for building voice-enabled AI agents, providing a `createPipeline()` function that coordinates STT, MCP, and TTS stages with latency enforcement, session management, and OpenTelemetry observability.
status
published
published
13 days ago

@reaatech/voice-agent-mcp-client

v0.1.0
A JSON-RPC 2.0 client that connects to any MCP server endpoint, providing tool discovery, conversation history management, retry with backoff, and TTS-safe response sanitization. It exposes an `MCPClient` class with `connect()`, `sendRequest()`, `discoverTools()`, and `close()` methods, and requires only a `fetch`-compatible runtime.
status
published
published
13 days ago

@reaatech/voice-agent-simulator

v0.1.0
A CLI and programmatic simulator that runs a voice agent pipeline (STT → MCP → TTS) locally from a WAV file or live microphone, reporting per-turn latency without requiring Twilio or cloud telephony. Exports `createSimulator`, `Simulator`, `renderLatencyWaterfall`, `readWavFile`, `writeWavFile`, `captureMicrophone`, and `playAudio`.
status
published
published
13 days ago

@reaatech/voice-agent-stt

v0.1.0
Provider-agnostic speech-to-text interface with a unified `STTProvider` class and seven adapter implementations (Deepgram, AWS Transcribe, Google Cloud Speech-to-Text, OpenAI Realtime, OpenAI Whisper, AssemblyAI, Groq Whisper), plus built-in audio format conversion between mulaw and linear16 with sample rate resampling.
status
published
published
13 days ago

@reaatech/voice-agent-telephony

v0.1.0
A WebSocket handler for voice AI agents that normalizes bidirectional streaming protocols (start/media/stop/mark/DTMF) across Twilio, Telnyx, SignalWire, and Vonage, providing barge-in detection, audio buffering, and base64 encoding/decoding. Exports a `TwilioMediaStreamHandler` class (EventEmitter) with methods like `acceptConnection`, `sendAudio`, `clearAudio`, and `sendMark`, plus static `encodeForTwilio`/`decodeFromTwilio` utilities.
status
published
published
13 days ago

@reaatech/voice-agent-tts

v0.1.0
Provider-agnostic text-to-speech interface with five adapter implementations (Deepgram Aura, AWS Polly, Google Cloud TTS, ElevenLabs, Cartesia), returning streaming audio as `AsyncIterable<AudioChunk>` with cancelable synthesis and Twilio-ready audio formatting.
status
published
published
13 days ago

@reaatech/voice-agent-webrtc

v0.1.0
A WebSocket-based transport class (`WebRTCTransport`) for browser voice AI agents that handles Opus encode/decode, PCM resampling, and barge-in detection, paired with standalone PCM utility functions. Requires a WebSocket connection and optionally `@discordjs/opus` or `prism-media` for Opus support.
status
published
published
13 days ago

Comments

Sign in with GitHub to comment and vote.

Loading comments…