Skip to content
reaatechREAATECH

reaatech/voice-agent-kit

0Last commit: May 11, 2026GitHub →

These packages provide a transport layer for real-time voice AI, orchestrating the pipeline between telephony streams, speech-to-text, text-to-speech, and MCP-based agent logic. They solve the challenge of maintaining sub-second latency in conversational systems by enforcing strict timing budgets and providing provider-agnostic adapters for services like Deepgram, AWS, and Google Cloud. The architecture centers on an event-driven pipeline that decouples telephony handling from agent decision-making, allowing you to swap providers or integrate custom MCP servers without modifying core orchestration logic.

Packages

5 packages

@reaatech/voice-agent-core

pending npm
Orchestrates STT, MCP, and TTS pipelines for voice-enabled AI agents using an event-driven `Pipeline` class and session management utilities. It provides latency enforcement, OpenTelemetry-instrumented lifecycle hooks, and Zod-validated configuration schemas.
status
awaiting publish

@reaatech/voice-agent-mcp-client

pending npm
Connects to Model Context Protocol (MCP) servers via JSON-RPC 2.0 to manage tool discovery, conversation history, and request retries. It provides an `MCPClient` class that returns sanitized, TTS-ready text responses and tool call metadata.
status
awaiting publish

@reaatech/voice-agent-stt

pending npm
Provides a unified interface for streaming audio to Deepgram, AWS Transcribe, or Google Cloud Speech-to-Text via provider-specific classes. It includes built-in utilities for audio format conversion, sample rate resampling, and automatic reconnection with buffering.
status
awaiting publish

@reaatech/voice-agent-telephony

pending npm
Provides a `TwilioMediaStreamHandler` class that manages the bidirectional Twilio Media Streams WebSocket protocol, including audio buffering, barge-in detection, and event lifecycle tracking. It is designed to wrap an existing WebSocket connection to facilitate real-time audio streaming between Twilio and voice AI services.
status
awaiting publish

@reaatech/voice-agent-tts

pending npm
Provides a unified interface for text-to-speech synthesis across Deepgram, AWS Polly, and Google Cloud, returning audio as an `AsyncIterable<AudioChunk>`. It includes utility classes for Twilio-compatible audio formatting, silence generation, and text chunking to support low-latency streaming voice applications.
status
awaiting publish

Comments

Sign in with GitHub to comment and vote.

Loading comments…