Cohere MCP Server for SMB Research and Summarization

Expose Cohere's language models and search tools to AI agents via an MCP server, enabling automated research, summarization, and content generation for SMBs.

mcp-server cohere tavily smb research summarization typescript express

The problem

Small businesses need to integrate AI agents that can research topics, summarize documents, and answer complex queries, but building and maintaining API wrappers for Cohere's capabilities is time-consuming.

Built from

Intro

This tutorial walks you through building an MCP (Model Context Protocol) server that exposes Cohere’s language models and Tavily’s web search tools as composable AI agent tools. By the end, you’ll have a working server with seven MCP tools — summarization, chat, web search, content extraction, website crawling, PDF parsing, and automated research report generation — wrapped in auth, rate limiting, and OpenTelemetry observability.

Prerequisites

Node.js 22+ and pnpm installed (the project pins pnpm@10.0.0)
A Cohere API key — sign up at dashboard.cohere.com
A Tavily API key — sign up at tavily.com
Familiarity with TypeScript and MCP concepts

Step 1: Scaffold the project

The project starts with a Next.js 16 scaffold (App Router) that provides the build toolchain — TypeScript, Vitest, ESLint, and pnpm workspaces. This shell is already on disk; your job is to build the server code inside it.

The scaffold provides these files:

package.json — deps and devDeps exact-pinned
tsconfig.json, vitest.config.ts, eslint.config.mjs, next.config.ts
.env.example — placeholder environment variables (you’ll review it next)
app/layout.tsx and app/page.tsx

Install the dependencies:

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

179 kB·56 tests·97.5% coverage·vitest passing

SHA-25623dd87560aae62246d56f3758a7a2fb83cf5c0a1686866d0228020068d567627

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js 22+ and pnpm installed (the project pins pnpm@10.0.0)
A Cohere API key — sign up at dashboard.cohere.com
A Tavily API key — sign up at tavily.com
Familiarity with TypeScript and MCP concepts

Step 1: Scaffold the project

The scaffold provides these files:

package.json — deps and devDeps exact-pinned
tsconfig.json, vitest.config.ts, eslint.config.mjs, next.config.ts
.env.example — placeholder environment variables (you’ll review it next)
app/layout.tsx and app/page.tsx

Install the dependencies:

import { tavily } from '@tavily/core'; export interface TavilyClientOptions { apiKey: string; } export interface SearchParams { query: string; maxResults?: number; includeAnswer?: boolean; searchDepth?: 'basic' | 'advanced'; includeDomains?: string[]; excludeDomains?: string[]; } export interface ExtractParams { urls: string[]; } export interface CrawlParams { url: string; maxDepth?: number; limit?: number; instructions?: string; } export interface SearchResult { results: Array<{ title: string; url: string; content: string; score: number; }>; answer?: string; } export interface ExtractResult { results: Array<{ url: string; rawContent: string }>; failedResults: Array<{ url: string; error: string }>; } export interface CrawlResult { results: Array<{ url: string; rawContent: string }>; } export class TavilyClient { private client: ReturnType<typeof tavily>; constructor(options: TavilyClientOptions) { this.client = tavily({ apiKey: options.apiKey }); } async search(params: SearchParams): Promise<SearchResult> { const response = await this.client.search(params.query, { maxResults: params.maxResults, includeAnswer: params.includeAnswer, searchDepth: params.searchDepth, includeDomains: params.includeDomains, excludeDomains: params.excludeDomains, }); return { results: response.results.map((r) => ({ title: r.title, url: r.url, content: r.content, score: r.score, })), answer: response.answer, }; } async extract(params: ExtractParams): Promise<ExtractResult> { const response = await this.client.extract(params.urls); return { results: response.results.map((r) => ({ url: r.url, rawContent: r.rawContent, })), failedResults: response.failedResults.map((f) => ({ url: f.url, error: f.error, })), }; } async crawl(params: CrawlParams): Promise<CrawlResult> { const response = await this.client.crawl(params.url, { max_depth: params.maxDepth, limit: params.limit, instructions: params.instructions, }); return { results: response.results.map((r) => ({ url: r.url, rawContent: r.rawContent, })), }; } }

import { defineTool } from '@reaatech/mcp-server-tools'; import { z } from 'zod'; import { textContent, errorResponse } from '@reaatech/mcp-server-core'; import { TavilyClient } from '../lib/tavily.js'; import type { ToolResponse } from '@reaatech/mcp-server-core'; import { recordToolInvocation, recordError } from '@reaatech/mcp-server-observability'; export default defineTool({ name: 'search-web', description: 'Search the web using Tavily search engine', inputSchema: z.object({ query: z.string().describe('Search query'), maxResults: z.number().min(1).max(20).optional().describe('Maximum number of results'), includeAnswer: z.boolean().optional().describe('Include an answer in the response'), searchDepth: z.enum(['basic', 'advanced']).optional().describe('Search depth'), includeDomains: z.array(z.string()).optional().describe('Domains to include'), excludeDomains: z.array(z.string()).optional().describe('Domains to exclude'), }), handler: async (args, _): Promise<ToolResponse> => { const start = Date.now(); const toolName = 'search-web'; const result = await (async () => { const client = new TavilyClient({ apiKey: process.env.TAVILY_API_KEY ?? '' }); return await client.search({ query: args.query as string, maxResults: args.maxResults as number | undefined, includeAnswer: args.includeAnswer as boolean | undefined, searchDepth: args.searchDepth as 'basic' | 'advanced' | undefined, includeDomains: args.includeDomains as string[] | undefined, excludeDomains: args.excludeDomains as string[] | undefined, }); })().catch((err: unknown) => { const message = err instanceof Error ? err.message : 'Search failed'; recordError({ errorType: 'tool_execution_failed', toolName }); recordToolInvocation({ toolName, status: 'error', durationMs: Date.now() - start }); return { error: message }; }); if ('error' in result) { return errorResponse((result as { error: string }).error); } const data = result as { results: Array<{ title: string; url: string; content: string; score: number }>; answer?: string }; const formatted = data.results.map((r, i) => `### ${String(i + 1)}. ${r.title}\nURL: ${r.url}\nScore: ${String(r.score)}\n\n${r.content}` ).join('\n\n'); const answer = data.answer ? `**Answer:** ${data.answer}\n\n` : ''; recordToolInvocation({ toolName, status: 'success', durationMs: Date.now() - start }); return { content: [textContent(`${answer}${formatted}`)] }; }, });

import { defineTool } from '@reaatech/mcp-server-tools'; import { z } from 'zod'; import { textContent, errorResponse } from '@reaatech/mcp-server-core'; import { TavilyClient } from '../lib/tavily.js'; import type { ToolResponse } from '@reaatech/mcp-server-core'; import { recordToolInvocation, recordError } from '@reaatech/mcp-server-observability'; export default defineTool({ name: 'crawl', description: 'Crawl a website using Tavily', inputSchema: z.object({ url: z.string().url().describe('Starting URL to crawl'), maxDepth: z.number().min(1).max(5).optional().describe('Maximum crawl depth'), limit: z.number().min(1).max(100).optional().describe('Maximum number of pages to crawl'), instructions: z.string().optional().describe('Crawling instructions'), }), handler: async (args, _): Promise<ToolResponse> => { const start = Date.now(); const toolName = 'crawl'; const result = await (async () => { const client = new TavilyClient({ apiKey: process.env.TAVILY_API_KEY ?? '' }); return await client.crawl({ url: args.url as string, maxDepth: args.maxDepth as number | undefined, limit: args.limit as number | undefined, instructions: args.instructions as string | undefined, }); })().catch((err: unknown) => { const message = err instanceof Error ? err.message : 'Crawl failed'; recordError({ errorType: 'tool_execution_failed', toolName }); recordToolInvocation({ toolName, status: 'error', durationMs: Date.now() - start }); return { error: message }; }); if ('error' in result) { return errorResponse((result as { error: string }).error); } const data = result as { results: Array<{ url: string; rawContent: string }> }; const formatted = data.results.map((r) => `=== ${r.url} ===\n${r.rawContent.slice(0, 2000)}${r.rawContent.length > 2000 ? '...' : ''}` ).join('\n\n'); recordToolInvocation({ toolName, status: 'success', durationMs: Date.now() - start }); return { content: [textContent(formatted)] }; }, });

import { defineTool } from '@reaatech/mcp-server-tools'; import { z } from 'zod'; import { textContent, errorResponse } from '@reaatech/mcp-server-core'; import { CohereClient } from '../lib/cohere.js'; import { recordToolInvocation, recordError } from '@reaatech/mcp-server-observability'; export default defineTool({ name: 'chat', description: 'Chat with Cohere language model', inputSchema: z.object({ message: z.string().describe('User message'), system: z.string().optional().describe('System prompt'), model: z.string().optional().describe('Cohere model ID'), temperature: z.number().min(0).max(2).optional().describe('Generation temperature'), maxTokens: z.number().min(1).max(8192).optional().describe('Maximum tokens to generate'), }), handler: async (args, _) => { const start = Date.now(); const toolName = 'chat'; const client = new CohereClient(); const system = args.system as string | undefined; const messages: Array<{ role: 'user' | 'system' | 'assistant'; content: string }> = []; if (system) { messages.push({ role: 'system', content: system }); } messages.push({ role: 'user', content: args.message as string }); const result = await client.chat({ messages, model: args.model as string | undefined, temperature: args.temperature as number | undefined, maxTokens: args.maxTokens as number | undefined, }).catch((err: unknown) => { const message = err instanceof Error ? err.message : 'Chat failed'; recordError({ errorType: 'tool_execution_failed', toolName }); recordToolInvocation({ toolName, status: 'error', durationMs: Date.now() - start }); return { error: message }; }); if ('error' in result) { return errorResponse((result as { error: string }).error); } recordToolInvocation({ toolName, status: 'success', durationMs: Date.now() - start }); return { content: [textContent((result as { text: string }).text)] }; }, });

Cohere MCP Server for SMB Research and Summarization

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the project

Step 2: Configure environment variables

Step 3: Create the Cohere client wrapper

Step 4: Create the Tavily search client wrapper

Step 5: Create the PDF extraction utility

Step 6: Create the summarize MCP tool

Step 7: Create the search-web MCP tool

Step 8: Create the extract MCP tool

Step 9: Create the crawl MCP tool

Step 10: Create the chat MCP tool

Step 11: Create the parse-document MCP tool

Step 12: Create the generate-report MCP tool

Step 13: Create the server entry point

Step 14: Run the tests

Next steps