@reaatech/agents-markdown-parser

Status: Pre-1.0 — APIs may change in minor versions. Pin to a specific version in production.

Markdown AST parser with YAML frontmatter extraction for AGENTS.md and SKILL.md files. It is built on remark and unified for AST extraction, and it also provides section hierarchy analysis, table parsing, and code block extraction.

Installation

terminal

npm install @reaatech/agents-markdown-parser
# or
pnpm add @reaatech/agents-markdown-parser

Feature Overview

Main parser: parseMarkdown produces a typed AgentsMdDocument or SkillMdDocument from raw markdown
YAML frontmatter: extract, validate, create, and update frontmatter using YAML parsing
Section hierarchy: build a nested section tree from headings, with find/has/flatten utilities
Table extraction: parse all markdown tables, extract columns, convert rows to objects, validate structure
Code blocks: extract fenced code blocks, detect languages, flag potential secrets
Batch processing: parseMarkdownFiles for multi-file operations

Quick Start

typescript

import { parseMarkdown, extractSections, parseTables } from "@reaatech/agents-markdown-parser";
 
const content = `---
agent_id: my-agent
display_name: My Agent
version: 1.0.0
description: A test agent
type: mcp
---
 
# My Agent
 
## What this is
 
A test agent for demonstration.
 
## Architecture Overview
 
\`\`\`text
┌────────┐    ┌──────────┐
│ Input  │───▶│ Processor │───▶ Output
└────────┘    └──────────┘
\`\`\`
 
| Component | Purpose |
|-----------|---------|
| Parser    | Parsing  |
`;
 
const doc = await parseMarkdown(content, "/path/to/AGENTS.md");
console.log(doc.frontmatter.id);        // "my-agent"
console.log(doc.title);                  // "My Agent"
console.log(doc.sections.length);        // 2
 
const sections = extractSections(content);
const tables = parseTables(content);

API Reference

`parseMarkdown(content, path)`

The main entry point. Parses raw markdown into a fully structured document.

typescript

async function parseMarkdown(
  content: string,
  path: string
): Promise<AgentsMdDocument | SkillMdDocument>

`parseMarkdownFiles(files)`

Batch counterpart of `parseMarkdown` for multi-file operations. Resolves to an array of parsed documents.

typescript

async function parseMarkdownFiles(
  files: string[]
): Promise<Array<AgentsMdDocument | SkillMdDocument>>

Document Navigation

| Function | Signature | Description |
|----------|-----------|-------------|
| `getSectionTitles` | `(doc) => string[]` | All section titles in the document |
| `findSection` | `(doc, title) => Section \| undefined` | Find a section by title (case-insensitive) |
| `getHeadings` | `(doc) => Array<{title, level, line}>` | All headings with metadata |

Frontmatter (`extractFrontmatter`)

typescript

function extractFrontmatter(content: string): {
  frontmatter: ParsedFrontmatter;
  frontmatterRange: { start: number; end: number };
  contentWithoutFrontmatter: string;
}

| Function | Description |
|----------|-------------|
| `extractFrontmatter(content)` | Extract and parse YAML frontmatter |
| `validateFrontmatterStructure(frontmatter)` | Check required fields exist |
| `createFrontmatter(values, isSkill?)` | Generate a frontmatter string |
| `updateFrontmatter(content, updates)` | Merge updates into existing frontmatter |
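
To make the shape of the `extractFrontmatter` result concrete, here is a minimal, self-contained sketch of how a `---`-delimited block can be split from the body. It is not the package's implementation: `splitFrontmatter` and `FrontmatterSplit` are hypothetical names, the parser only handles flat `key: value` pairs rather than full YAML, and the 1-based line range is an assumption.

```typescript
// Hypothetical stand-in for illustration; handles flat `key: value` pairs only.
interface FrontmatterSplit {
  frontmatter: Record<string, string>;
  frontmatterRange: { start: number; end: number }; // assumed to be 1-based lines
  contentWithoutFrontmatter: string;
}

function splitFrontmatter(content: string): FrontmatterSplit {
  const lines = content.split("\n");
  const empty: FrontmatterSplit = {
    frontmatter: {},
    frontmatterRange: { start: 0, end: 0 },
    contentWithoutFrontmatter: content,
  };
  // Frontmatter must open on the very first line.
  if ((lines[0] ?? "").trim() !== "---") return empty;
  // Find the closing delimiter after the opening one.
  const close = lines.findIndex((line, i) => i > 0 && line.trim() === "---");
  if (close === -1) return empty;

  const frontmatter: Record<string, string> = {};
  for (const line of lines.slice(1, close)) {
    const sep = line.indexOf(":");
    if (sep === -1) continue; // skip lines that are not `key: value`
    frontmatter[line.slice(0, sep).trim()] = line.slice(sep + 1).trim();
  }
  return {
    frontmatter,
    frontmatterRange: { start: 1, end: close + 1 },
    contentWithoutFrontmatter: lines.slice(close + 1).join("\n"),
  };
}

const sample = "---\nagent_id: my-agent\ntype: mcp\n---\n# My Agent\n";
const result = splitFrontmatter(sample);
// result.frontmatter.agent_id          -> "my-agent"
// result.frontmatterRange              -> { start: 1, end: 4 }
// result.contentWithoutFrontmatter     -> "# My Agent\n"
```

Content without a leading `---` line is returned unchanged in this sketch, which keeps the function safe to call on any markdown string.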

Section Extraction (`extractSections`)

typescript

function extractSections(content: string, lineOffset?: number): Section[]
 
interface Section {
  title: string;
  level: number;
  content: string;
  location: ErrorLocation;
  subsections: Section[];
}

| Function | Description |
|----------|-------------|
| `extractSections(content, lineOffset?)` | Build section hierarchy from headings |
| `findSection(sections, title)` | Recursive case-insensitive find |
| `findSectionByPath(sections, path)` | Find by array path (e.g. `["A", "B"]`) |
| `findSectionByTitle(sections, title)` | Alias for `findSection` exported for disambiguation |
| `flattenSectionTitles(sections)` | All titles in a flat array |
| `hasSection(sections, title)` | Boolean existence check |
| `getSectionsAtLevel(sections, level)` | Filter by heading level |
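
The idea of a nested section tree built from headings can be sketched with a stack of open ancestors. This is an illustration under stated assumptions rather than the package's code: `SketchSection`, `buildSectionTree`, and `findByPath` are hypothetical names, the type omits the `location` field, headings inside fenced code blocks are not skipped, and the case-insensitive path match is an assumption.

```typescript
// Simplified local type: the documented Section also carries `location`.
interface SketchSection {
  title: string;
  level: number;
  content: string;
  subsections: SketchSection[];
}

function buildSectionTree(markdown: string): SketchSection[] {
  const roots: SketchSection[] = [];
  const stack: SketchSection[] = []; // open ancestors, shallowest first

  for (const line of markdown.split("\n")) {
    const match = /^(#{1,6})\s+(.*)$/.exec(line);
    if (!match) {
      // Body text belongs to the most recently opened section.
      const current = stack[stack.length - 1];
      if (current) current.content += line + "\n";
      continue;
    }
    const section: SketchSection = {
      title: (match[2] ?? "").trim(),
      level: (match[1] ?? "").length,
      content: "",
      subsections: [],
    };
    // Close every open section at the same or a deeper level.
    while (stack.length > 0) {
      const top = stack[stack.length - 1];
      if (top && top.level >= section.level) stack.pop();
      else break;
    }
    const parent = stack[stack.length - 1];
    (parent ? parent.subsections : roots).push(section);
    stack.push(section);
  }
  return roots;
}

// Walk a title path such as ["A", "C", "D"] one nesting level at a time.
function findByPath(sections: SketchSection[], path: string[]): SketchSection | undefined {
  let candidates = sections;
  let found: SketchSection | undefined;
  for (const title of path) {
    found = candidates.find((s) => s.title.toLowerCase() === title.toLowerCase());
    if (!found) return undefined;
    candidates = found.subsections;
  }
  return found;
}

const tree = buildSectionTree("# A\n## B\ntext\n## C\n### D\n# E");
// tree.map((s) => s.title)                 -> ["A", "E"]
// findByPath(tree, ["A", "C", "D"])?.level -> 3
```

Popping the stack until the top is shallower than the new heading is what lets a level-1 heading close any number of nested subsections in one step.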

Table Parsing (`parseTables`)

typescript

function parseTables(content: string): MarkdownTable[]
 
interface MarkdownTable {
  headers: string[];
  rows: string[][];
  location: ErrorLocation;
}

| Function | Description |
|----------|-------------|
| `parseTables(content)` | Extract all markdown tables |
| `extractColumn(table, columnName)` | Extract a column by header name |
| `tableToObjects(table)` | Convert to `Record<string, string>[]` |
| `validateTableStructure(table)` | Check headers, duplicates, row consistency |
| `formatTable(table)` | Format back to markdown string |
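
The helpers in this table operate on the `headers`/`rows` shape shown above. The following sketch shows what row-to-object conversion, column extraction, and a structure check could look like over that shape. The names `SketchTable`, `toObjects`, `column`, and `structureProblems` are hypothetical, and the handling of missing cells and unknown columns is an assumption, not documented behavior of the package.

```typescript
// Local copy of the documented table shape, minus `location`.
interface SketchTable {
  headers: string[];
  rows: string[][];
}

function toObjects(table: SketchTable): Record<string, string>[] {
  return table.rows.map((row) => {
    const record: Record<string, string> = {};
    table.headers.forEach((header, i) => {
      record[header] = row[i] ?? ""; // assumption: missing cells become ""
    });
    return record;
  });
}

function column(table: SketchTable, name: string): string[] {
  const index = table.headers.indexOf(name);
  if (index === -1) return []; // assumption: unknown column yields an empty list
  return table.rows.map((row) => row[index] ?? "");
}

function structureProblems(table: SketchTable): string[] {
  const problems: string[] = [];
  if (table.headers.length === 0) problems.push("no headers");
  const seen = new Set<string>();
  for (const header of table.headers) {
    if (seen.has(header)) problems.push(`duplicate header: ${header}`);
    seen.add(header);
  }
  table.rows.forEach((row, i) => {
    if (row.length !== table.headers.length) {
      problems.push(`row ${i + 1} has ${row.length} cells, expected ${table.headers.length}`);
    }
  });
  return problems;
}

const table: SketchTable = {
  headers: ["Component", "Purpose"],
  rows: [["Parser", "Parsing"], ["Linter"]],
};
const objects = toObjects(table);
// objects[0]                  -> { Component: "Parser", Purpose: "Parsing" }
// column(table, "Component")  -> ["Parser", "Linter"]
// structureProblems(table)    -> ["row 2 has 1 cells, expected 2"]
```

Running a structure check before converting rows to objects is a reasonable order of operations, since ragged rows otherwise turn into silently empty fields.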

Code Block Extraction

| Function | Description |
|----------|-------------|
| `extractCodeBlocks(content)` | Extract fenced code blocks |
| `hasLanguage(codeBlock)` | Check if language is specified |
| `getCodeBlocksByLanguage(blocks, language)` | Filter by language |
| `mightContainSecret(codeBlock)` | Regex-based secret detection |
| `formatCodeBlock(codeBlock)` | Format back to markdown |
| `validateCodeBlockLanguages(blocks)` | Check which blocks have languages |
| `getUniqueLanguages(blocks)` | Sorted unique language identifiers |
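
As a rough illustration of fenced code block extraction, language listing, and regex-based secret detection, here is a self-contained sketch. Everything in it is an assumption made for the example: the `SketchCodeBlock` shape, the function names, and especially the secret patterns, since the package's actual code block type and regexes are not described in this document.

```typescript
// Hypothetical block shape; the package's own code block type is not shown here.
interface SketchCodeBlock {
  language: string | null;
  code: string;
}

const FENCE = "`".repeat(3); // built at runtime so this example stays fence-safe

function extractBlocks(markdown: string): SketchCodeBlock[] {
  const blocks: SketchCodeBlock[] = [];
  let open: { language: string | null; lines: string[] } | null = null;
  for (const line of markdown.split("\n")) {
    const trimmed = line.trim();
    if (open === null) {
      if (trimmed.startsWith(FENCE)) {
        // The first word after the fence is treated as the language.
        const info = trimmed.slice(FENCE.length).trim();
        open = { language: info === "" ? null : info.split(/\s+/)[0] ?? null, lines: [] };
      }
    } else if (trimmed === FENCE) {
      blocks.push({ language: open.language, code: open.lines.join("\n") });
      open = null;
    } else {
      open.lines.push(line);
    }
  }
  return blocks; // an unclosed fence is dropped in this sketch
}

function uniqueLanguages(blocks: SketchCodeBlock[]): string[] {
  const langs = blocks
    .map((b) => b.language)
    .filter((l): l is string => l !== null);
  return Array.from(new Set(langs)).sort();
}

// Illustrative patterns only; not the package's detection rules.
const SECRET_PATTERNS: RegExp[] = [
  /(api[_-]?key|secret|token|password)\s*[:=]\s*\S+/i,
  /-----BEGIN [A-Z ]*PRIVATE KEY-----/,
];

function looksLikeSecret(block: SketchCodeBlock): boolean {
  return SECRET_PATTERNS.some((pattern) => pattern.test(block.code));
}

const markdown = [
  "# Setup",
  FENCE + "bash",
  "export API_KEY=abc123",
  FENCE,
  FENCE,
  "plain",
  FENCE,
  FENCE + "ts",
  "const x = 1;",
  FENCE,
].join("\n");

const blocks = extractBlocks(markdown);
// blocks.length                          -> 3
// uniqueLanguages(blocks)                -> ["bash", "ts"]
// blocks.map((b) => looksLikeSecret(b))  -> [true, false, false]
```

Pattern-based detection of this kind produces false positives and misses, so its result is best treated as a prompt for review rather than a verdict.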

Related Packages

@reaatech/agents-markdown: Core types and schemas
@reaatech/agents-markdown-validator: Schema validation engine
@reaatech/agents-markdown-linter: Linting rules engine

License

MIT
