rlm 1.0.6

.NET 10.0

dotnet tool install --global rlm --version 1.0.6

This package contains a .NET tool you can call from the shell/command line.

dotnet new tool-manifest
                    

                            if you are setting up this repo

dotnet tool install --local rlm --version 1.0.6

This package contains a .NET tool you can call from the shell/command line.

#tool dotnet:?package=rlm&version=1.0.6

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

nuke :add-package rlm --version 1.0.6

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

RLM CLI (Recursive Language Models Context Tool)

A .NET CLI tool for processing large documents that exceed LLM context windows, based on the Recursive Language Models paper. RLM implements a recursive data ingestion pattern, enabling streaming content decomposition, multi-turn processing, and result aggregation.

Key Features

Multi-format support - Markdown, PDF, HTML, JSON, Word (.docx), plain text
6 chunking strategies - Uniform, filtering, semantic, token-based, recursive, auto
Stateful sessions - Persistent session state for multi-turn processing
Claude Code support - RLM Agent and Skill

Projects in Solution

Project	Description	Framework
Rlm.Cli	Console application with 13 commands	.NET 10, AOT-compatible
Rlm.Cli.Tests	Unit tests	MSTest 4.0, Shouldly, NSubstitute

Quick Start

Install from NuGet

# Requires .NET 10+
dotnet tool install -g rlm

# Update to latest version
dotnet tool update -g rlm

Setup Claude Code Integration

To use RLM with Claude Code, copy the skills and agent files to your project:

# Clone the repository (or download the .claude folder)
git clone https://github.com/endjin/RLM.git

# Copy the .claude folder to your project
cp -r RLM/.claude /path/to/your/project/

This provides:

Skill (.claude/skills/rlm/) - Invoke with /rlm to process large documents
Agent (.claude/agents/rlm-worker/) - Parallel chunk processing worker for Claude Code

Build and Run

# Build
dotnet build Solutions/Rlm.slnx

# Run directly
dotnet run --project Solutions/Rlm.Cli -- load document.md

# Run tests
dotnet test --solution Solutions/Rlm.slnx

Install as Global Tool (from source)

cd Solutions/Rlm.Cli
dotnet pack
dotnet tool install --global --add-source ./bin/Release rlm

Basic Workflow

# Load a document
rlm load large-document.md

# Check document info
rlm info

# Chunk for processing
rlm chunk --strategy uniform --size 50000

# Process chunks iteratively
rlm next                          # Get next chunk
rlm store chunk_0 "result..."     # Store partial result
rlm next                          # Continue until done

# Aggregate results
rlm aggregate

Supported Formats

Format	Extensions	Features
Markdown	`.md`, `.markdown`	YAML frontmatter, code blocks, headers
PDF	`.pdf`	Text extraction, page count, title, author
HTML	`.html`, `.htm`	Converts to Markdown, preserves structure
JSON	`.json`	Pretty-printing, element count
Word	`.docx`	Heading preservation, paragraph extraction, document properties
Plain text	`.txt`, etc.	Basic text loading

Commands Reference

Command	Description	Example
`load <file>`	Load document into session	`rlm load corpus.txt`
`load <dir>`	Load directory of documents	`rlm load ./docs/`
`load -`	Load from stdin	`cat file.txt \\| rlm load -`
`info`	Show document metadata	`rlm info --progress`
`slice <range>`	View document section	`rlm slice 0:1000`
`chunk [opts]`	Apply chunking strategy	`rlm chunk --strategy semantic`
`filter <pattern>`	Filter by regex pattern	`rlm filter "email\\|@"`
`next`	Get next chunk	`rlm next --json`
`skip <count>`	Skip forward/backward	`rlm skip 10`
`jump <index>`	Jump to chunk index or %	`rlm jump 50%`
`store <key> <val>`	Store partial result	`rlm store chunk_0 "result"`
`import <pattern>`	Import external results	`rlm import "child-*.json"`
`results`	List stored results	`rlm results`
`aggregate`	Combine all results	`rlm aggregate --final`
`clear`	Reset session	`rlm clear`
`clear --all`	Reset all sessions	`rlm clear --all`

Recursive RLM & Parallel Processing

RLM supports recursive decomposition through session isolation and raw output modes.

Session Management

Use the global --session <id> flag to isolate state for parallel or recursive processes:

# Parent process
rlm load large.txt --session parent

# Child process (simulated recursion)
rlm load chunk_1.txt --session child_1
rlm chunk ... --session child_1

Scripting & Piping

Use --raw for pipe-friendly output without formatting:

# Get raw content of next chunk
rlm next --raw --session child_1 | process_script.sh

# Pipe content into store
echo "result" | rlm store chunk_1 - --session parent

Bulk Import

Import results from child sessions or external files:

# Import results directly from session files
rlm import "rlm-session-child_*.json" --session parent

# Import results from text files (key = filename)
rlm import "summaries/*.txt" --session parent

Chunking Strategies

Strategy	Use Case	Command
Uniform	Summarization, aggregation	`rlm chunk --strategy uniform --size 50000`
Filtering	Needle-in-haystack search	`rlm filter "pattern"`
Semantic	Document structure analysis	`rlm chunk --strategy semantic`
Token-based	Precise token budgeting	`rlm chunk --strategy token --max-tokens 512`
Recursive	Complex mixed documents	`rlm chunk --strategy recursive`
Auto	Query-based selection	`rlm chunk --strategy auto --query "find API key"`

Architecture Overview

Solutions/Rlm.Cli/
├── Commands/          # 13 CLI commands (load, chunk, next, etc.)
├── Core/
│   ├── Documents/     # Multi-format readers (PDF, HTML, Word, etc.)
│   ├── Chunking/      # 6 chunking strategies
│   ├── Validation/    # Input validation framework
│   ├── Processing/    # Chunk post-processors
│   ├── Output/        # JSON output models
│   └── Session/       # Persistent state management
└── Infrastructure/    # Session store, DI, JSON context

Dependencies

CLI Framework

Spectre.Console / Spectre.Console.Cli - Rich console output and command parsing
Spectre.IO - Testable file system operations

Document Processing

PdfPig - PDF text extraction
ReverseMarkdown - HTML to Markdown conversion
DocumentFormat.OpenXml - Word document processing

Tokenization

Microsoft.ML.Tokenizers - Accurate GPT-4 tokenization (cl100k_base)

Infrastructure

Microsoft.Extensions.DependencyInjection - DI container
Polly - Retry resilience for file operations

Documentation

Detailed documentation is available in .claude/skills/rlm/:

Document	Description
SKILL.md	Overview and workflow guide
reference.md	Technical architecture and JSON models
examples.md	Real-world workflow scenarios
strategies.md	Chunking strategies deep-dive
troubleshooting.md	Tips, errors, and edge cases

License

Apache-2.0

Product	Compatible and additional computed target framework versions.
.NET	net10.0 is compatible. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed.

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

This package has no dependencies.

Version	Downloads	Last Updated
1.0.6	105	1/31/2026
1.0.5	102	1/31/2026
1.0.4	92	1/30/2026
1.0.3	90	1/25/2026
1.0.2	91	1/23/2026