API · Crawler · CLI · Browser Extensions — used with LangChain, LlamaIndex & more

Turn web content into
LLM-ready Markdown.

Ingest documentation, crawl entire sites, and convert any HTML into clean, token-efficient Markdown. API, CLI, and browser extensions for your RAG pipelines and AI agents.

81%
Token Savings
avg. across 20 real-world pages
See benchmark ›
Free
500 credits/month
HTTPS
Always Encrypted
Quick Start
curl -X POST https://api.unweb.info/api/convert/url \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/article"}'
🛡️

No logs. No storage. Your HTML goes in, Markdown comes out.

All processing happens in-memory over HTTPS. Nothing is ever stored, logged, or analyzed. Your content stays yours.

Zero Storage HTTPS Only No Analytics

From HTML noise to pure signal

UnWeb doesn't just strip tags — it understands content structure and extracts only what matters.

page.html — 47,823 chars
<div class="wrapper layout-v3"> <nav class="sidebar">...</nav> <div class="ads-container"> <script src="tracker.js"></script> </div> <main role="main"> <article> <h1 class="title-xl"> Getting Started </h1> <p class="text-body"> Install the SDK with <code>npm i sdk</code> </p> </article> </main> <footer class="site-footer">...</footer> </div>
output.md — 8,942 chars
# Getting Started Install the SDK with `npm i sdk` Configure your project by creating a config.yaml in your root directory: ```yaml name: my-project version: 1.0.0 ``` ## Authentication Generate an API key from the [dashboard](/settings) and add it to your environment. ✓ No ads. No nav. No noise.
Token reduction on real pages:
81% fewer tokens measured with GPT-4 tokenizer (tiktoken)

Everything your AI pipeline needs

Built from the ground up for developer-first AI workflows.

🧠

Smart Content Extraction

Automatically detects main content using semantic HTML, content scoring, and paragraph density analysis. Strips nav, ads, footers, and sidebars.

API-First Architecture

RESTful endpoints for paste, upload, and URL fetch. Drop into LangChain, LlamaIndex, or any custom agent pipeline in minutes.

🔒

Content Never Stored

Your HTML and Markdown are processed in memory and returned — never stored on disk or sent to third parties. Fully encrypted over HTTPS.

📦

CommonMark Output

Strict CommonMark for universal compatibility. Predictable parsing, no GFM ambiguity. Clean output every time.

🔄

Multiple Input Methods

Paste raw HTML, upload files, or fetch directly from URLs. Batch process entire CMS exports or wiki dumps.

☁️

Hosted & Ready to Go

No infrastructure to manage. Call api.unweb.info and get Markdown back. Free tier included — no credit card required.

🛡️

Your content stays yours

UnWeb processes your content in memory and returns the result. Your HTML and Markdown are never stored or sent to third parties. Every request is encrypted over HTTPS. Content goes in, Markdown comes out, and that's it.

0
Content stored
HTTPS
Always encrypted
0
Third parties

Built for real AI workflows

From solo projects to enterprise agent systems, UnWeb fits anywhere clean content is needed.

01

Feed Confluence & Notion exports into LlamaIndex

Convert documentation dumps, wiki exports, and help center pages into clean chunks for your RAG pipeline. Works with LlamaIndex, LangChain, and any vector store.

02

Power n8n & CrewAI web research workflows

Give your AI agents clean markdown from any URL. No more parsing HTML in your agent loops. One API call, structured output.

03

Convert API docs & READMEs for GPT context windows

Strip navigation chrome, footers, and ads from documentation pages. Feed your LLM only the content that matters — save 80%+ on tokens.

04

Clean web input for AutoGPT & multi-agent pipelines

Standardize web content into a consistent markdown format across all your agents. No more per-site parsing logic.

LangChain LlamaIndex OpenAI n8n CrewAI AutoGPT

Start converting in under a minute

Create a free account, grab an API key, and start converting.

🔑

Get Your API Key

Create a free account at app.unweb.info and generate an API key from the dashboard.

📡

Send HTML

Call the API with your HTML — paste it, upload a file, or pass a URL. Use cURL, Python, JS, Go, or the CLI.

Get Markdown

Receive clean, semantic Markdown instantly. Feed it to your LLM, vector DB, or agent chain.

Credit-based pricing. Scale when ready.

Pay for what you use. Different operations cost different credits. Paid plans include overage billing so your pipelines never stop.

Free

$0
500 credits / month
  • 500 credits per month
  • Paste, upload & URL conversion
  • Smart content extraction
  • CommonMark output
  • Usage dashboard
Get Started Free

Pro

$39 / month
15,000 credits / month
  • Everything in Starter
  • 15,000 credits per month
  • Batch conversion & webhooks
  • Priority support
  • Overage: $0.004 / credit
Upgrade to Pro

Scale

$99 / month
60,000 credits / month
  • Everything in Pro
  • 60,000 credits per month
  • Higher crawl limits
  • Dedicated support
  • Overage: $0.002 / credit
Upgrade to Scale
Credit costs: Paste / Upload / URL = 1 credit · Crawl page = 2 credits
Paid plans include overage billing — your pipelines never stop.

Official SDKs, CLI, and browser extensions

Everything you need to integrate — pick your language, your platform, or your workflow.

Python

Python SDK

Full API coverage with sync and async clients. Python 3.9+.

pip install unweb
TypeScript

Node.js / TypeScript SDK

TypeScript-first, zero runtime dependencies. Node 18+.

npm i @mbsoftsystems/unweb
CLI

CLI

Cross-platform terminal tool. Windows, macOS, and Linux.

Download from GitHub Releases
Chrome

Chrome Extension

One-click webpage conversion from any tab.

Add to Chrome
Firefox

Firefox Extension

One-click webpage conversion from any tab.

Add to Firefox

MCP Server

Use UnWeb from Claude Code, Cursor, and Windsurf. Quality scoring & crawl.

npx @mbsoftsystems/unweb-mcp

Try UnWeb free — 500 credits on us

Create a free account, grab your API key, and start converting HTML to clean Markdown in under a minute. No credit card required.