v1.0.0

Markdown Converter

Peter Steinberger Peter Steinberger ← All skills

Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.

Downloads
5.3k
Stars
29
Versions
1
Updated
2026-02-23

Install

npx clawhub@latest install markdown-converter

Documentation

Markdown Converter

Convert files to Markdown using uvx markitdown — no installation required.

Basic Usage

Convert to stdout

uvx markitdown input.pdf

Save to file

uvx markitdown input.pdf -o output.md

uvx markitdown input.docx > output.md

From stdin

cat input.pdf | uvx markitdown

Supported Formats

  • -Documents: PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls)
  • -Web/Data: HTML, CSV, JSON, XML
  • -Media: Images (EXIF + OCR), Audio (EXIF + transcription)
  • -Other: ZIP (iterates contents), YouTube URLs, EPub

Options

-o OUTPUT      # Output file

-x EXTENSION # Hint file extension (for stdin)

-m MIME_TYPE # Hint MIME type

-c CHARSET # Hint charset (e.g., UTF-8)

-d # Use Azure Document Intelligence

-e ENDPOINT # Document Intelligence endpoint

--use-plugins # Enable 3rd-party plugins

--list-plugins # Show installed plugins

Examples

Convert Word document

uvx markitdown report.docx -o report.md

Convert Excel spreadsheet

uvx markitdown data.xlsx > data.md

Convert PowerPoint presentation

uvx markitdown slides.pptx -o slides.md

Convert with file type hint (for stdin)

cat document | uvx markitdown -x .pdf > output.md

Use Azure Document Intelligence for better PDF extraction

uvx markitdown scan.pdf -d -e "https://your-resource.cognitiveservices.azure.com/"

Notes

  • -Output preserves document structure: headings, tables, lists, links
  • -First run caches dependencies; subsequent runs are faster
  • -For complex PDFs with poor extraction, use -d with Azure Document Intelligence

Launch an agent with Markdown Converter on Termo.