Markete geri dön

pymupdf-pdf

Fast local PDF parsing with PyMuPDF for Markdown/JSON outputs and optional images/tables.

3,729indirme19yükleme19yıldız
v1.0.0
cmdopDevelopmentimages, json, markdown, pdf, pymupdf, tables3/2/2026

Overview

Fast local PDF parsing with PyMuPDF (fitz) for Markdown/JSON outputs and optional images/tables. Use when speed matters more than robustness, or as a fallback while heavier parsers are unavailable. Default to single-PDF parsing with per-document output folders.

Key Features

  • Fast local PDF parsing with PyMuPDF
  • Markdown/JSON outputs
  • Optional images/tables extraction

How It Works

Parse PDFs locally using PyMuPDF for fast, lightweight extraction into Markdown by default, with optional JSON and image/table outputs in a per-document directory.

Use Cases

  • Quickly extract text from PDFs for Markdown documentation
  • Extract images and tables from PDFs for further processing
  • Use as a fallback parser when heavier parsers are unavailable

Yorumlar

Henüz yorum yok.