A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
-
Updated
Jul 16, 2024 - Python
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
A curated list of resources for Document Understanding (DU) topic
MinerU is a one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
A C++17 PDF manipulation library
Python bindings to PDFium
Analyze PDFs. With colors. And Yara.
Malicious PDF files recently considered one of the most dangerous threats to the system security. The flexible code-bearing vector of the PDF format enables to attacker to carry out malicious code on the computer system for user exploitation.
Up-to-date Laravel documentation in PDF format (all versions)
Open Source PDF Document Management
wxPdfDocument - Generation of PDF documents from wxWidgets applications
A painless HTML to PDF rendering service. Generate PDF reports and documents from HTML templates or raw HTML.
A simple pdftotext conversion tool for Windows 8.1/10/11 and FEDORA/UBUNTU/DEBIAN/ARCH based linux distros using poppler-utils and Google's tesseract-ocr.
🔏 Sign PDFs with the portuguese citizen card (aka "cartão de cidadão")
CLI program for searching inside text and tables in PDF documents and displaying results in HTML.
BASH scripts for exploring information contents of pdf documents.
Use the Azure Key Vault API to sign a PDF document.
Utility for modifying page markings of PDF documents
Lightweight Helper classes based on iTextSharp for scaling and resizing Pdf Documents & Pages.
Add a description, image, and links to the pdf-documents topic page so that developers can more easily learn about it.
To associate your repository with the pdf-documents topic, visit your repo's landing page and select "manage topics."