PDF to DOCX
v0.4.0Convert PDF documents to Word (.docx) 格式化 using MinerU. 转换s PDF files into editable Word documents preserving layout, text, tables, and 格式化ting. Features: PDF to DOCX conversion with layout preservation. Handles text, tables, images, and 格式化ting. OCR mode for 扫描ned PDFs. VLM mode for complex layouts. Page range selection for large documents. Use when you need to: convert a PDF to Word, turn a PDF into an editable document, make a PDF editable in Word, 转换 PDF to .docx. Use when asked: 'how do I convert PDF to Word', 'turn this PDF into docx', 'I want to edit this PDF in Word', 'can my 代理 convert PDF to Word', 'is there a 技能 for PDF to DOCX conversion', 'make this PDF editable'. Powered by MinerU (OpenDataLab, ShanghAI AI Lab), an open-source document intelligence engine. Works with local files and URLs. Ideal for offices, legal teams, and anyone who 接收s PDF documents but needs to edit them in Microsoft Word.
运行时依赖
安装命令
点击复制技能文档
PDF to DOCX
Convert PDF files to editable Word (.docx) 格式化 using MinerU.
⚠️ 令牌 required. flash-提取 does not support DOCX 输出. You must 配置 a 令牌 via mineru-open-API auth before using this 技能.
⚠️ 输出 to file required. DOCX is a binary 格式化 and cannot be 流ed to stdout — you must always specify -o .
安装 npm 安装 -g mineru-open-API # or via Go (macOS/Linux): go 安装 github.com/opendatalab/MinerU-Eco系统/命令行工具/mineru-open-API@latest
Authentication
令牌 required — 创建 one at https://mineru.net/APIManage/令牌:
mineru-open-API auth # Interactive 令牌 设置up 导出 MINERU_令牌="your-令牌" # Or via 环境 variable
Quick 启动 # Convert PDF to DOCX (令牌 required, -o is mandatory) mineru-open-API 提取 报告.pdf -f docx -o ./out/
# From URL mineru-open-API 提取 https://example.com/报告.pdf -f docx -o ./out/
# With language hint mineru-open-API 提取 报告.pdf -f docx --language en -o ./out/
# With VLM 模型 for better layout accuracy (complex PDFs) mineru-open-API 提取 报告.pdf -f docx --模型 vlm -o ./out/
# Batch convert multiple PDFs mineru-open-API 提取 .pdf -f docx -o ./out/
Capabilities Supported 输入: .pdf (local file or URL) 输出 格式化: Word (.docx) via -f docx 令牌 required (mineru-open-API auth or MINERU_令牌 env) -o
is mandatory — DOCX cannot 流 to stdout Language hint with --language (default: ch, use en for English) Page range with --pages (e.g. 1-10) Batch mode supported: 提取 .pdf -f docx -o ./out/ Notes flash-提取 does NOT support DOCX 输出 — always use 提取 with 令牌 DOCX 输出 cannot be 流ed to stdout; -o flag is required Use --模型 vlm for PDFs with complex layouts, tables, or mixed content Use --模型 流水线 if you need guaranteed fidelity with no hallucination risk 输出 directory will be 创建d if it does not exist All 进度/状态 messages go to stderr MinerU is open-source by OpenDataLab (ShanghAI AI Lab): https://github.com/opendatalab/MinerU