pip install magic-pdf
cp magic-pdf.template.json ~/magic-pdf.json
magic-pdf pdf-command --pdf "pdf_path" --model "model_json_path"
Images, tables, titles, text, and so on; inline formulas and display formulas.
conda create -n pdfpipeline python=3.10
git clone https://github.com/opendatalab/PDF-Extract-Kit.git
pip3 install -r requirements+cpu.txt
pip install https://github.com/opendatalab/PDF-Extract-Kit/raw/main/assets/whl/detectron2-0.6-cp310-cp310-macosx_11_0_arm64.whl
PDF-Extract-Kit/configs/model_configs.yaml:2
PDF-Extract-Kit/modules/layoutlmv3/layoutlmv3_base_inference.yaml:72
git lfs clone https://huggingface.co/wanderkid/PDF-Extract-Kit
python pdf_extract.py --pdf data/pdfs/ocr_1.pdf
File "/Users/linyu/ai/PDF-Extract-Kit/modules/layoutlmv3/layoutlmft/models/layoutlmv3/modeling_layoutlmv3.py", line 71, in forward
position_embedding = F.interpolate(position_embedding, size=(Hp, Wp), mode='bicubic')
NotImplementedError: The operator 'aten::upsample_bicubic2d.out' is not currently implemented for the MPS device. If you want this op to be added in priority during the prototype phase of this feature, please comment on https://github.com/pytorch/pytorch/issues/77764. As a temporary fix, you can set the environment variable `PYTORCH_ENABLE_MPS_FALLBACK=1` to use the CPU as a fallback for this op. WARNING: this will be slower than running natively on MPS.
export PYTORCH_ENABLE_MPS_FALLBACK=1
Alternatively, add the following at the top of the Python script you run, before torch is imported:
import os
os.environ['PYTORCH_ENABLE_MPS_FALLBACK'] = '1'
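A third option, sketched here as an illustration and not taken from either project, is to leave the shell and the script untouched and launch the script from a small wrapper that sets the variable only for the child process. The helper names `fallback_env` and `run_extract` are hypothetical; the script name and the `--pdf` flag are the ones used in the command above.

```python
import os
import subprocess
import sys


def fallback_env(base=None):
    """Return a copy of the environment with the MPS CPU fallback enabled.

    `base` defaults to the current process environment; the original
    mapping is never modified.
    """
    env = dict(os.environ if base is None else base)
    env["PYTORCH_ENABLE_MPS_FALLBACK"] = "1"
    return env


def run_extract(pdf_path, script="pdf_extract.py"):
    """Run the extraction script in a child process with the fallback set.

    Assumes the current working directory is the PDF-Extract-Kit checkout.
    Returns the child's exit code.
    """
    cmd = [sys.executable, script, "--pdf", pdf_path]
    return subprocess.run(cmd, env=fallback_env()).returncode


# Example call (not executed here):
# run_extract("data/pdfs/ocr_1.pdf")
```

Because the variable is set in the child's environment before its interpreter starts, it is in place before torch is imported there, and the parent shell is left unchanged.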
RuntimeError: Expected repeatBuffer && cumsumBuffer && resultBuffer to be true, but got false.
magic-pdf pdf-command --pdf "/Users/linyu/ai/pdf储能政策知识库/《“十四五”能源领域科技创新规划》2021.pdf" --model ""