Skip to content

增加PDF采样率配置参数 pdf转图片dpi为72*scale#17

Open
wkjobs wants to merge 8 commits intodataelement:mainfrom
wkjobs:main
Open

增加PDF采样率配置参数 pdf转图片dpi为72*scale#17
wkjobs wants to merge 8 commits intodataelement:mainfrom
wkjobs:main

Conversation

@wkjobs
Copy link

@wkjobs wkjobs commented Jul 3, 2024

模糊的pdf经过bisheng-unstructured处理,OCR出来的字会有错误,提高转换pdf成图片的dpi,提升模糊PDF识别的准确率

wkjobs added 8 commits July 2, 2024 17:20
# Conflicts:
#	src/bisheng_unstructured/api/pipeline.py
#	src/bisheng_unstructured/documents/pdf_parser/pdf.py
# Conflicts:
#	src/bisheng_unstructured/api/pipeline.py
#	src/bisheng_unstructured/api/types.py
#	src/bisheng_unstructured/documents/pdf_parser/pdf.py
#	tests/test_image.py
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant