Skip to content

Releases: ArtifexSoftware/pdf2docx

v0.4.5

05 Oct 05:32
Compare
Choose a tag to compare

fixed reproducing images issue:

  • images in CMYK color-space
  • transparent images

v0.4.4

29 Sep 16:09
Compare
Choose a tag to compare

detect vector graphic and convert to bitmap

v0.4.3

20 Sep 12:48
Compare
Choose a tag to compare

support paragraph alignment

v0.4.2

12 Sep 16:29
Compare
Choose a tag to compare
  • correct block bbox due to unnamed font in PDF
  • set borders style for each cell

v0.4.1

05 Sep 16:22
Compare
Choose a tag to compare
  • align stream table borders
  • consider explicit border in stream table
  • consider page rotation
  • parsing pages with multi-processing

v0.4.0

24 Aug 06:18
Compare
Choose a tag to compare

new feature: nested table (support 1 layer only)

v0.3.5

17 Aug 13:27
Compare
Choose a tag to compare

enhancement: text format parsing

  • consider text shading instead if out of default highlight colors
  • font name localization: Chinese
  • do not adjust spacing between Chinese and Latin/number

v0.3.4

14 Aug 09:22
Compare
Choose a tag to compare

fixed issue: operator name conflicts with text when extracting path

v0.3.3

12 Aug 16:50
Compare
Choose a tag to compare
  • extract annotations over multi-lines precisely
  • fix docx.image.exceptions.UnrecognizedImageError
  • set font size precisely by setting character scaling
  • CS transformation between PDF and PyMuPDF
  • consider cell shading for implicit table
  • fixed issue on empty page, text style

v0.3.2

04 Aug 14:50
7704ceb
Compare
Choose a tag to compare

parse and recreate text blocks in vertical direction