-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathChangeLog
80 lines (51 loc) · 3.03 KB
/
ChangeLog
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
2010 November 7 (Alexey Kryukov) Version 1.0.0
* Initial release
2010 November 12 (Alexey Kryukov) Version 1.0.1
* JBIG2 support didn't work properly under MS Windows.
2010 November 15 (Alexey Kryukov) Version 1.0.3
+ Make it possible to specify binarization threshold for mixed scans
(it is still not recommended to perform segmentation with pdfbeads,
unless the image has been previously processed with ScanTailor, so
the normal value for this parameter is 1).
* If a multipage TIFF file is passed to pdfbeads, warn user that
only the first image will be processed.
* TIFF files with an embedded EXIF block were incorrectly processed
by ImageInspector.
2012 February 4 (Alexey Kryukov) Version 1.0.5
* If JPEG2000 compression was requested but not available,
JPEG files were still written with the 'JP2' extension.
* Some tweaks to minimize the effect of page labels being inconsistently handled
in various PDF viewers (prefer ISO-8859-1 strings if possible).
2012 February 5 (Alexey Kryukov) Version 1.0.6
+ Add Greek letters (the monotonic set) to the list of characters with hardcoded
glyph names and width.
* Minor bugs fixed.
2012 February 10 (Alexey Kryukov) Version 1.0.7
+ An attempt to achive better positioning of the hidden text layer, taking into
account not just lines, but also individual words. This should work with hOCR
files produced with Cuneiform or Tesseract.
2012 March 5 (Alexey Kryukov) Version 1.0.8
* Palette colors in PNG images were treated as signed chars and this could cause
indexed images to be incorrecty displayed in the resulting PDF.
2012 April 22 (Alexey Kryukov) Version 1.0.9
+ Add an option allowing to delete image files produced as an intermediate stage
during the PDF creation process.
* Processing indexed images with a small number of colors was broken.
* Don't attempt to use 'ocrx_word' elements which contain no bounding box
data (this should fix the problem with the hOCR output produced by some
tesseract versions).
2013 Mar 20 (Alexey Kryukov) Version 1.1.0
+ It is now possible to take the text layer from another PDF document (normally
this would be a file produced by passing the same set of images to an
OCR application) and embed it into the pdfbeads output. Warning: this feature
has been tested so far only with files produced with ABBYY FineReader. It may or
may not work with PDF files generated by other OCR programs.
* The default PDF page layout is now "OneColumn".
+ Make it possible to specify that the preferred reading direction for the
PDF document is left-to-right.
+ In order to simplify debugging of resulting files I have added a special
flag allowing to make the hidden text layer visible and to disable
compression in page streams.
2014 Jan 26 (Alexey Kryukov) Version 1.1.1
* hpricot is no longer developed, so switch to Nokagiri for hOCR processing.
+ English HTML documentation added.