You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Upon reviewing the default Tika Java configuration, the internal default value when creating a PDFParserConfig object is extract_unique_inline_images_only = True.
Therefore, I believe the current default value of extract_unique_inline_images_only being set to false is a bug and should be true.
Greetings!
The text was updated successfully, but these errors were encountered:
Currently, the default value of "extract_unique_inline_images_only" is set to "false":
extractous/extractous-core/src/config.rs
Line 31 in 818d94c
In binding-python, there is a comment stating that this default value should be "true":
extractous/bindings/extractous-python/src/config.rs
Line 80 in 818d94c
In core, the same comment states that this default value is "false":
extractous/extractous-core/src/config.rs
Line 74 in 818d94c
Upon reviewing the default Tika Java configuration, the internal default value when creating a PDFParserConfig object is extract_unique_inline_images_only = True.

Therefore, I believe the current default value of extract_unique_inline_images_only being set to false is a bug and should be true.
Greetings!
The text was updated successfully, but these errors were encountered: