Change output format #7

Open
Tushar-Mehndiratta opened this issue Jul 19, 2020 · 4 comments

Comments

@Tushar-Mehndiratta

Can we change the output of "extract_only_content('File_Name')"?

The code I used works fine:

from tikapp import TikaApp

client = TikaApp('File_Path of Tika-app.jar file')
content = client.extract_only_content("File_Name")
print(content)

The content is a plain text string. Can we get the output as an HTML string instead (with tags and all)?

@Tushar-Mehndiratta
Author

@fedelemantuano What is the solution?

@fedelemantuano
Owner

Hi @Tushar-Mehndiratta, yes, it's possible. I would need to add a method that calls Tika with the -h (or --html) option, which outputs HTML content.
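
Until such a method exists, one possible workaround is to call the Tika app jar directly with the --html option mentioned above. The following is only a minimal sketch, not part of tikapp: the helper names build_html_command and extract_html are hypothetical, and it assumes java is on the PATH and that the jar writes the HTML to standard output.

```python
import subprocess


def build_html_command(jar_path, file_path, java="java"):
    """Build the argument list that runs the Tika app jar with HTML output.

    Hypothetical helper: it only assembles the command, it does not run it.
    """
    return [java, "-jar", jar_path, "--html", file_path]


def extract_html(jar_path, file_path):
    """Run the Tika app jar and return whatever it prints, decoded as text.

    Assumes `java` is on the PATH and that HTML is written to stdout.
    Raises subprocess.CalledProcessError if the process exits with an error.
    """
    result = subprocess.run(
        build_html_command(jar_path, file_path),
        capture_output=True,
        check=True,
    )
    # The output encoding depends on the platform; undecodable bytes are replaced.
    return result.stdout.decode("utf-8", errors="replace")


# Example usage (placeholders, as in the original question):
# html = extract_html("File_Path of Tika-app.jar file", "File_Name")
# print(html)
```

Keeping the command construction separate from the subprocess call makes it easy to inspect the exact arguments before anything is executed.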

@Tushar-Mehndiratta
Author

OK, great, I would love to see this feature soon! Will you be working on it in the coming days, or later?

@Tushar-Mehndiratta
Author

?


2 participants