Tika-Similarity uses the Tika-Python package (a Python port of Apache Tika) to compute file similarity based on metadata features.
Topics: python, machine-learning, information-retrieval, clustering, tika, cosine-similarity, jaccard-similarity, cosine-distance, similarity-score, tika-similarity, metadata-features, tika-python
Updated Mar 26, 2024 - Python
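
As a rough illustration of what metadata-based similarity can look like, here is a minimal sketch of Jaccard and cosine scores over two metadata dictionaries. It is not the repository's actual code: the function names, the sample dictionaries, and the choice of value length as a numeric feature are all assumptions made for this example. In practice the dictionaries might come from Tika-Python, for instance `parser.from_file(path)["metadata"]`.

```python
import math


def jaccard_similarity(meta_a, meta_b):
    """Jaccard score over the sets of metadata keys present in each file."""
    keys_a, keys_b = set(meta_a), set(meta_b)
    union = keys_a | keys_b
    if not union:
        return 1.0  # two empty key sets are treated as identical
    return len(keys_a & keys_b) / len(union)


def cosine_similarity(meta_a, meta_b):
    """Cosine score over per-key feature vectors.

    Assumption for this sketch: each key's feature is the length of its
    string value, and a missing key contributes 0.
    """
    keys = sorted(set(meta_a) | set(meta_b))
    vec_a = [len(str(meta_a[k])) if k in meta_a else 0 for k in keys]
    vec_b = [len(str(meta_b[k])) if k in meta_b else 0 for k in keys]
    dot = sum(x * y for x, y in zip(vec_a, vec_b))
    norm_a = math.sqrt(sum(x * x for x in vec_a))
    norm_b = math.sqrt(sum(y * y for y in vec_b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # no usable features on at least one side
    return dot / (norm_a * norm_b)


# Hypothetical metadata, shaped like the key/value pairs Tika extracts.
doc_a = {"Content-Type": "application/pdf", "Author": "A. Writer", "Creation-Date": "2024-03-26"}
doc_b = {"Content-Type": "application/pdf", "Author": "B. Editor", "producer": "LaTeX"}

print("Jaccard:", jaccard_similarity(doc_a, doc_b))
print("Cosine:", round(cosine_similarity(doc_a, doc_b), 3))
```

The Jaccard score only looks at which metadata keys are present, while the cosine score also reflects the values through whatever numeric feature is chosen, so the two can rank the same pair of files differently.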