File tree 1 file changed +10
-1
lines changed
1 file changed +10
-1
lines changed Original file line number Diff line number Diff line change @@ -48,30 +48,39 @@ https://archive.ics.uci.edu/ml/datasets/reuters-21578+text+categorization+collec
48
48
and should be extracted in the /data directory
49
49
50
50
K shingles input
51
+
51
52
![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/kshingles.jpg )
52
53
53
54
3 shingles output
55
+
54
56
![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/shingles.PNG )
55
57
56
58
Minhashing input
59
+
57
60
![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/hashfunctions.jpg )
58
61
59
62
Minhashing output
63
+
60
64
![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/minhashing.jpg )
61
65
62
66
LSH input
67
+
63
68
![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/lsh.jpg )
64
69
65
70
Shingle similarity
71
+
66
72
![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/jaccard%20sim.jpg )
67
73
68
74
Signature similarity
75
+
69
76
![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/shingle%20sim.jpg )
70
77
71
78
LSH similarity
79
+
72
80
![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/lsh%20sim.jpg )
73
81
74
82
Time consumption
75
- ![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/time.png )
83
+
84
+ ![ alt text] ( https://github.com/evagian/Document-similarity-K-shingles-minhashing-LSH-python/blob/master/data/doc/time.jpg )
76
85
77
86
You can’t perform that action at this time.
0 commit comments