New types of tree: ConcurrentPermutermTree, ConcurrentWildcardTree for wildcard queries #3

GoogleCodeExporter · 2015-07-03T06:43:38Z

It would be useful to support wildcard queries.

Two approaches to be investigated (both of which will be tracked in this issue):

(1) A permuterm index on top of the ConcurrentRadixTree. This would support 
queries such as "<prefix>*<suffix>" on a single tree. It may be more memory 
efficient than a hash-dictionary approach. See: 
http://nlp.stanford.edu/IR-book/html/htmledition/permuterm-indexes-1.html

(2) A composite of a ConcurrentRadixTree and a ConcurrentReversedRadixTree. One 
tree would support prefix lookup, the other suffix lookup. A query for 
"prefix*suffix" would return the intersection of the results from both trees, 
after some post-filtering. This second approach, however, is close to being a 
query engine on top of multiple indexes, so if implemented it would belong not 
in this project but in http://code.google.com/p/cqengine/
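As a rough illustration of approach (2), here is a minimal sketch that uses two java.util.TreeSet instances as stand-ins for the forward and reversed radix trees. The class name PrefixSuffixSketch and its methods are hypothetical and are not part of this project. The post-filtering step is assumed to be a length check that rejects keys where the prefix and suffix would overlap (for example, the key "T" for the query "T*T").

```java
import java.util.TreeSet;

// Hypothetical stand-in for a composite of a prefix tree and a reversed (suffix) tree.
// Sorted sets are used here only to keep the sketch self-contained.
class PrefixSuffixSketch {
    private final TreeSet<String> forward = new TreeSet<>();   // keys as given
    private final TreeSet<String> reversed = new TreeSet<>();  // keys reversed

    public void put(String key) {
        forward.add(key);
        reversed.add(reverse(key));
    }

    // Returns keys matching prefix*suffix, in sorted order.
    public TreeSet<String> getKeysMatching(String prefix, String suffix) {
        // Prefix lookup on the forward index.
        TreeSet<String> result = startingWith(forward, prefix);
        // Suffix lookup is a prefix lookup on the reversed index.
        TreeSet<String> bySuffix = new TreeSet<>();
        for (String r : startingWith(reversed, reverse(suffix))) {
            bySuffix.add(reverse(r));
        }
        // Intersect the two result sets.
        result.retainAll(bySuffix);
        // Post-filter: the key must be long enough to hold both parts without overlap.
        int minLength = prefix.length() + suffix.length();
        result.removeIf(k -> k.length() < minLength);
        return result;
    }

    // Collects every element of the sorted set that starts with the given prefix.
    private static TreeSet<String> startingWith(TreeSet<String> set, String prefix) {
        TreeSet<String> out = new TreeSet<>();
        for (String s : set.tailSet(prefix, true)) {
            if (!s.startsWith(prefix)) {
                break; // sorted order: no later element can match
            }
            out.add(s);
        }
        return out;
    }

    private static String reverse(String s) {
        return new StringBuilder(s).reverse().toString();
    }
}
```

Because both lookups may return large candidate sets before the intersection, this approach does more work per query than a single-tree lookup, which is part of why it resembles a query engine over multiple indexes.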

Example usage for (1) would be:

public static void main(String[] args) {
    PermutermTree<Integer> tree = new ConcurrentPermutermTree<Integer>(new DefaultCharArrayNodeFactory());

    tree.put("TEST", 1);
    tree.put("TOAST", 2);
    tree.put("TEAM", 3);

    System.out.println("Keys matching 'T*T': " + Iterables.toString(tree.getKeysMatching("T", "T"))); // prefix, suffix
}


Output would be:
    Keys matching 'T*T': [TOAST, TEST]
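
The lookup behind a method like getKeysMatching could work roughly as follows. This is a minimal sketch of the permuterm technique only, using a java.util.TreeMap in place of the ConcurrentRadixTree; the class PermutermSketch is hypothetical, it stores keys without values, and it assumes that keys never contain the terminator character '$'. Every rotation of "key$" is indexed, and the query prefix*suffix becomes a single prefix lookup for "suffix$prefix".

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical sketch of a permuterm index; a sorted map stands in for a radix tree.
class PermutermSketch {
    private static final char END = '$'; // assumption: never appears in a key

    // Maps each rotation of "key$" back to the original key.
    private final TreeMap<String, String> rotations = new TreeMap<>();

    public void put(String key) {
        if (key.indexOf(END) >= 0) {
            throw new IllegalArgumentException("Key must not contain '" + END + "'");
        }
        String terminated = key + END;
        for (int i = 0; i < terminated.length(); i++) {
            String rotation = terminated.substring(i) + terminated.substring(0, i);
            rotations.put(rotation, key);
        }
    }

    // Returns keys matching prefix*suffix, in sorted order.
    public List<String> getKeysMatching(String prefix, String suffix) {
        // Rotating "prefix*suffix$" so the wildcard is last gives "suffix$prefix*".
        String probe = suffix + END + prefix;
        TreeSet<String> matches = new TreeSet<>();
        for (Map.Entry<String, String> e : rotations.tailMap(probe, true).entrySet()) {
            if (!e.getKey().startsWith(probe)) {
                break; // sorted order: no later rotation can match
            }
            matches.add(e.getValue());
        }
        return new ArrayList<>(matches);
    }

    public static void main(String[] args) {
        PermutermSketch index = new PermutermSketch();
        index.put("TEST");
        index.put("TOAST");
        index.put("TEAM");
        // Prints: Keys matching 'T*T': [TEST, TOAST]
        System.out.println("Keys matching 'T*T': " + index.getKeysMatching("T", "T"));
    }
}
```

Note the cost: a key of length n contributes n + 1 rotations, so the index holds several entries per key. A radix tree can share common prefixes among those rotations, but the sketch above does not attempt to measure that.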

Original issue reported on code.google.com by [email protected] on 24 Mar 2013 at 10:19

GoogleCodeExporter added the Priority-Medium, auto-migrated and Type-Enhancement labels on Jul 3, 2015