GitHub - MaksDanylenko/anargams: Two words are anagrams if they share the same letters in a different order (e.g. the English words "race" and "care" are anagrams)

Performance:

Filtering the input dataset. My solution works only with a set of unique words. English contains on the order of 1,000,000 words, so even if you upload 100 billion words, at most about 1,000,000 of them will be unique.
Processing the set with parallelStream().
To decide whether two words are anagrams, the program compares their sorted char arrays.
Storing anagrams in a map. The key is the sorted word and the value is the collection of its anagrams. I used ConcurrentHashMap for the map and Collections.synchronizedList(new ArrayList<>()) for each collection, because the program is multithreaded and needs thread-safe implementations.
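
The steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the class name `AnagramSketch` and the methods `sortedKey` and `findAnagrams` are hypothetical, and the sketch assumes that groups containing a single word are not reported as anagrams.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.stream.Collectors;

// Hypothetical sketch: names are illustrative, not taken from the repository.
public class AnagramSketch {

    // Canonical key: the word's characters in sorted order,
    // so "race" and "care" both map to "acer".
    public static String sortedKey(String word) {
        char[] chars = word.toCharArray();
        Arrays.sort(chars);
        return new String(chars);
    }

    // Groups a set of unique words by sorted key, using a parallel stream.
    public static List<List<String>> findAnagrams(Set<String> uniqueWords) {
        Map<String, List<String>> groups = new ConcurrentHashMap<>();
        uniqueWords.parallelStream().forEach(word ->
                groups.computeIfAbsent(sortedKey(word),
                        key -> Collections.synchronizedList(new ArrayList<>()))
                      .add(word));
        // Assumption: a key with only one word has no anagram partner, so drop it.
        return groups.values().stream()
                .filter(group -> group.size() > 1)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        Set<String> words = new HashSet<>(
                Arrays.asList("race", "care", "acre", "listen", "silent", "java"));
        // Group order and order within each group may vary between runs.
        System.out.println(findAnagrams(words));
    }
}
```

`computeIfAbsent` on a `ConcurrentHashMap` is atomic, so two threads that see the same key for the first time still end up sharing one list; the synchronized list then makes the concurrent `add` calls safe.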

Scalability:

You can run my code in a multi-node environment and feed each node a part of the data.
My Anagram class takes a set of unique words and returns a list of anagram collections.
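
If the data is split across nodes, the partial results still need to be combined, because two anagrams may land on different nodes. The sketch below shows one way this might be done. It assumes (hypothetically, this is not confirmed by the repository) that each node returns its partial map from sorted key to words before single-word groups are dropped; `PartialMergeSketch` and `merge` are illustrative names.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch of a merge step for per-node partial results.
public class PartialMergeSketch {

    // Combines partial maps (sorted key -> words) from several nodes.
    public static Map<String, Set<String>> merge(List<Map<String, List<String>>> partials) {
        Map<String, Set<String>> merged = new HashMap<>();
        for (Map<String, List<String>> partial : partials) {
            for (Map.Entry<String, List<String>> entry : partial.entrySet()) {
                // A set removes words that were seen on more than one node.
                merged.computeIfAbsent(entry.getKey(), key -> new TreeSet<>())
                      .addAll(entry.getValue());
            }
        }
        // Only after merging is it safe to drop keys with a single word.
        merged.values().removeIf(group -> group.size() < 2);
        return merged;
    }

    public static void main(String[] args) {
        Map<String, List<String>> nodeA = new HashMap<>();
        nodeA.put("acer", new ArrayList<>(Arrays.asList("race")));
        nodeA.put("aajv", new ArrayList<>(Arrays.asList("java")));

        Map<String, List<String>> nodeB = new HashMap<>();
        nodeB.put("acer", new ArrayList<>(Arrays.asList("care")));

        System.out.println(merge(Arrays.asList(nodeA, nodeB))); // prints {acer=[care, race]}
    }
}
```

Filtering out single-word groups on each node before merging would lose pairs such as "race" on one node and "care" on another, which is why the filter runs last here.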

Maintainability: I think my code is easy to maintain, extend and support. Each class has a single responsibility.
