Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide multiprocessing option for large classification jobs #59

Open
GoogleCodeExporter opened this issue Mar 31, 2015 · 0 comments
Open

Comments

@GoogleCodeExporter
Copy link

Large datasets (e.g., agemap H & E stain ~40,000 samples) take an extremely 
long time in Pychrm to do train/test split and classify operations. Euclidean 
distances can be calculated in a parallellized way, e.g., one processor can to 
all the samples from a given class.

This would entail exposing the samples in the FeatureSet.data_matrix to C++. A 
C++ implemented, Python-wrapped wndchrm classify option would also speed up 
computation.


Original issue reported on code.google.com by [email protected] on 18 Jan 2013 at 9:33

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant