Skip to content

Latest commit

 

History

History
35 lines (29 loc) · 1.14 KB

File metadata and controls

35 lines (29 loc) · 1.14 KB

article
spark-ml-document-classification

example for document classification

License Travis Build

Summary

An example on using spark ML models to classify docuements

Sample DataSet

Sample News article data is from by www.theguardian.com.

This example uses a very small dataset extracted from few articles. Each new line contains article data consisting of following data format:

{ 
    "bodyText": "..", 
    "webPublicationDate": "25-01-1999", 
    "topics": ["media"] 
}