Repository for ADM Homework 4. It contains:
- main.ipynb: main notebook;
- mapreduce_keymeans.ipynb: Spark notebook containing the implementation of KMeans via MapReduce;
- CommandLine.sh: Bash script with the answer to the command line question;
- files: folder with some additional files, in particular:
  - aux_3.txt: auxiliary file for question 3 of the command line;
  - mapreduce_keymeans.csv: output of the MapReduce implementation, obtained on EMR;
  - script.py: script with some functions used for the algorithmic question tests.
- img: folder containing the image of the command line results.