Random Projection Hash For Scalable Data Clustering for the MapReduce Programming Model
Software Accompaniment of my current dissertation proposal work found here: https://github.com/leecarraher/nsf_proposal
run.sh builds and runs the RPHash Algorithm on random gaussian clusters of varying dimension.