Skip to content

Spark Data Mining assignments for my Data Mining class at USC

Notifications You must be signed in to change notification settings

nealsonS/SparkDataMining

Repository files navigation

Data Mining with Spark

  • This repo details the data mining algorithms I did in my "Data Mining with Spark" class.
  • All the algorithms are written with distributed processing in mind.
  • Mainly written in Spark's Python SDK: PySpark
  • Most algorithms here manipulates Spark RDDs not Spark DataFrames
    • With some exceptions using GraphFrames and Spark SQL

About

Spark Data Mining assignments for my Data Mining class at USC

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages