(https://travis-ci.org/holdenk/sparklingpandas)

SparklingPandas

SparklingPandas aims to make it easy to use the distributed computing power of PySpark to scale your data anlysis with Pandas.

Documentation

None (right now). You can find some slides in the slides/ directory

Requirements

The primary requirement of SparklingPandas is that you have a recent (v1.0-SNAPSHOT currently) version of Spark installed - http://spark.apache.org

Using

Make sure you have the SPARK_HOME enviroment variable set correctly, as SparklingPandas uses this for including the PySpark libraries

Other than that you can install SparklingPandas with pip and just import it. The primary unit of SparklingPandas is a PRDD (Pandas Resillent Distributed Data Set)

State

This is in early development and should not be considered usable.

Support

There isn't really a mailing list, but if you want to use please feel free to e-mail me ( [email protected] ) with any questions.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
img		img
sparklingpandas		sparklingpandas
.env		.env
.gitignore		.gitignore
.travis.yml		.travis.yml
CHANGES.txt		CHANGES.txt
LICENSE.txt		LICENSE.txt
MANIFSET.in		MANIFSET.in
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

(https://travis-ci.org/holdenk/sparklingpandas)

SparklingPandas

Documentation

Requirements

Using

State

Support

About

Uh oh!

Releases

Packages

License

dyzsasd/sparklingpandas

Folders and files

Latest commit

History

Repository files navigation

(https://travis-ci.org/holdenk/sparklingpandas)

SparklingPandas

Documentation

Requirements

Using

State

Support

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages