An abstraction layer for distributed computation

racranjan/fugue

 
 


Fugue


Join Fugue-Project on Slack

Fugue is a pure abstraction layer that adapts to different computing frameworks such as Spark and Dask. It aims to unify the core concepts of distributed computing and to help you decouple your logic from any specific computing framework.
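To make the idea of decoupling logic from the framework concrete, here is a minimal conceptual sketch in plain Python. It is not Fugue's API: `LocalEngine`, `ThreadedEngine`, `run`, and `add_total` are hypothetical names invented for this illustration, and a thread pool merely stands in for a real backend such as Spark or Dask. The point it tries to show is that the logic function stays the same while the engine is swapped.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List

Row = Dict[str, int]


def add_total(row: Row) -> Row:
    """Business logic: it knows nothing about how or where it runs."""
    return {**row, "total": row["a"] + row["b"]}


class LocalEngine:
    """Hypothetical engine that applies the logic sequentially."""

    def map(self, fn: Callable[[Row], Row], rows: List[Row]) -> List[Row]:
        return [fn(r) for r in rows]


class ThreadedEngine:
    """Hypothetical engine that applies the same logic with a thread pool."""

    def __init__(self, workers: int = 4) -> None:
        self.workers = workers

    def map(self, fn: Callable[[Row], Row], rows: List[Row]) -> List[Row]:
        with ThreadPoolExecutor(max_workers=self.workers) as pool:
            # Executor.map returns results in input order.
            return list(pool.map(fn, rows))


def run(fn: Callable[[Row], Row], rows: List[Row], engine) -> List[Row]:
    """The only place that touches an engine; the logic is passed in as-is."""
    return engine.map(fn, rows)


rows = [{"a": 1, "b": 2}, {"a": 3, "b": 4}]
local_result = run(add_total, rows, LocalEngine())
threaded_result = run(add_total, rows, ThreadedEngine())
```

Both calls run the unchanged `add_total` function; only the engine argument differs, which is the kind of separation the paragraph above describes.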

Installation

pip install fugue

Fugue has these extras:

For example, a common use case is:

pip install fugue[sql,spark]

Docs and Tutorials

To read the complete static docs, click here

The best way to start is to go through the tutorials, which are provided in an interactive notebook environment.

Run the tutorial using Binder:

Binder

However, the tutorial runs slowly on Binder because its machines aren't powerful enough for a distributed framework such as Spark. Parallel executions can become sequential there, so some of the performance comparison examples will not give you representative numbers.

Run the tutorial using Docker

Alternatively, you should get decent performance by running the tutorial's Docker image on your own machine:

docker run -p 8888:8888 fugueproject/tutorials:latest
