Skip to content

pwais/oarphpy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

6cf85cc · Apr 1, 2023

History

28 Commits
Mar 6, 2023
Apr 1, 2023
Jul 25, 2020
Mar 6, 2023
Mar 6, 2023
Mar 6, 2023
Jan 25, 2020
Nov 13, 2021
Jan 25, 2020
Mar 6, 2023
Apr 1, 2023
Mar 6, 2023
Nov 5, 2021
Jan 24, 2020
Mar 6, 2023

Repository files navigation

                                          _________________________
                                         < OarphPy!! Oarph! Oarph! >
                                         <   OarphKit for Python!! >
                                          -------------------------
                                                        \
                                                         \
                         ____                __   ___       -~~~~-
                        / __ \___ ________  / /  / _ \__ __|O __ O|     
                       / /_/ / _ `/ __/ _ \/ _ \/ ___/ // /|_\__/_|__-  
                       \____/\_,_/_/ / .__/_//_/_/   \_,---(__/\__)---  
                                 .--/_/             /___/ /  ~--~  \    
                            ,__;`  o __`'.          _,..-/  | \/ |  \   
                            '  `'---'  `'.'.      .'.'` |   | /\ |   |
                                          .'-...-`.'  _/ /\__    __/\ \_
                                            -...-`  ~~~~~    ~~~~    ~~~~~

License Build Status PyPI version

OarphPy is a collection of Python utilities for Data Science with PySpark and Tensorflow. Related (but orthogonal) to OarphKit.

Quickstart

Install from PyPI: pip install oarphpy. We test OarphPy in a variet of environments (see below), so it should play well with your Jupyter/Colab notebook or project environment. To include all extras, use pip install oarphpy[all].

Or use the dockerized environment hosted on DockerHub:

  $ ./oarphcli --shell
  -- or --
  $ docker run -it --net=host oarphpy/full bash

See also API documentation.

Demos

Dockerized Development Environments

OarphPy is built and tested in a variety of environments to ensure the library works with and without optional dependencies. These environments are shared on DockerHub and defined in the docker subdirectory of this repo:

  • oarphpy/full -- Includes Tensorflow, Jupyter, a binary install of Spark, and other tools like Bokeh. Use this environment for adhoc data science or as a starter for other projects.

  • oarphpy/base-py2 -- Tests oarphpy in a vanilla Python 2.7 environment to ensure clean interop with other projects.

  • oarphpy/base-py3 -- Tests oarphpy in a vanilla Python 3 environment to ensure clean interop with other projects.

  • oarphpy/spark -- Tests oarphpy with a vanilla install of PySpark to ensure basic compatibility.

  • oarphpy/tensorflow -- Tests oarphpy with Tensorflow 1.x to ensure basic compatibility (e.g. of oarphpy.util.tfutil).

Development

See ./oarphcli --help for the development and release workflow.

Releases

No releases published

Packages

No packages published