Skip to content

๐Ÿผ Data pipelines using Pandas ๐Ÿ”ง๐Ÿ“Š

License

Notifications You must be signed in to change notification settings

carlosjpc/panditas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

54 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Panditas

CI Buld Status Code style: Black

Build Data Pipelines using Pandas and S3. Initially this will support the following executors:

Models

  • Data Flow
  • Data Flow Steps
    • Data Set
    • Data Transformation, available transformations include:
      • Calculated Column
      • Columns Subset
      • Conditional Fill
      • Constant Column
      • Filter
      • Format Columns (Currency, Date, etc)
      • Pivot Table (for grouping by)
      • Remove Duplicates
      • Rename Column
      • Replace Text
      • Sort by Columns
      • Value Mapper
    • Single Merge Rule
    • Multiple Merge Rule

Example

This is a sample dataflow from the insurance industry, implemented here:

Credits

"gummy bear" icon by emilegraphics from the Noun Project.

About

๐Ÿผ Data pipelines using Pandas ๐Ÿ”ง๐Ÿ“Š

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages