- NannyML
- Amsterdam
- in/madkowalczuk
- @Anopsy
- https://medium.com/@anopsy28
- anopsy_amsterdam
Stars
A multiverse of Prophet models for timeseries
GeostatsPy Python package for spatial data analytics and geostatistics. Started as a reimplementation of GSLIB, Geostatistical Library (Deutsch and Journel, 1992) from Fortran to Python, Geostatist…
Python interactive dashboards for learning data science
Well-documented demonstration Python Jupyter workflows for many common machine learning tasks
The Techdoc is a Hugo Theme for technical documentation.
nannyml: post-deployment data science in Python
Ease multi-version support for scikit-learn compatible libraries
A data modelling layer built on top of polars and pydantic
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Fast High-Dimensional Fixed Effects Regression in Python following fixest-syntax
Draw datasets from within Python notebooks.
Morph an input dataset of 2D points into select shapes, while preserving the summary statistics to a given number of decimal points through simulated annealing. It is intended to be used as a teach…
WebApps in pure Python. No JavaScript, HTML and CSS needed
Julia package for automated Bayesian inference on a factor graph with reactive message passing
Serverside scaling for Vega and Altair visualizations
The book every data scientist needs on their desk.
Contains experiments using data and models from the paper: "Open-Source Drift Detection Tools in Action: Insights from Two Use Cases"
📊 Explain why metrics change by unpacking them
A command line tool to easily add an ethics checklist to your data science projects.
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Scripts and datasets for the O'Reilly book Python Polars: The Definitive Guide
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
One line of code for data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Visualize and compare datasets, target values and associations, with one line of code.
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Efficient matrix representations for working with tabular data
Knowledge Graph Generator app