Stars
Surfalytics projects on Data Engineering and Analytics
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
🪄 Create rich visualizations with AI
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K…
Sample files to accompany the FT's Chart Doctor column
One Way Communication Using Socket Programming, Java And Python
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
Actively curated list of awesome BI tools. PRs welcome!
This is a repo with links to everything you'd ever want to learn about data engineering
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…
Automated modeling and machine learning framework FEDOT
Custom Contoso database generator and ready-to-use Contoso sample databases for SQL Server
Official community-driven Azure Machine Learning examples, tested with GitHub Actions.
Open, Multi-modal Catalog for Data & AI
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
Free, simple, and intuitive online database diagram editor and SQL generator.
The modern replacement for Jupyter Notebooks
A JavaScript library aimed at visualizing graphs of thousands of nodes and edges
This repository provides an example of dataset preprocessing, GBRT (Gradient Boosted Regression Tree) model training and evaluation, model tuning and finally model serving (REST API) in a container…
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
Documentation for creating visuals for Power BI
Product Rationalization of Pro Bikes Inc using Power BI
Open source tool for monitoring and managing ClickHouse clusters
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform