Stars
A declarative PySpark framework for row- and aggregate-level data quality validation.
Headless TypeScript ORM with a head. Runs on Node, Bun and Deno. Lives on the Edge and yes, it's a JavaScript ORM too 😅
📙 Awesome Data Catalogs and Observability Platforms.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
This is a repo with links to everything you'd ever want to learn about data engineering
WhatTheDuck is an open-source web application built on DuckDB. It allows users to upload CSV files, store them in tables, and perform SQL queries on the data.
Make awesome display tables using Python
Roadmap to becoming a data engineer in 2021
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
A huge collection of polybar themes with different styles, colors and variants.
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
A list of Ukrainian IT communities, news portals, Telegram groups, and other places where people can communicate with each other.
More than 2,000 data engineer interview questions.
Running Scala in WebAssembly through Scala Native
Background Remover lets you remove the background from images and video using AI, through a simple command-line interface that is free and open source.
A curated list of software and architecture related design patterns.
Master the command line, in one page
Apache Spark - A unified analytics engine for large-scale data processing
A pyenv plugin to manage virtualenv (a.k.a. python-virtualenv)
A type-safe TypeScript SQL query builder