Stars
A collection of prompts, system prompts and LLM instructions
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
Projects & Resources to help you become a better AI Developer.
A lightweight data processing framework built on DuckDB and 3FS.
AI Crash Course to help busy builders catch up to the public frontier of AI research in 2 weeks
⚡ TabPFN: Foundation Model for Tabular Data ⚡
ClickHouse® is a real-time analytics database management system
Panel: The powerful data exploration & web app framework for Python
A short demo to introduce the polars dataframe library through a marimo notebook.
InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
A Structured Output Framework for LLM Outputs
Towards Human-Friendly, Fast Learning and Adaptable Agent Communities
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
This is a public repository to go over all the LLM-driven data engineering concepts.
A standard framework for modelling Deep Learning Models for tabular data
🏅 Collection of Kaggle Solutions and Ideas 🏅
Turns Data and AI algorithms into production-ready web applications in no time.
A light-weight, flexible, and expressive statistical data testing library
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
Typer, build great CLIs. Easy to code. Based on Python type hints.