InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data

Python 27 52 Updated Jan 20, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 16,945 4,254 Updated Jan 21, 2025

A Strict JSON Framework for LLM Outputs

Jupyter Notebook 324 33 Updated Oct 22, 2024

Towards Human-Friendly, Fast Learning and Adaptable Agent Communities

Jupyter Notebook 113 9 Updated Jan 14, 2025

AI Verify

Python 129 36 Updated Jan 20, 2025

Apache Flink

Java 24,423 13,480 Updated Jan 21, 2025

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 10,726 3,065 Updated Jan 21, 2025

Jupyter Notebook 65 50 Updated Dec 5, 2024

This is a public repository to go over all the LLM-driven data engineering concepts.

Python 955 164 Updated Oct 26, 2024

Apache DataFusion SQL Query Engine

Rust 6,649 1,278 Updated Jan 20, 2025

A standard framework for modelling Deep Learning Models for tabular data

Python 1,434 145 Updated Jan 21, 2025

πŸ… Collection of Kaggle Solutions and Ideas πŸ…

HTML 5,060 1,897 Updated Jan 7, 2025

Turns Data and AI algorithms into production-ready web applications in no time.

Python 17,633 1,879 Updated Jan 21, 2025

🍦 Never use print() to debug again.

Python 9,436 197 Updated Jan 13, 2025

A light-weight, flexible, and expressive statistical data testing library

Python 3,556 312 Updated Jan 15, 2025

Data validation using Python type hints

Python 22,097 1,972 Updated Jan 20, 2025

The pytest framework makes it easy to write small tests, yet scales to support complex functional testing

Python 12,343 2,731 Updated Jan 20, 2025

Typer, build great CLIs. Easy to code. Based on Python type hints.

Python 16,238 693 Updated Jan 20, 2025

Python logging made (stupidly) simple

Python 20,590 712 Updated Jan 17, 2025

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

Jupyter Notebook 1,987 133 Updated Jan 13, 2025

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 31,443 2,050 Updated Jan 20, 2025

Generate deterministic fake values: The same input will always generate the same fake-output.

TypeScript 917 34 Updated Jan 14, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 14,864 3,594 Updated Jan 21, 2025

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 4,246 369 Updated Jan 15, 2025

Up to 200x Faster Dot Products & Similarity Metrics - for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, …

C 1,197 68 Updated Dec 21, 2024

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 79,561 11,227 Updated Jan 20, 2025

DuckDB is an analytical in-process SQL database management system

C++ 25,909 2,046 Updated Jan 20, 2025

the portable Python dataframe library

Python 5,457 609 Updated Jan 21, 2025

Jupyter Notebook 413 98 Updated Apr 29, 2024

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …

Python 25,020 1,342 Updated Jan 21, 2025