Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.

Jupyter Notebook 2,018 133 Updated Feb 12, 2025

ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.

Java 17,562 3,330 Updated Feb 8, 2025

Ibis Substrait Compiler

Python 98 20 Updated Feb 12, 2025

Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.

Java 820 190 Updated Feb 10, 2025

Python SQL Parser and Transpiler

Python 7,110 770 Updated Feb 11, 2025

Writing AI Conference Papers: A Handbook for Beginners

1,884 67 Updated Dec 23, 2024

A programming framework for agentic AI 🤖 PyPI: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 39,249 5,760 Updated Feb 12, 2025

SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.

Java 2,840 507 Updated Feb 10, 2025

Home of the Open Data Contract Standard (ODCS).

Ruby 438 46 Updated Feb 11, 2025

The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"

Python 390 29 Updated Nov 28, 2023

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 10,079 840 Updated Feb 11, 2025
TypeScript 43 2 Updated Nov 27, 2023

The Memory layer for AI Agents

Python 24,529 2,277 Updated Feb 6, 2025

System for collecting, deriving and working with facts about source code.

Hack 1,139 54 Updated Feb 11, 2025

A @ClickHouse fork that supports high-performance vector search and full-text search.

C++ 918 52 Updated Feb 5, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,216 43 Updated Feb 7, 2025

An unofficial PyTorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'

Python 47 8 Updated Aug 19, 2024

Build Conversational AI in minutes ⚡️

Python 8,535 1,115 Updated Feb 11, 2025

Simple Chainlit UI for running LLMs from Groq and LangChain

Python 17 21 Updated Feb 28, 2024

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 766 45 Updated Jul 29, 2024

Crawl a site to generate knowledge files to create your own custom GPT from a URL

TypeScript 20,712 2,196 Updated Jan 23, 2025

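This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it to strengthen their understanding of tabular data, ultimately building a large language model dedicated to table intelligence tasks.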

524 40 Updated Apr 22, 2024

Checkmk - Best-in-class infrastructure & application monitoring

Python 1,657 473 Updated Feb 12, 2025

Question and Answer based on Anything.

Python 12,447 1,202 Updated Nov 19, 2024

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 55,185 7,413 Updated Nov 13, 2024

SoTA LLM for converting natural language questions to SQL queries

Jupyter Notebook 3,542 229 Updated May 23, 2024
Jupyter Notebook 307 51 Updated Nov 22, 2023

Home of StarCoder: fine-tuning & inference!

Python 7,360 522 Updated Feb 27, 2024

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Python 7,644 531 Updated Jul 10, 2024
Jupyter Notebook 4,073 537 Updated Mar 28, 2024