Skip to content
View feng-tao's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Organizations

@apache @amundsen-io

Block or report feng-tao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉

3,315 226 Updated Jan 31, 2025

List of language agents based on paper "Cognitive Architectures for Language Agents"

TeX 852 64 Updated Jan 16, 2025

LLM training code for Databricks foundation models

Python 4,113 541 Updated Jan 31, 2025

Introduction to Machine Learning Systems

JavaScript 1,402 170 Updated Jan 30, 2025

DBSQL SME Repo contains demos, tutorials, blog code, advanced production helper functions and more!

Python 46 19 Updated Jan 22, 2025

Malloy is an experimental language for describing data relationships and transformations.

TypeScript 2,045 78 Updated Jan 28, 2025

LLM101n: Let's build a Storyteller

31,195 1,709 Updated Aug 1, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,626 429 Updated Jan 29, 2025

LLM inference in C/C++

C++ 72,399 10,431 Updated Jan 31, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,355 886 Updated Jul 1, 2024

Web UI for semi-automatically importing external data into beancount

Python 405 107 Updated Sep 21, 2024

深度学习经典、新论文逐段精读

27,960 2,488 Updated Nov 17, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,956 1,017 Updated Jan 14, 2025

The shared semantic layer definitions that dbt-core and MetricFlow use.

Python 75 14 Updated Jan 23, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,659 5,404 Updated Jan 31, 2025

the AI-native open-source embedding database

Rust 17,286 1,434 Updated Jan 31, 2025

✨✨Latest Advances on Multimodal Large Language Models

13,698 877 Updated Jan 28, 2025

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,722 1,037 Updated Jan 29, 2025

Inference code for Llama models

Python 57,430 9,689 Updated Jan 26, 2025

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,360 520 Updated Sep 18, 2024

LangChain 的中文入门教程

7,638 611 Updated Aug 11, 2024

Benchmarks of approximate nearest neighbor libraries in Python

Python 5,081 771 Updated Jan 22, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 138,327 27,760 Updated Jan 31, 2025

ChatGPT for wechat https://github.com/AutumnWhj/ChatGPT-wechat-bot

TypeScript 4,708 984 Updated Aug 19, 2024

The official Python library for the OpenAI API

Python 24,276 3,488 Updated Jan 31, 2025

Use ChatGPT On Wechat via wechaty

TypeScript 13,325 3,912 Updated May 20, 2024

Import Evernote ENEX files to Notion

Python 430 37 Updated Jan 17, 2024

Universal LLM Deployment Engine with ML Compilation

Python 19,793 1,639 Updated Jan 24, 2025

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

HTML 119,417 16,096 Updated Jan 14, 2025
Next