Stars
Applied AI experiments and examples for PyTorch
A throughput-oriented high-performance serving framework for LLMs
GitHub mirror of the triton-lang/triton repo.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features, and more.
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
SGLang is a fast serving framework for large language models and vision language models.
Master programming by recreating your favorite technologies from scratch.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Quantized attention that achieves speedups of 2.1-3.1x and 2.7-5.1x over FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
Many container image registries, such as gcr, are hosted overseas, making downloads slow in China and in need of acceleration. This project aims to provide a stable, reliable, and secure container image service that connects the whole world.
Checks regexes for overlaps. Based on greenery by @qntm.
An LR(1) parser generator and visualizer created for educational purposes.
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
🔥🔥 Over 1,000 classic computer science books, personal notes, and resources referenced in my articles across various platforms. Book topics include C/C++, Java, Python, Go, data structures and algorithms, operating systems, backend architecture, computer systems, databases, computer networking, design patterns, frontend, assembly, and interview experiences from campus and industry hiring.
Data validation using Python type hints
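The data-validation-via-type-hints idea above (as implemented by pydantic) can be sketched with the standard library alone. This is a minimal illustration of the concept, not pydantic's actual API:

```python
# Sketch: validate a dict against a class's type annotations.
# pydantic itself adds coercion, nested models, and much more.
from typing import get_type_hints


class User:
    name: str
    age: int


def validate(cls, data: dict) -> dict:
    hints = get_type_hints(cls)
    for field, expected in hints.items():
        if field not in data:
            raise ValueError(f"missing field: {field}")
        if not isinstance(data[field], expected):
            raise TypeError(f"{field} must be {expected.__name__}")
    return data


validate(User, {"name": "Ada", "age": 36})      # passes
# validate(User, {"name": "Ada", "age": "36"})  # raises TypeError
```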
FP16xINT4 LLM inference kernel that achieves near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
Python bindings for general-sam and some utilities
A general suffix automaton implementation in Rust with Python bindings
The easiest, laziest way to build multi-agent LLM applications.
🌟 Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring dazzling arithmetic magic.)
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".