everloom

everloom

Stars

torvalds / linux

Linux kernel source tree

C 184,865 54,530 Updated Dec 26, 2024

apache / brpc

brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" mea…

C++ 16,634 3,991 Updated Dec 24, 2024

brendangregg / FlameGraph

Stack trace visualizer

Perl 17,577 1,984 Updated Oct 20, 2024

zhihu / ZhiLight

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 434 37 Updated Dec 16, 2024

pcg-mlp / KsanaLLM

C++ 298 30 Updated Dec 26, 2024

FoundationVision / VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Jupyter Notebook 6,443 428 Updated Dec 22, 2024

ModelTC / lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,708 217 Updated Dec 26, 2024

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 7,305 1,971 Updated Dec 25, 2024

ModelTC / quant_horizon

Cuda 7 2 Updated Dec 26, 2024

zjhellofss / KuiperLLama

校招、秋招、春招、实习好项目，带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

C++ 252 59 Updated Nov 5, 2024

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,929 445 Updated Dec 26, 2024

PaddlePaddle / Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

C++ 22,375 5,638 Updated Dec 26, 2024

modelscope / evalscope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 322 38 Updated Dec 26, 2024

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,202 684 Updated Dec 24, 2024

jasonacox / TinyLLM

Setup and run a local LLM and Chatbot using consumer grade hardware.

JavaScript 199 19 Updated Dec 10, 2024

microsoft / DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,932 175 Updated Nov 20, 2024

p0deje / Maccy

Lightweight clipboard manager for macOS

Swift 13,445 565 Updated Dec 24, 2024

ApolloAuto / apollo

An open autonomous driving platform

C++ 25,368 9,739 Updated Dec 19, 2024

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 6,714 611 Updated Dec 26, 2024

kubernetes / kubernetes

Production-Grade Container Scheduling and Management

Go 111,952 39,913 Updated Dec 24, 2024

zhongyang219 / TrafficMonitor

这是一个用于显示当前网速、CPU及内存利用率的桌面悬浮窗软件，并支持任务栏显示，支持更换皮肤。

C++ 35,573 3,293 Updated Mar 16, 2024

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 34,625 5,895 Updated Dec 26, 2024

ModelTC / llmc

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Python 367 42 Updated Dec 23, 2024

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,795 27,395 Updated Dec 26, 2024

docker / compose

Define and run multi-container applications with Docker

Go 34,310 5,268 Updated Dec 19, 2024

mit-han-lab / llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,624 216 Updated Dec 20, 2024

pybind / pybind11

Seamless operability between C++11 and Python

C++ 15,983 2,126 Updated Dec 22, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 14,791 1,393 Updated Dec 26, 2024

numpy / numpy

The fundamental package for scientific computing with Python.

Python 28,411 10,239 Updated Dec 26, 2024

python / cpython

The Python programming language

Python 64,367 30,766 Updated Dec 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly