Skip to content
View everloom's full-sized avatar

Block or report everloom

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Linux kernel source tree

C 184,865 54,530 Updated Dec 26, 2024

brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" mea…

C++ 16,634 3,991 Updated Dec 24, 2024

Stack trace visualizer

Perl 17,577 1,984 Updated Oct 20, 2024

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 434 37 Updated Dec 16, 2024
C++ 298 30 Updated Dec 26, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Jupyter Notebook 6,443 428 Updated Dec 22, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,708 217 Updated Dec 26, 2024

A framework for few-shot evaluation of language models.

Python 7,305 1,971 Updated Dec 25, 2024
Cuda 7 2 Updated Dec 26, 2024

校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

C++ 252 59 Updated Nov 5, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,929 445 Updated Dec 26, 2024

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 22,375 5,638 Updated Dec 26, 2024

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 322 38 Updated Dec 26, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,202 684 Updated Dec 24, 2024

Setup and run a local LLM and Chatbot using consumer grade hardware.

JavaScript 199 19 Updated Dec 10, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,932 175 Updated Nov 20, 2024

Lightweight clipboard manager for macOS

Swift 13,445 565 Updated Dec 24, 2024

An open autonomous driving platform

C++ 25,368 9,739 Updated Dec 19, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 6,714 611 Updated Dec 26, 2024

Production-Grade Container Scheduling and Management

Go 111,952 39,913 Updated Dec 24, 2024

这是一个用于显示当前网速、CPU及内存利用率的桌面悬浮窗软件,并支持任务栏显示,支持更换皮肤。

C++ 35,573 3,293 Updated Mar 16, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 34,625 5,895 Updated Dec 26, 2024

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Python 367 42 Updated Dec 23, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,795 27,395 Updated Dec 26, 2024

Define and run multi-container applications with Docker

Go 34,310 5,268 Updated Dec 19, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,624 216 Updated Dec 20, 2024

Seamless operability between C++11 and Python

C++ 15,983 2,126 Updated Dec 22, 2024

Fast and memory-efficient exact attention

Python 14,791 1,393 Updated Dec 26, 2024

The fundamental package for scientific computing with Python.

Python 28,411 10,239 Updated Dec 26, 2024

The Python programming language

Python 64,367 30,766 Updated Dec 26, 2024
Next