Skip to content
View lucky521's full-sized avatar

Organizations

@ludolph

Block or report lucky521

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A list of awesome papers and resources of recommender system on large language model (LLM).

1,543 130 Updated Aug 15, 2024

Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.

Python 1,698 180 Updated Jan 16, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,369 1,243 Updated Dec 12, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,181 460 Updated Jan 16, 2025

ONNX-TensorRT: TensorRT backend for ONNX

C++ 2,996 547 Updated Dec 3, 2024

Large Language Model Text Generation Inference

Python 9,586 1,119 Updated Jan 15, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 2,929 640 Updated Jan 16, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,148 1,064 Updated Jan 13, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 98,316 15,972 Updated Jan 16, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 74,372 8,879 Updated Jan 4, 2025

Fast and memory-efficient exact attention

Python 15,063 1,422 Updated Jan 15, 2025

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,963 5,233 Updated Jun 27, 2024

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

C++ 2,959 333 Updated Jul 31, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,809 5,176 Updated Jan 16, 2025

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,367 348 Updated Jan 13, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,496 4,587 Updated Jan 14, 2025

Port of OpenAI's Whisper model in C/C++

C++ 36,919 3,802 Updated Jan 14, 2025

LLM inference in C/C++

C++ 70,748 10,226 Updated Jan 15, 2025

Source code for Twitter's Recommendation Algorithm

Python 10,164 2,219 Updated Jul 10, 2024

Source code for Twitter's Recommendation Algorithm

Scala 62,765 12,170 Updated Jul 10, 2024

The core of our monitoring platform with a powerful configuration language and REST API.

C++ 2,042 579 Updated Jan 15, 2025

阿里巴巴 MySQL binlog 增量订阅&消费组件

Java 28,747 7,645 Updated Jan 13, 2025

Netty project - an event-driven asynchronous network application framework

Java 33,684 15,995 Updated Jan 15, 2025

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 2,586 1,012 Updated Jan 16, 2025

Java Native Access

Java 8,598 1,684 Updated Dec 28, 2024

Feathr – A scalable, unified data and AI engineering platform for enterprise

Scala 1,869 259 Updated Apr 4, 2024

OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.

C++ 1,605 322 Updated Jan 9, 2025

TiSpark is built for running Apache Spark on top of TiDB/TiKV

Scala 886 247 Updated Jan 6, 2025

Apache Hive

Java 5,612 4,707 Updated Jan 16, 2025

Deep learning with dynamic computation graphs in TensorFlow

Python 1,823 266 Updated Jun 26, 2021
Next