Skip to content
View songhappy's full-sized avatar

Block or report songhappy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,429 5,377 Updated Jan 29, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,772 377 Updated Jul 11, 2024

A latent text-to-image diffusion model

Jupyter Notebook 69,319 10,280 Updated Jun 18, 2024

LLM inference in C/C++

C++ 72,074 10,404 Updated Jan 29, 2025

Machine learning glossary

Python 3,035 726 Updated Aug 8, 2024

Cheatsheet for Spark DataFrame

91 38 Updated Nov 18, 2019

Yahoo! news article recommendation system by linUCB

Python 114 44 Updated Feb 1, 2018

Open Source AI/ML Platform

Python 8,497 789 Updated Jan 29, 2025

Notebooks for learning deep learning

Jupyter Notebook 5,667 5,341 Updated Oct 3, 2023

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,079 5,968 Updated Jan 29, 2025

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 2,673 734 Updated Jan 21, 2025

VIP cheatsheets for Stanford's CS 229 Machine Learning

17,878 3,992 Updated May 20, 2020

Models and examples built with TensorFlow

Python 1 Updated Aug 24, 2020

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU su…

Python 7,024 1,289 Updated Jan 26, 2025