Skip to content
View zshanwei's full-sized avatar

Block or report zshanwei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 5,404 595 Updated Aug 8, 2024

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 3,215 312 Updated Jan 22, 2025

Build multimodal language agents for fast prototype and production

Python 1,293 110 Updated Jan 20, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 3,196 402 Updated Jan 18, 2025

MiniCPM on Android platform.

Python 625 50 Updated Apr 11, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 17,326 1,236 Updated Jan 22, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,807 224 Updated Jan 11, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,161 440 Updated Jan 9, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 1,989 143 Updated Jan 21, 2025

[TMLR 2024] Efficient Large Language Models: A Survey

1,076 89 Updated Jan 14, 2025

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,273 262 Updated Jan 21, 2025

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…

Python 4,269 609 Updated Jan 20, 2025

[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

Python 138 9 Updated Oct 3, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

739 42 Updated Oct 22, 2024

Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)

Python 32 4 Updated Aug 28, 2023

Everything about the SmolLM & SmolLM2 family of models

Python 1,566 83 Updated Jan 7, 2025
Python 33 1 Updated Nov 16, 2024

Official inference framework for 1-bit LLMs

C++ 12,639 881 Updated Dec 20, 2024

PyTorch native post-training library

Python 4,726 494 Updated Jan 21, 2025

On-device AI across mobile, embedded and edge for PyTorch

C++ 2,424 424 Updated Jan 22, 2025

Retrieval and Retrieval-augmented LLMs

Python 8,292 606 Updated Jan 18, 2025

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,224 70 Updated Nov 27, 2024

🚀 Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,767 95 Updated Jan 22, 2025

Kubernetes-native Deep Learning Framework

Python 734 115 Updated Jan 26, 2024
Python 99 10 Updated Dec 28, 2024

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 564 39 Updated Jan 21, 2025

一个超轻量级、可以在移动端实时运行的数字人模型

Python 1,445 210 Updated Nov 13, 2024

PyTorch native quantization and sparsity for training and inference

Python 1,762 204 Updated Jan 22, 2025

Next-Token Prediction is All You Need

Python 1,969 78 Updated Oct 24, 2024

An app that brings language models directly to your phone.

TypeScript 1,636 130 Updated Jan 21, 2025
Next