zshanwei

6 followers · 7 following

Stars

TencentQQGYLab / AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 5,404 595 Updated Aug 8, 2024

X-PLUG / MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 3,215 312 Updated Jan 22, 2025

om-ai-lab / OmAgent

Build multimodal language agents for fast prototype and production

Python 1,293 110 Updated Jan 20, 2025

SamuelSchmidgall / AgentLaboratory

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 3,196 402 Updated Jan 18, 2025

OpenBMB / mlc-MiniCPM

Forked from mlc-ai/mlc-llm

MiniCPM on Android platform.

Python 625 50 Updated Apr 11, 2024

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 17,326 1,236 Updated Jan 22, 2025

NVlabs / VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,807 224 Updated Jan 11, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,161 440 Updated Jan 9, 2025

VITA-MLLM / VITA

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 1,989 143 Updated Jan 21, 2025

AIoT-MLSys-Lab / Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

1,076 89 Updated Jan 14, 2025

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,273 262 Updated Jan 21, 2025

NexaAI / nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…

Python 4,269 609 Updated Jan 20, 2025

Hsu1023 / DuQuant

[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

Python 138 9 Updated Oct 3, 2024

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

739 42 Updated Oct 22, 2024

cliang1453 / task-aware-distillation

Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)

Python 32 4 Updated Aug 28, 2023

huggingface / smollm

Everything about the SmolLM & SmolLM2 family of models

Python 1,566 83 Updated Jan 7, 2025

UbiquitousLearning / PhoneLM

Python 33 1 Updated Nov 16, 2024

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 12,639 881 Updated Dec 20, 2024

pytorch / torchtune

PyTorch native post-training library

Python 4,726 494 Updated Jan 21, 2025

pytorch / executorch

On-device AI across mobile, embedded and edge for PyTorch

C++ 2,424 424 Updated Jan 22, 2025

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 8,292 606 Updated Jan 18, 2025

facebookresearch / MobileLLM

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,224 70 Updated Nov 27, 2024

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 1,767 95 Updated Jan 22, 2025

sql-machine-learning / elasticdl

Kubernetes-native Deep Learning Framework

Python 734 115 Updated Jan 26, 2024

Cornell-RelaxML / qtip

Python 99 10 Updated Dec 28, 2024

microsoft / VPTQ

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 564 39 Updated Jan 21, 2025

anliyuan / Ultralight-Digital-Human

An ultra-lightweight digital human model that can run in real time on mobile devices

Python 1,445 210 Updated Nov 13, 2024

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 1,762 204 Updated Jan 22, 2025

baaivision / Emu3

Next-Token Prediction is All You Need

Python 1,969 78 Updated Oct 24, 2024

a-ghorbani / pocketpal-ai

An app that brings language models directly to your phone.

TypeScript 1,636 130 Updated Jan 21, 2025