Stars
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
Simple, safe way to store and distribute tensors
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
GoReplay is an open-source tool for capturing and replaying live HTTP traffic into a test environment in order to continuously test your system with real data. It can be used to increase confidence…
Tsung is a high-performance benchmark framework for various protocols including HTTP, XMPP, LDAP, etc.
Reference implementations of MLPerf™ training benchmarks
Ongoing research training transformer models at scale
High-performance Python librarys for connecting AI/ML frameworks with OSS storage.
Stable Diffusion web UI
A quick guide (especially) for trending instruction finetuning datasets
A topic-centric list of HQ open datasets.
A high-throughput and memory-efficient inference and serving engine for LLMs
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.
Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。
A simple HTTPS client based on Boost Asio.