Stars
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
A simple Python program to implement the search-extract-summarize flow.
A generative world for general-purpose robotics & embodied AI learning.
Opensource IDE For Exploring and Testing Api's (lightweight alternative to postman/insomnia)
macOS Integrated Injection Framework (GUI version)
Everything about the SmolLM & SmolLM2 family of models
A course on aligning smol models.
Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"
AIFoundation 主要是指AI系统遇到大模型,从底层到上层如何系统级地支持大模型训练和推理,全栈的核心技术。
Streamlit — A faster way to build and share data apps.
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Hackable and optimized Transformers building blocks, supporting a composable construction.
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
Accessible large language models via k-bit quantization for PyTorch.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
A professional cross-platform SSH/Sftp/Shell/Telnet/Tmux/Serial terminal.
😎 A curated list of awesome GitHub Profile which updates in real time
🚀🚀 「大模型」3小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 3 hours!
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
🔥 Top-Rated Web-Based Linux Server Management Tool. 1Panel features an intuitive web interface that seamlessly integrates server management and monitoring, container management, database administra…