Skip to content
View zhsh9's full-sized avatar

Highlights

  • Pro

Block or report zhsh9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Easy-to-use,Modular and Extendible package of deep-learning based CTR models .

Python 7,639 2,222 Updated Aug 9, 2024

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 3,790 347 Updated Dec 25, 2024

A simple Python program to implement the search-extract-summarize flow.

Python 228 31 Updated Dec 20, 2024

Large Language Model in Action

Python 133 29 Updated May 28, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 22,762 1,863 Updated Jan 12, 2025

Opensource IDE For Exploring and Testing Api's (lightweight alternative to postman/insomnia)

JavaScript 29,753 1,402 Updated Jan 15, 2025

macOS Integrated Injection Framework (GUI version)

Swift 1,231 77 Updated Oct 6, 2024

Community fork of PlayCover

Swift 8,966 780 Updated Jan 2, 2025

Everything about the SmolLM & SmolLM2 family of models

Python 1,552 81 Updated Jan 7, 2025

A course on aligning smol models.

Jupyter Notebook 4,499 1,474 Updated Jan 15, 2025

Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"

Python 265 25 Updated Sep 3, 2024

AIFoundation 主要是指AI系统遇到大模型,从底层到上层如何系统级地支持大模型训练和推理,全栈的核心技术。

Python 628 85 Updated Jan 14, 2025

Streamlit — A faster way to build and share data apps.

Python 36,715 3,162 Updated Jan 15, 2025

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓

2,269 127 Updated Dec 17, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 11,934 1,730 Updated Jan 2, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,907 637 Updated Jan 15, 2025

科学上网🕸️之跑路机场名单收集(2020-2024),欢迎投稿。

2,638 48 Updated Jan 9, 2025

本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。

Jupyter Notebook 18,542 5,414 Updated Oct 14, 2021

Accessible large language models via k-bit quantization for PyTorch.

Python 6,517 649 Updated Jan 14, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,242 4,192 Updated Jan 15, 2025

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,618 493 Updated Dec 15, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 15,340 2,640 Updated Dec 18, 2024

95.47% on CIFAR10 with PyTorch

Python 6,062 2,153 Updated Feb 24, 2023

A professional cross-platform SSH/Sftp/Shell/Telnet/Tmux/Serial terminal.

C 24,580 1,901 Updated Jan 12, 2025

😎 A curated list of awesome GitHub Profile which updates in real time

25,291 3,820 Updated Aug 19, 2024

🚀🚀 「大模型」3小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 3 hours!

Python 4,933 562 Updated Dec 13, 2024

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 9,375 689 Updated Jan 15, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,112 490 Updated May 3, 2024

🔥 Top-Rated Web-Based Linux Server Management Tool. 1Panel features an intuitive web interface that seamlessly integrates server management and monitoring, container management, database administra…

Go 24,962 2,238 Updated Jan 15, 2025

从零实现一个小参数量中文大语言模型。

Python 409 50 Updated Aug 22, 2024
Next