🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,506 5,955 Updated Aug 24, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,406 2,355 Updated Aug 12, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,748 166 Updated Jan 22, 2025

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 48,823 5,762 Updated Sep 18, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,350 4,943 Updated Feb 13, 2025

Trading module

Python 5,431 1,199 Updated May 13, 2024
C++ 347 33 Updated Feb 13, 2025

An MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 264 16 Updated Feb 7, 2024

Chinese version implementation of Generative Agents: Interactive Simulacra of Human Behavior

Python 73 7 Updated Sep 7, 2023

Generative Agents: Interactive Simulacra of Human Behavior

18,344 2,421 Updated Aug 5, 2024

A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.

Python 99 13 Updated Jan 16, 2024

Learning-based agent for Google Research Football (football game agent)

Python 111 20 Updated Apr 20, 2023

A dystopia simulator powered by ChatGPT

JavaScript 59 14 Updated Jun 18, 2024

[NeurIPS 2023] Generating Mario Levels with GPT2. Code for the paper "MarioGPT: Open-Ended Text2Level Generation through Large Language Models" https://arxiv.org/abs/2302.05981

Python 1,118 103 Updated Jul 22, 2024

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,448 703 Updated Jan 28, 2025

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,224 3,257 Updated Aug 17, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,580 478 Updated Jan 8, 2024

GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)

Python 3,008 680 Updated Oct 30, 2023

Python Fan calculator for Chinese Standard Mahjong

C++ 17 9 Updated Jan 26, 2025

Reinforcement learning (RL) implementation of the imperfect-information game Mahjong, using Markov decision processes to predict future game states

JavaScript 80 9 Updated Aug 24, 2022

Simple and easily configurable 3D FPS-game-like environments for reinforcement learning

Python 721 131 Updated Jan 12, 2025

Java and Python protobuf rpc implementation using tcp/ip sockets.

Java 59 31 Updated Apr 30, 2015

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Python 1,808 253 Updated Jun 12, 2023

EVA: Large-scale Pre-trained Chit-Chat Models

Python 307 50 Updated Mar 11, 2023

Massively Parallel Deep Reinforcement Learning. 🔥

Python 3,853 860 Updated Jan 13, 2025

Chinese Transformer Generative Pre-Training Model

Jupyter Notebook 59 10 Updated Oct 31, 2019

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,511 1,707 Updated Apr 25, 2024

Repo for external large-scale work

Python 6,516 729 Updated Apr 27, 2024

Proximal Policy Optimization with TensorFlow 2.0

Python 30 7 Updated Oct 14, 2019