Skip to content
View MrToser's full-sized avatar

Highlights

  • Pro

Block or report MrToser

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A LLM-based Agent that predict its tasks proactively.

Python 275 26 Updated Jan 7, 2025

ProAgent: Building Proactive Cooperative Agents with Large Language Models

JavaScript 67 8 Updated Apr 8, 2024

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,226 394 Updated Sep 29, 2023

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,742 836 Updated Aug 20, 2024

心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1

Python 946 131 Updated Dec 20, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,171 1,559 Updated Feb 29, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,426 851 Updated Jan 6, 2025
Jupyter Notebook 31 2 Updated Oct 14, 2024

[TMLR 2024] Efficient Large Language Models: A Survey

1,067 87 Updated Jan 3, 2025
Dockerfile 7 1 Updated Oct 9, 2024
Python 3 Updated Jan 30, 2024

The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>

Python 110 6 Updated Dec 6, 2024

This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

345 31 Updated Nov 25, 2023

A comprehensive overview of affective computing research in the era of large language models (LLMs).

17 2 Updated Aug 7, 2024

This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.

248 26 Updated Apr 17, 2024

Awesome papers for role-playing with language models

144 7 Updated Nov 3, 2024

Must-read Papers on LLM Agents.

2,021 110 Updated Nov 12, 2024

Paper List for In-context Learning 🌷

828 59 Updated Oct 8, 2024
Python 17 2 Updated Dec 20, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,800 56 Updated Nov 30, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,643 346 Updated Jan 8, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

3,605 221 Updated Dec 5, 2024

欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓

Python 435 37 Updated Jan 3, 2025

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 20,803 6,058 Updated Jul 13, 2023

Autonomous Agents (LLMs) research papers. Updated Daily.

595 33 Updated Jan 9, 2025
Python 13 1 Updated Oct 22, 2024

A repo lists papers related to LLM based agent

Python 1,178 77 Updated Aug 1, 2024

Code for our NeurIPS'24 Dataset and Benchmark paper: Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiation

Jupyter Notebook 24 3 Updated Nov 11, 2024

记录本人整理的一些数据集

1,018 132 Updated Jun 16, 2022
Next