MrToser

Follow

Toser MrToser

Follow

0 followers · 12 following

Achievements

Achievements

Highlights

Pro

Lists (9)

Sort

LLM efficiency inference

11 repositories

LLM fine-tuning

LLM rope

Machine learning

11 repositories

multimodal

Negotiation

23 repositories

Paper daily

12 repositories

prompt

reinforcement-learning

Stars

thunlp / ProactiveAgent

A LLM-based Agent that predict its tasks proactively.

Python 275 26 Updated Jan 7, 2025

PKU-Alignment / ProAgent

ProAgent: Building Proactive Cooperative Agents with Large Language Models

JavaScript 67 8 Updated Apr 8, 2024

HarderThenHarder / transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,226 394 Updated Sep 29, 2023

RUCAIBox / LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,742 836 Updated Aug 20, 2024

SmartFlowAI / EmoLLM

心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1

Python 946 131 Updated Dec 20, 2024

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,171 1,559 Updated Feb 29, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

13,426 851 Updated Jan 6, 2025

Jiaxin-Pei / Prompting-with-Social-Roles

Jupyter Notebook 31 2 Updated Oct 14, 2024

AIoT-MLSys-Lab / Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

1,067 87 Updated Jan 3, 2025

todexter3 / Richelieu

Dockerfile 7 1 Updated Oct 9, 2024

zishan-ai / neg

Python 3 Updated Jan 30, 2024

thu-nics / MoA

The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>

Python 110 6 Updated Dec 6, 2024

Tongji-KGLLM / RAG-Survey

1,915 126 Updated May 8, 2024

Zoeyyao27 / CoT-Igniting-Agent

This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

345 31 Updated Nov 25, 2023

NEU-DataMining / awesome-affective-computing

A comprehensive overview of affective computing research in the era of large language models (LLMs).

17 2 Updated Aug 7, 2024

Sahandfer / EMPaper

This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.

248 26 Updated Apr 17, 2024

nuochenpku / Awesome-Role-Play-Papers

Awesome papers for role-playing with language models

144 7 Updated Nov 3, 2024

zjunlp / LLMAgentPapers

Must-read Papers on LLM Agents.

2,021 110 Updated Nov 12, 2024

dqxiu / ICL_PaperList

Paper List for In-context Learning 🌷

828 59 Updated Oct 8, 2024

linxihui / dkernel

Python 17 2 Updated Dec 20, 2024

GAIR-NLP / O1-Journey

O1 Replication Journey: A Strategic Progress Report – Part I

1,800 56 Updated Nov 30, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,643 346 Updated Jan 8, 2025

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,605 221 Updated Dec 5, 2024

mst272 / LLM-Dojo

欢迎来到 LLM-Dojo，这里是一个开源大模型学习场所，使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓

Python 435 37 Updated Jan 3, 2025

dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 20,803 6,058 Updated Jul 13, 2023

tmgthb / Autonomous-Agents

Autonomous Agents (LLMs) research papers. Updated Daily.

595 33 Updated Jan 9, 2025

chengtan9907 / ReviewMT

Python 13 1 Updated Oct 22, 2024

AGI-Edgerunners / LLM-Agents-Papers

A repo lists papers related to LLM based agent

Python 1,178 77 Updated Aug 1, 2024

S-Abdelnabi / LLM-Deliberation

Code for our NeurIPS'24 Dataset and Benchmark paper: Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiation

Jupyter Notebook 24 3 Updated Nov 11, 2024

liucongg / NLPDataSet

记录本人整理的一些数据集

1,018 132 Updated Jun 16, 2022