-
Yonsei University
-
06:35
- 9h ahead
Stars
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
Awesome LLMs on Device: A Comprehensive Survey
React Native On-Device Machine Learning w/ Google ML Kit
Search-o1: Agentic Search-Enhanced Large Reasoning Models
PyTorch implementation of soft actor critic
A framework for few-shot evaluation of language models.
All Algorithms implemented in Python
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
A bibliography and survey of the papers surrounding o1
[ACL 2024] RDRec: Rationale Distillation for LLM-based Recommendation
Repository for the paper Stream of Search: Learning to Search in Language
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
Curated papers on Large Language Models in Healthcare and Medical domain
Official implementation of Learning to Correct for QA Reasoning with Black-box LLMs (CoBB)
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
800,000 step-level correctness labels on LLM solutions to MATH problems
Meditron is a suite of open-source medical Large Language Models (LLMs).
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)