Skip to content
View talzoomanzoo's full-sized avatar
🏠
Working from home
🏠
Working from home
  • Yonsei University
  • 06:35 - 9h ahead

Block or report talzoomanzoo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python 1,330 158 Updated Apr 3, 2023

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

353 17 Updated Sep 12, 2024

Awesome LLMs on Device: A Comprehensive Survey

1,051 101 Updated Jan 12, 2025

React Native On-Device Machine Learning w/ Google ML Kit

Java 474 65 Updated Jul 14, 2024

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 760 82 Updated Apr 1, 2025

PyTorch implementation of soft actor critic

Python 867 183 Updated Nov 9, 2021

A framework for few-shot evaluation of language models.

Python 8,503 2,267 Updated Apr 3, 2025

All Algorithms implemented in Python

Python 199,062 46,527 Updated Apr 2, 2025

[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,482 133 Updated Mar 10, 2025

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

1,945 154 Updated Mar 26, 2025

Critique-out-Loud Reward Models

Python 56 4 Updated Oct 18, 2024
Python 120 6 Updated Jan 21, 2025

[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".

Python 74 2 Updated Jan 14, 2025
Python 62 3 Updated Nov 19, 2024

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,543 284 Updated Mar 25, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,185 50 Updated Nov 16, 2024

[ACL 2024] RDRec: Rationale Distillation for LLM-based Recommendation

Python 34 4 Updated Jan 9, 2025

Repository for the paper Stream of Search: Learning to Search in Language

Python 142 22 Updated Feb 3, 2025

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,198 495 Updated Jan 16, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,912 162 Updated Mar 19, 2025

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Python 674 57 Updated Oct 4, 2024

Curated papers on Large Language Models in Healthcare and Medical domain

293 33 Updated Jan 13, 2025
Python 241 36 Updated May 27, 2024
Python 914 104 Updated Jan 23, 2025

Official implementation of Learning to Correct for QA Reasoning with Black-box LLMs (CoBB)

4 1 Updated Jun 26, 2024

[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction

Python 66 4 Updated Mar 23, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,963 116 Updated Jun 1, 2023

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 1,999 191 Updated Apr 10, 2024

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,157 75 Updated Feb 24, 2025

Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)

Python 59 3 Updated Feb 22, 2025
Next
Showing results