minju gwak talzoomanzoo

🏠

Working from home

3 followers · 4 following

Yonsei University

Stars

facebookresearch / diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python 1,330 158 Updated Apr 3, 2023

WindyLab / LLM-RL-Papers

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

353 17 Updated Sep 12, 2024

NexaAI / Awesome-LLMs-on-device

Awesome LLMs on Device: A Comprehensive Survey

1,051 101 Updated Jan 12, 2025

a7medev / react-native-ml-kit

React Native On-Device Machine Learning w/ Google ML Kit

Java 474 65 Updated Jul 14, 2024

sunnynexus / Search-o1

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 760 82 Updated Apr 1, 2025

pranz24 / pytorch-soft-actor-critic

PyTorch implementation of soft actor critic

Python 867 183 Updated Nov 9, 2021

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 8,503 2,267 Updated Apr 3, 2025

TheAlgorithms / Python

All Algorithms implemented in Python

Python 199,062 46,527 Updated Apr 2, 2025

AI-in-Health / MedLLMsPracticalGuide

[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,482 133 Updated Mar 10, 2025

hyp1231 / awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

1,945 154 Updated Mar 26, 2025

zankner / CLoud

Critique-out-Loud Reward Models

Python 56 4 Updated Oct 18, 2024

bytarnish / AGILE

Python 120 6 Updated Jan 21, 2025

SparkJiao / dpo-trajectory-reasoning

[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".

Python 74 2 Updated Jan 14, 2025

thu-wyz / inference_scaling

Python 62 3 Updated Nov 19, 2024

GT-RIPL / Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,543 284 Updated Mar 25, 2025

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,185 50 Updated Nov 16, 2024

WangXFng / RDRec

[ACL 2024] RDRec: Rationale Distillation for LLM-based Recommendation

Python 34 4 Updated Jan 9, 2025

kanishkg / stream-of-search

Repository for the paper Stream of Search: Learning to Search in Language

Python 142 22 Updated Feb 3, 2025

princeton-nlp / tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,198 495 Updated Jan 16, 2025

atfortes / Awesome-LLM-Reasoning

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,912 162 Updated Mar 19, 2025

madaan / self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Python 674 57 Updated Oct 4, 2024

burglarhobbit / Awesome-Medical-Large-Language-Models

Curated papers on Large Language Models in Healthcare and Medical domain

293 33 Updated Jan 13, 2025

gersteinlab / MedAgents

Python 241 36 Updated May 27, 2024

zhentingqi / rStar

Python 914 104 Updated Jan 23, 2025

bbuing9 / CoBB

Official implementation of Learning to Correct for QA Reasoning with Black-box LLMs (CoBB)

4 1 Updated Jun 26, 2024

YangLing0818 / SuperCorrect-llm

[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction

Python 66 4 Updated Mar 23, 2025

openai / prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,963 116 Updated Jun 1, 2023

epfLLM / meditron

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 1,999 191 Updated Apr 10, 2024

jxzhangjhu / Awesome-LLM-RAG

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,157 75 Updated Feb 24, 2025

xiaowu0162 / LongMemEval

Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)

Python 59 3 Updated Feb 22, 2025
