Suzie Oh ohsuz

ʕ•̫͡•ʔ-̫͡-ʕ•͓͡•ʔ-̫͡-ʕ•̫͡•ʔ-̫͡-ʕ•͓͡•ʔ-̫͡-ʔ

146 followers · 145 following

Stars

qodo-ai / pr-agent

🚀 PR-Agent (Qodo Merge open-source): An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍

Python 7,004 719 Updated Feb 28, 2025

qodo-ai / qodo-cover

Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞

Python 4,854 394 Updated Feb 26, 2025

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Python 207 12 Updated Feb 24, 2025

jina-ai / node-DeepResearch

Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)

TypeScript 2,879 271 Updated Feb 28, 2025

public-apis / public-apis

A collective list of free APIs

Python 328,688 34,854 Updated Oct 31, 2024

Zhou-Zoey / RMB-Reward-Model-Benchmark

Python 23 1 Updated Feb 14, 2025

THU-KEG / RM-Bench

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 19 Updated Feb 14, 2025

KwaiKEG / KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Python 1,135 114 Updated Jun 19, 2024

Kiln-AI / Kiln

The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

Python 2,967 195 Updated Feb 28, 2025

ThomasRochefortB / open-agentinstruct

An open-source recreation of the AgentInstruct agentic workflow for synthetic data generation

Python 14 Updated Jan 13, 2025

DSXiangLi / DecryptPrompt

A summary of Prompt & LLM papers, open-source data & models, and AIGC applications

2,890 290 Updated Feb 28, 2025

itsual / Notable-LLM-Research-Papers

Curated list of research papers published in 2024 related to Large Language Models (LLM)

3 1 Updated Jan 25, 2025

pengr / LLM-Synthetic-Data

Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥

226 20 Updated Jan 24, 2025

wasiahmad / Awesome-LLM-Synthetic-Data

A reading list on LLM based Synthetic Data Generation 🔥

1,173 69 Updated Feb 20, 2025

hkust-nlp / simpleRL-reason

A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,003 223 Updated Feb 19, 2025

bespokelabsai / curator

Synthetic data curation for post-training and structured data extraction

Python 886 63 Updated Feb 28, 2025

RLHFlow / Online-RLHF

A recipe for online RLHF and online iterative DPO.

Python 488 47 Updated Dec 28, 2024

comet-ml / opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 5,182 344 Updated Feb 28, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 5,764 655 Updated Feb 23, 2025

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,169 50 Updated Nov 16, 2024

HAE-RAE / haerae-evaluation-toolkit

The most modern LLM evaluation toolkit

Python 16 4 Updated Feb 28, 2025

TIGER-AI-Lab / MAmmoTH2

Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]

Python 135 9 Updated Oct 27, 2024

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 21,767 1,929 Updated Feb 28, 2025

minseye / korean-english-context-aware-translation-dataset

Dataset for the COLING 2025 accepted paper: "A Testset for Context-Aware LLM Translation in Korean-to-English Discourse Level Translation." This dataset features 600 instances covering six linguist…

1 Updated Jan 1, 2025

minghao-wu / transagents

The official repository of the paper "(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts"

547 25 Updated Jul 5, 2024