Skip to content
View Cartus's full-sized avatar
🍭
Focusing
🍭
Focusing

Highlights

  • Pro

Block or report Cartus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

660 34 Updated Apr 22, 2025

MMR1: Advancing the Frontiers of Multimodal Reasoning

155 5 Updated Mar 17, 2025

A series of technical report on Slow Thinking with LLM

Python 647 35 Updated Apr 13, 2025
Python 10 Updated Nov 20, 2024

Efficient triton implementation of Native Sparse Attention.

Python 138 5 Updated Apr 10, 2025

CiteCheck: Towards Accurate Citation Faithfulness Detection

Python 1 Updated Feb 18, 2025

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

Python 107 3 Updated Apr 22, 2025

Official Repo for Open-Reasoner-Zero

Python 1,883 97 Updated Apr 8, 2025

Latest Advances on System-2 Reasoning

Python 954 44 Updated Apr 23, 2025

nsfc - 国家自然科学基金项目LaTeX模版(面青地)

TeX 472 129 Updated Mar 15, 2025

Code for EMNLP 2024 paper "DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering"

Python 4 Updated Nov 28, 2024
2 Updated Nov 1, 2024
Python 14 Updated Nov 16, 2024

This is the official implementation of the paper: SwiftCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning

Python 11 1 Updated Apr 1, 2025

The source code of paper "Read it Twice: Towards Faithfully Interpretable Fact Verification by Revisiting Evidence" at SIGIR 2023.

Python 7 1 Updated Jun 8, 2023

PyTorch implementation for our proposed CFIE in EMNLP 2021 paper "Uncovering Main Causalities for Long-tailed Information Extraction".

Python 27 2 Updated Jan 5, 2022
Python 44 Updated Oct 28, 2024

An index of algorithms for reinforcement learning from human feedback (rlhf))

93 3 Updated Apr 17, 2024

[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Python 189 12 Updated Dec 3, 2024

OptiBench and ReSocratic Synthesis Method

Python 18 Updated Mar 25, 2025

The official GitHub repo for the paper "DebateQA: Evaluating Question Answering on Debatable Knowledge"

Python 9 Updated Mar 1, 2025

This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.

Jupyter Notebook 60 10 Updated Oct 9, 2024

This the implementation of LeCo

Python 32 1 Updated Jan 20, 2025
Python 28 Updated Jan 10, 2025
11 Updated Jun 5, 2024

This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"

Python 48 1 Updated Oct 31, 2024

[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"

112 5 Updated Sep 21, 2024
Next