Skip to content
View zxlzr's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@spatio-temporal-cloud

Block or report zxlzr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,994 592 Updated Mar 27, 2025

Circuit-Aware Editing Enables Generalizable Knowledge Learners

Python 7 1 Updated Feb 19, 2025
1 Updated Jan 29, 2025
Python 82 6 Updated Dec 30, 2024

OneKE is a knowledge extraction framework based on a large model, with preliminary generalized knowledge extraction capabilities in both Chinese and English and in multiple fields and tasks.

HTML 13 2 Updated Mar 18, 2025

LookAhead Tuning: Safer Language Models via Partial Answer Previews

Python 11 Updated Mar 26, 2025

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Python 436 57 Updated Mar 24, 2025

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Jupyter Notebook 9 Updated Mar 25, 2025

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Python 46 3 Updated Dec 10, 2024

[ICLR 2025] Benchmarking Agentic Workflow Generation

Python 66 3 Updated Feb 19, 2025

A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following

Jupyter Notebook 26 3 Updated Jan 15, 2025

DSBench: How Far are Data Science Agents from Becoming Data Science Experts?

Jupyter Notebook 46 3 Updated Feb 19, 2025

[WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.

HTML 60 6 Updated Mar 23, 2025

[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.

Python 147 15 Updated Nov 13, 2024

Comprehensive Evaluation On Answer Calibration For Multi-Step Reasoning

Python 4 Updated Aug 18, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 734 43 Updated Aug 5, 2024
Python 51 4 Updated Oct 30, 2024

[TrustNLP@NAACL 2025] BiasEdit: Debiasing Stereotyped Language Models via Model Editing

Python 11 2 Updated Mar 3, 2025

[EMNLP 2024] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models

Python 40 1 Updated Jan 23, 2025

Exploring Model Kinship for Merging Large Language Models

Jupyter Notebook 23 2 Updated Mar 28, 2025

[NeurIPS 2024] Agent Planning with World Knowledge Model

Python 121 10 Updated Dec 17, 2024

Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1

Python 12 Updated Mar 27, 2024

《动手学大模型Dive into LLMs》系列编程实践教程

4,712 435 Updated Sep 20, 2024

[ACL 2024] OceanGPT: A Large Language Model for Ocean Science Tasks

Python 41 4 Updated Mar 23, 2025

Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]

Python 130 6 Updated Sep 20, 2024

WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining …

Jupyter Notebook 109 31 Updated Apr 27, 2024

[ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing

Python 35 3 Updated Aug 19, 2024

[ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.

Python 30 2 Updated Feb 25, 2025

[NLPCC 2024] Shared Task 10: Regulating Large Language Models

13 3 Updated Jun 12, 2024
Next