Unakar

Follow

Xie Tian Unakar

Follow

LLM Reasoning (Interpretability & RLHF)

34 followers · 76 following

Shanghai AI Lab intern @open-mmlab @InternLM
Beijing, China

Achievements

Achievements

Highlights

Pro

Lists (2)

Sort

Computer Vision

Large Language Models

17 repositories

Stars

deepseek-ai / DeepSeek-V3

Python 2,273 108 Updated Dec 27, 2024

jbloomAus / SAELens

Training Sparse Autoencoders on Language Models

Jupyter Notebook 543 132 Updated Dec 15, 2024

RUCAIBox / Language-Specific-Neurons

Python 56 7 Updated Dec 23, 2024

Trustworthy-ML-Lab / CB-LLMs

Python 3 1 Updated Dec 14, 2024

ElvishElvis / LCA-on-the-line

LCA-on-the-line (ICML 2024 Oral)

Python 11 Updated Sep 26, 2024

DAMO-NLP-SG / multilingual_analysis

[NeurIPS 2024] How do Large Language Models Handle Multilingualism?

Python 16 Updated Nov 8, 2024

snap-research / weights2weights

Official Implementation of weights2weights

Jupyter Notebook 125 4 Updated Dec 16, 2024

tsb0601 / EMP-SSL

This repository contains the implementation for the paper "EMP-SSL: Towards Self-Supervised Learning in One Training Epoch."

Python 224 12 Updated Aug 21, 2023

yossigandelsman / rosetta_neurons

Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo"

Jupyter Notebook 29 5 Updated Oct 17, 2023

tsb0601 / tsb0601.github.io

Forked from jonbarron/website

HTML 2 2 Updated Dec 24, 2024

minyoungg / platonic-rep

Python 483 32 Updated Jul 29, 2024

DigiRL-agent / digirl

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 284 22 Updated Nov 26, 2024

OS-Copilot / OS-Atlas

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents

213 7 Updated Nov 19, 2024

showlab / computer_use_ootb

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,079 96 Updated Dec 17, 2024

TianxingChen / Embodied-AI-Guide

具身智能入门指南

793 38 Updated Dec 27, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 19,716 1,478 Updated Dec 27, 2024

zihangdai / xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,183 1,178 Updated May 28, 2023

lucidrains / recurrent-memory-transformer-pytorch

Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch

Python 398 15 Updated Nov 19, 2024

lucidrains / coconut-pytorch

Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch

Python 97 4 Updated Dec 24, 2024

mattneary / attention

visualizing attention for LLM users

Python 182 8 Updated Dec 14, 2024

openai / sparse_autoencoder

Python 389 39 Updated Jul 19, 2024

zjysteven / VLM-Visualizer

Visualizing the attention of vision-language models

Jupyter Notebook 91 6 Updated Oct 26, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 4,795 419 Updated Dec 27, 2024

pkunlp-icler / FastV

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 316 12 Updated Aug 12, 2024

callummcdougall / ARENA_3.0

HTML 387 235 Updated Dec 18, 2024

callummcdougall / sae_vis

Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).

HTML 173 37 Updated Dec 16, 2024

revelio-diffusion / revelio

Python 5 Updated Dec 23, 2024

tim-lawson / mlsae

Multi-Layer Sparse Autoencoders

Python 12 Updated Dec 20, 2024

EleutherAI / transformer-reasoning

Forked from OSU-NLP-Group/GrokkedTransformer

Experiments in transformer knowledge and reasoning

Jupyter Notebook 5 Updated Dec 21, 2024

EleutherAI / sae

Sparse autoencoders

Python 387 51 Updated Dec 18, 2024