Skip to content
View Unakar's full-sized avatar

Highlights

  • Pro

Block or report Unakar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 2,273 108 Updated Dec 27, 2024

Training Sparse Autoencoders on Language Models

Jupyter Notebook 543 132 Updated Dec 15, 2024
Python 3 1 Updated Dec 14, 2024

LCA-on-the-line (ICML 2024 Oral)

Python 11 Updated Sep 26, 2024

[NeurIPS 2024] How do Large Language Models Handle Multilingualism?

Python 16 Updated Nov 8, 2024

Official Implementation of weights2weights

Jupyter Notebook 125 4 Updated Dec 16, 2024

This repository contains the implementation for the paper "EMP-SSL: Towards Self-Supervised Learning in One Training Epoch."

Python 224 12 Updated Aug 21, 2023

Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo"

Jupyter Notebook 29 5 Updated Oct 17, 2023
HTML 2 2 Updated Dec 24, 2024
Python 483 32 Updated Jul 29, 2024

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 284 22 Updated Nov 26, 2024

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents

213 7 Updated Nov 19, 2024

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,079 96 Updated Dec 17, 2024

具身智能入门指南

793 38 Updated Dec 27, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 19,716 1,478 Updated Dec 27, 2024

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,183 1,178 Updated May 28, 2023

Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch

Python 398 15 Updated Nov 19, 2024

Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch

Python 97 4 Updated Dec 24, 2024

visualizing attention for LLM users

Python 182 8 Updated Dec 14, 2024
Python 389 39 Updated Jul 19, 2024

Visualizing the attention of vision-language models

Jupyter Notebook 91 6 Updated Oct 26, 2024

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 4,795 419 Updated Dec 27, 2024

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 316 12 Updated Aug 12, 2024

Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).

HTML 173 37 Updated Dec 16, 2024
Python 5 Updated Dec 23, 2024

Multi-Layer Sparse Autoencoders

Python 12 Updated Dec 20, 2024

Experiments in transformer knowledge and reasoning

Jupyter Notebook 5 Updated Dec 21, 2024

Sparse autoencoders

Python 387 51 Updated Dec 18, 2024
Next