Skip to content
View Zhou-Hangyu's full-sized avatar

Highlights

  • Pro

Block or report Zhou-Hangyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
29 stars written in Python
Clear filter

Inference code for Llama models

Python 57,098 9,642 Updated Aug 18, 2024

Let us control diffusion models!

Python 31,119 2,786 Updated Feb 25, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,854 1,983 Updated Sep 26, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,732 373 Updated Jul 11, 2024

Tools for merging pretrained large language models.

Python 5,043 467 Updated Dec 15, 2024

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,953 428 Updated Jan 5, 2025

Fuzzy String Matching in Python

Python 2,990 141 Updated Feb 27, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,636 360 Updated Oct 17, 2024

Codebase for Merging Language Models (ICML 2024)

Python 789 46 Updated May 5, 2024

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 719 53 Updated Feb 1, 2024

[NeurIPS'23] Emergent Correspondence from Image Diffusion

Python 638 35 Updated May 14, 2024

A framework for merging models solving different tasks with different initializations into one multi-task model without any additional training

Python 291 25 Updated Jan 18, 2024

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Python 177 14 Updated Oct 3, 2024

Official code repository for NeurIPS 2022 paper "SatMAE: Pretraining Transformers for Temporal and Multi-Spectral Satellite Imagery"

Python 175 20 Updated Sep 14, 2024

The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)

Python 157 14 Updated Jan 6, 2023

Official code repository for ICLR 2024 paper "DiffusionSat: A Generative Foundation Model for Satellite Imagery"

Python 134 5 Updated Sep 14, 2024

FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion

Python 100 10 Updated Jan 6, 2025
Python 78 12 Updated Jan 23, 2024

AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.

Python 58 3 Updated Oct 28, 2024
Python 55 3 Updated Dec 26, 2023
Python 50 12 Updated Feb 26, 2024
Python 45 8 Updated Oct 27, 2019
Python 40 6 Updated Jul 27, 2020

Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]

Python 36 Updated Oct 25, 2024
Python 33 5 Updated Apr 19, 2024

Official Code of IdealGPT

Python 32 8 Updated Oct 13, 2023

ESPER

Python 22 2 Updated Mar 29, 2024
Python 19 1 Updated Dec 22, 2024