🤪
I'm all over the place
  • Sichuan University
  • China


MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,644 95 Updated Mar 7, 2025

Flash-Linear-Attention models beyond language

Python 7 Updated Mar 12, 2025

A collection of resources and papers on Diffusion Models

HTML 11,514 967 Updated Aug 1, 2024

Bring the adorable mascot girl home (ノ≧∇≦)ノ | Live2D widget for web platform

JavaScript 9,392 2,462 Updated Dec 28, 2024

PyTorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset

Python 5,000 1,103 Updated Jan 15, 2024

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Python 204 16 Updated Mar 3, 2025

This repository mainly collects interview questions for large language model (LLM) algorithm engineers

1,806 123 Updated Dec 26, 2024

Code for paper [Neat: Nonlinear Parameter-efficient Adaptation of Pre-trained Models]

Python 5 1 Updated Dec 27, 2024

Awesome Knowledge Distillation

3,606 507 Updated Mar 11, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,085 128 Updated Mar 12, 2025

✨✨Latest Advances on Multimodal Large Language Models

14,222 918 Updated Mar 5, 2025

Code for paper [Low-Rank Interconnected Adaptation across Layers]

Python 7 Updated Nov 27, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,363 330 Updated Mar 9, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,827 3,483 Updated Jul 23, 2024

Lecture materials for Cornell CS5785 Applied Machine Learning (Fall 2024)

Jupyter Notebook 476 160 Updated Dec 27, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,866 417 Updated Mar 5, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,775 2,392 Updated Aug 12, 2024

Collection of AWESOME vision-language models for vision tasks

2,563 200 Updated Dec 3, 2024

LLM inference in C/C++

C++ 76,368 11,051 Updated Mar 12, 2025

Python 97 10 Updated Jul 6, 2024

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,440 124 Updated Feb 6, 2025

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 1,151 102 Updated Dec 4, 2024

A Survey on multimodal learning research.

320 22 Updated Aug 22, 2023

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,689 823 Updated Sep 1, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 47,925 5,108 Updated Jan 22, 2025

Building blocks for foundation models.

460 21 Updated Jan 3, 2024

Fast and memory-efficient exact attention

Python 16,251 1,542 Updated Mar 12, 2025

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Python 1,137 111 Updated Mar 10, 2024

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 729 50 Updated Oct 1, 2024