Starred repositories by Kaleido0

[Large Models] Train a small 26M-parameter GPT entirely from scratch in 3 hours; inference and training both run on a personal GPU!

Python 3,128 394 Updated Dec 13, 2024

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025)

Python 510 39 Updated Jul 3, 2024
Python 5,064 301 Updated Dec 27, 2024

Official code for the paper: InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (previously: Do Role-Playing Chatbots Capture the Character Persona…

Python 61 5 Updated Oct 12, 2024

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

627 30 Updated Nov 22, 2024

Chat凉宫春日 (Chat-Haruhi-Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,873 170 Updated Aug 13, 2024

[NAACL Findings 2024] PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

Jupyter Notebook 41 6 Updated Sep 18, 2024

[EMNLP-2023] Official Codes for “Can ChatGPT Assess Human Personalities? A General Evaluation Framework”

Python 97 7 Updated Jan 15, 2024

Code and Data for the paper "Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works".

Python 14 Updated Jul 24, 2024

An open-source implementation of Mamba 2 in a single PyTorch file

Python 8 1 Updated Dec 19, 2024
Python 246 61 Updated Nov 18, 2024

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Python 257 19 Updated Apr 29, 2024

PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).

Python 784 112 Updated Jan 13, 2022

The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning

Python 38 Updated Jul 28, 2024

A PyTorch Library for Multi-Task Learning

Python 2,124 198 Updated Oct 18, 2024

A list of papers, codes and applications on multi-task learning.

59 11 Updated Oct 30, 2024

[CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"

Python 70 9 Updated Jul 15, 2024

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Python 434 56 Updated Dec 16, 2024

PyTorch Extension Library of Optimized Scatter Operations

Python 1 Updated Aug 15, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,332 109 Updated Dec 26, 2024

Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This repository contains the code for the experiments in the paper.

Cuda 45 2 Updated Oct 31, 2024
Python 52 7 Updated Dec 13, 2024
Python 36 9 Updated Oct 17, 2024

MU-LLaMA: Music Understanding Large Language Model

Python 247 17 Updated Mar 25, 2024

Implementations of several loss functions for imbalanced data, such as focal_loss, dice_loss, DSC Loss, and GHM Loss, among others

Python 253 45 Updated Mar 5, 2023

Implementation of Google's USM speech model in PyTorch

Python 26 4 Updated Nov 11, 2024

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

Cuda 758 41 Updated Dec 28, 2024

Implementation of the convolutional module from the Conformer paper, for use in Transformers

Python 375 54 Updated May 17, 2023

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 977 179 Updated Dec 22, 2023

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,657 111 Updated Dec 6, 2024