Hi, I'm Jiasheng Gu, an algorithm engineer in Alibaba.
I am interested in Large Language Models and Vision-Language Models.
languages and tools:
📈 my github stats
Hi, I'm Jiasheng Gu, an algorithm engineer in Alibaba.
I am interested in Large Language Models and Vision-Language Models.
languages and tools:
📈 my github stats
Forked from a-r-r-o-w/finetrainers
Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed
Python 2
Forked from yizhongw/Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
Python 1
Forked from noahshinn/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Jupyter Notebook 1
Few-shot image classification based on CADA-VAE, using cosine similarity to align two modal features.
Python