YanzhaoShi
🎯 Focusing
  • Beijing University of Technology


This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals

686 16 Updated Jan 13, 2025
5 Updated Aug 13, 2023

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,414 418 Updated Jan 12, 2025

A New Federated Learning Framework Against Gradient Inversion Attacks [AAAI 2025].

Python 9 Updated Dec 11, 2024
1 Updated Dec 10, 2024

CVPR 2024: Residual Denoising Diffusion Models

Python 438 39 Updated Jan 11, 2025

Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"

Python 219 9 Updated Dec 28, 2024

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 873 31 Updated Jan 12, 2025

🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 225 1 Updated Dec 28, 2024

(NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights

Jupyter Notebook 22 1 Updated Oct 28, 2024

Official implementation of X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

145 3 Updated Dec 3, 2024

This is the official code for the paper "See Detail Say Clear: Towards Brain CT Report Generation via Pathological Clue-driven Representation Learning" (EMNLP2024).

Python 4 Updated Dec 16, 2024

M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models

Python 236 13 Updated Dec 22, 2024

The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".

Python 24 Updated Nov 18, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,119 2,325 Updated Aug 12, 2024

Utilities intended for use with Llama models.

Python 5,605 933 Updated Jan 15, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,993 3,407 Updated Jul 23, 2024

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 417 24 Updated Oct 31, 2024

O1 Replication Journey

1,875 57 Updated Jan 14, 2025

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 7,498 1,233 Updated Jul 23, 2024

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

HTML 117,758 15,937 Updated Jan 14, 2025

Selective Aggregation for Low-Rank Adaptation in Federated Learning

Python 12 1 Updated Oct 2, 2024

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,851 317 Updated Jun 12, 2024

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 15,933 2,321 Updated Jan 17, 2025

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 1,672 203 Updated Aug 13, 2024

I decided to sync up this repo and self-critical.pytorch. (The old master is kept in the old master branch for archival purposes.)

Python 1,456 420 Updated Oct 5, 2023
Python 3 Updated Nov 1, 2023

The official code for "SegVol: Universal and Interactive Volumetric Medical Image Segmentation".

Python 282 25 Updated Oct 23, 2024

The original code for paper "Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation"

Python 12 1 Updated Oct 30, 2024