YanzhaoShi

🎯

Focusing

Yanzhao Shi YanzhaoShi

🎯

Focusing

Beijing University of Technology

3 followers · 1 following

Beijing University of Technology

Highlights

Stars

jiahaoli57 / Call-for-Reviewers

This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals

686 16 Updated Jan 13, 2025

Arise-zwy / CIRI

5 Updated Aug 13, 2023

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,414 418 Updated Jan 12, 2025

Pengxin-Guo / HyperFL

A New Federated Learning Framework Against Gradient Inversion Attacks [AAAI 2025].

Python 9 Updated Dec 11, 2024

YanzhaoShi / MEPNet

1 Updated Dec 10, 2024

nachifur / RDDM

CVPR 2024: Residual Denoising Diffusion Models

Python 438 39 Updated Jan 11, 2025

dvlab-research / VisionZip

Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"

Python 219 9 Updated Dec 28, 2024

FoundationVision / Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 873 31 Updated Jan 12, 2025

ByteFlow-AI / TokenFlow

🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 225 1 Updated Dec 28, 2024

CVMI-Lab / clip-beyond-tail

(NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights

Jupyter Notebook 22 1 Updated Oct 28, 2024

SunzeY / X-Prompt

Official implementation of X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

145 3 Updated Dec 3, 2024

Chauncey-Jheng / PCRL-MRG

This is the official code for the paper "See Detail Say Clear: Towards Brain CT Report Generation via Pathological Clue-driven Representation Learning" (EMNLP2024).

Python 4 Updated Dec 16, 2024

BAAI-DCAI / M3D

M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models

Python 236 13 Updated Dec 22, 2024

ljy19970415 / AutoRG-Brain

The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".

Python 24 Updated Nov 18, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,119 2,325 Updated Aug 12, 2024

meta-llama / llama-models

Utilities intended for use with Llama models.

Python 5,605 933 Updated Jan 15, 2025

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,993 3,407 Updated Jul 23, 2024

mit-han-lab / duo-attention

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 417 24 Updated Oct 31, 2024

GAIR-NLP / O1-Journey

O1 Replication Journey

1,875 57 Updated Jan 14, 2025

facebookresearch / mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,498 1,233 Updated Jul 23, 2024

f / awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

HTML 117,758 15,937 Updated Jan 14, 2025

Pengxin-Guo / FedSA-LoRA

Selective Aggregation for Low-Rank Adaptation in Federated Learning

Python 12 1 Updated Oct 2, 2024

amazon-science / mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,851 317 Updated Jun 12, 2024

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 15,933 2,321 Updated Jan 17, 2025

microsoft / LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 1,672 203 Updated Aug 13, 2024