Skip to content
View HaojunYu1998's full-sized avatar

Highlights

  • Pro

Block or report HaojunYu1998

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Witness the aha moment of VLM with less than $3.

Python 2,051 150 Updated Feb 11, 2025

veRL: Volcano Engine Reinforcement Learning for LLM

Python 2,842 238 Updated Feb 11, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 3,408 1,425 Updated Feb 9, 2025
Jupyter Notebook 3 Updated Jan 12, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 23,737 2,024 Updated Feb 10, 2025

Collection of awesome medical dataset resources.

566 46 Updated Jan 23, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,980 530 Updated Dec 25, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 6,984 498 Updated Feb 10, 2025

Segment Anything in Medical Images

Jupyter Notebook 3,236 448 Updated Oct 10, 2024

Grok open release

Python 49,901 8,335 Updated Aug 30, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,410 418 Updated Aug 7, 2024

BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks

Python 599 72 Updated Oct 25, 2024

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Python 1,510 75 Updated Sep 25, 2024

[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“

Python 251 17 Updated Dec 30, 2024
Python 6,261 1,839 Updated Feb 5, 2025

王孟源的博客镜像【可搜索】,每 6 小时更新

HTML 139 30 Updated Feb 11, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,529 429 Updated Jan 12, 2025

[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Python 328 15 Updated Jan 14, 2025

[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Python 537 39 Updated Jan 11, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,165 2,327 Updated Feb 10, 2025

Inference Llama 2 in one file of pure C

C 18,016 2,195 Updated Aug 6, 2024

Inference code for Llama models

Python 57,563 9,688 Updated Jan 26, 2025

Empowers LLMs with the ability to see and draw.

Jupyter Notebook 1 Updated Oct 8, 2023

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 39,183 5,750 Updated Feb 11, 2025

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 786 90 Updated Aug 14, 2024

活动资讯小帮手(

Python 37 1 Updated Jul 6, 2024

Official implementation for paper "Shifting More Attention to Breast Lesion Segmentation in Ultrasound Videos"

Python 25 4 Updated Nov 7, 2023
Next