zhyj3038

zhang ya jun zhyj3038

17 followers · 78 following

JDJR
beijing

Organizations

Stars

Alibaba-NLP / OmniSearch

Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Python 223 16 Updated Feb 5, 2025

PizazzGY / TVBox_warehouse

JavaScript 298 92 Updated Dec 20, 2024

NiuTrans / Vision-LLM-Alignment

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

Python 99 6 Updated Oct 16, 2024

360CVGroup / Inner-Adaptor-Architecture

LMM which strictly superset LLM embedded

Python 38 4 Updated Nov 5, 2024

anaer / Meow

自用tvbox配置

294 56 Updated Feb 16, 2025

lichengunc / refer

Referring Expression Datasets API

Jupyter Notebook 483 79 Updated Aug 27, 2024

amazon-science / mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,865 318 Updated Jun 12, 2024

FuxiaoLiu / LRV-Instruction

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Python 270 13 Updated Mar 13, 2024

ChenDelong1999 / instruct-flamingo

🚀 Codebase and Fondation Models for Visual Instruction Tuning

Python 14 3 Updated Aug 19, 2023

ChenDelong1999 / polite-flamingo

🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)

Python 63 3 Updated Dec 9, 2023

wenliangdai / VLP-Object-Hallucination

Code Repository for the EACL 2023 paper "Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training"

7 Updated Feb 9, 2023

easychen / lean-side-bussiness

精益副业：程序员如何优雅地做副业

9,345 669 Updated Mar 28, 2024

360CVGroup / SEEChat

Multimodal chatbot with computer vision capabilities integrated

Python 100 9 Updated May 17, 2024

tylin / coco-caption

Jupyter Notebook 1,152 547 Updated May 13, 2024

liucongg / ChatGLM-Finetuning

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型，进行下游具体任务微调，涉及Freeze、Lora、P-tuning、全参微调等

Python 2,716 307 Updated Dec 12, 2023

Facico / Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案，结构参考alpaca

C 4,148 420 Updated Nov 14, 2024

chenking2020 / FindTheChatGPTer

ChatGPT爆火，开启了通往AGI的关键一步，本项目旨在汇总那些ChatGPT的开源平替们，包括文本大模型、多模态大模型等，为大家提供一些便利

2,022 201 Updated Aug 14, 2023

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,577 2,925 Updated Sep 2, 2024

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,899 347 Updated Aug 7, 2024

amazon-science / prompt-pretraining

Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"

Python 256 10 Updated May 3, 2024

thu-ml / unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,397 87 Updated May 31, 2023

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 3,823 294 Updated Aug 31, 2024

KKGo1999 / Stable-diffusion-person

由基于Stable-diffusion的Chilloutmix模型生成高清真实的人像

567 78 Updated Feb 27, 2023

bloc97 / CrossAttentionControl

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

Jupyter Notebook 1,319 87 Updated Oct 18, 2022

lllyasviel / ControlNet

Let us control diffusion models!

Python 31,489 2,820 Updated Feb 25, 2024

vnpy / vnpy

基于Python的开源量化交易平台开发框架

Python 27,344 9,119 Updated Feb 12, 2025

HongwenZhang / PyMAF-X

[TPAMI 2023] PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images

Python 243 27 Updated Sep 29, 2024

gmayday1997 / SceneChangeDet

pytorch implementation of scene change detection

Python 242 73 Updated Mar 5, 2023

williamyang1991 / VToonify

[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

Jupyter Notebook 3,564 449 Updated Oct 25, 2023

Visual-Attention-Network / SegNeXt

Official Pytorch implementations for "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022)

Python 810 85 Updated Nov 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly