Skip to content
View zhyj3038's full-sized avatar

Organizations

@360CVGroup

Block or report zhyj3038

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Python 223 16 Updated Feb 5, 2025
JavaScript 298 92 Updated Dec 20, 2024

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

Python 99 6 Updated Oct 16, 2024

LMM which strictly superset LLM embedded

Python 38 4 Updated Nov 5, 2024

自用tvbox配置

294 56 Updated Feb 16, 2025

Referring Expression Datasets API

Jupyter Notebook 483 79 Updated Aug 27, 2024

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,865 318 Updated Jun 12, 2024

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Python 270 13 Updated Mar 13, 2024

🚀 Codebase and Fondation Models for Visual Instruction Tuning

Python 14 3 Updated Aug 19, 2023

🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)

Python 63 3 Updated Dec 9, 2023

Code Repository for the EACL 2023 paper "Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training"

7 Updated Feb 9, 2023

精益副业:程序员如何优雅地做副业

9,345 669 Updated Mar 28, 2024

Multimodal chatbot with computer vision capabilities integrated

Python 100 9 Updated May 17, 2024
Jupyter Notebook 1,152 547 Updated May 13, 2024

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等

Python 2,716 307 Updated Dec 12, 2023

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

C 4,148 420 Updated Nov 14, 2024

ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利

2,022 201 Updated Aug 14, 2023

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,577 2,925 Updated Sep 2, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,899 347 Updated Aug 7, 2024

Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"

Python 256 10 Updated May 3, 2024

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,397 87 Updated May 31, 2023

An open-source framework for training large multimodal models.

Python 3,823 294 Updated Aug 31, 2024

由基于Stable-diffusion的Chilloutmix模型生成高清真实的人像

567 78 Updated Feb 27, 2023

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

Jupyter Notebook 1,319 87 Updated Oct 18, 2022

Let us control diffusion models!

Python 31,489 2,820 Updated Feb 25, 2024

基于Python的开源量化交易平台开发框架

Python 27,344 9,119 Updated Feb 12, 2025

[TPAMI 2023] PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images

Python 243 27 Updated Sep 29, 2024

pytorch implementation of scene change detection

Python 242 73 Updated Mar 5, 2023

[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

Jupyter Notebook 3,564 449 Updated Oct 25, 2023

Official Pytorch implementations for "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022)

Python 810 85 Updated Nov 22, 2022
Next