Skip to content
View Gaotianhong's full-sized avatar

Highlights

  • Pro

Block or report Gaotianhong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks

Python 1,599 228 Updated Jan 3, 2025

High-quality datasets, tools, and concepts for LLM fine-tuning.

2,185 187 Updated Dec 26, 2024

An Open Large Reasoning Model for Real-World Solutions

Python 1,314 68 Updated Nov 28, 2024

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python 275 15 Updated Sep 15, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 21,310 2,088 Updated Jan 3, 2025

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 5,625 572 Updated Dec 5, 2024

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Python 123 4 Updated Dec 17, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,349 847 Updated Jan 2, 2025

Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Python 196 12 Updated Dec 10, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Jupyter Notebook 6,541 436 Updated Dec 22, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 6,984 528 Updated Jan 2, 2025
Python 47 Updated Dec 13, 2024

A Self-Training Framework for Vision-Language Reasoning

Python 54 1 Updated Nov 13, 2024
HTML 73 7 Updated May 10, 2024

ACL 2024: LoRA-Flow Dynamic LoRA Fusion for Large Language Models in Generative Tasks

Python 11 Updated Oct 9, 2024

WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge

Python 112 11 Updated Nov 11, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,519 201 Updated Dec 5, 2024

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 586 33 Updated Nov 26, 2024

精选机器学习,NLP,图像识别, 深度学习等人工智能领域学习资料,搜索,推荐,广告系统架构及算法技术资料整理。算法大牛笔记汇总

3,224 492 Updated Apr 15, 2024
Python 173 Updated Sep 11, 2024

致力于实习/校招/社招进大厂打法,计算机基础知识学习,C++、Java、算法学习路线,专注于编程打法!

1,226 76 Updated Aug 15, 2021

[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models

Python 142 13 Updated Jan 1, 2025

CV Homework

Python 1 Updated Nov 22, 2023

MICCAI 2024 - Loose Lesion Location Self-supervision Enhanced Colorectal Cancer Diagnosis

Python 2 Updated Oct 10, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,184 147 Updated Sep 3, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,249 461 Updated Nov 6, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,753 480 Updated Aug 6, 2024

数据挖掘、计算机视觉、自然语言处理、推荐系统竞赛知识、代码、思路

Jupyter Notebook 4,336 1,068 Updated Oct 8, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

17,223 1,640 Updated Sep 19, 2024

Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞

Python 4,700 362 Updated Jan 3, 2025
Next