HUST
- Wuhan, Hubei, China
- https://whonamedcody.github.io
Starred repositories
Official implementation of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"
The download link for the dataset LAD.
Awesome artificial intelligence (AI) and large language model (LLM) for education papers.
Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
This repository is maintained to release the dataset and models for multimodal puzzle reasoning.
A Framework of Small-scale Large Multimodal Models
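This project aims to share the technical principles behind large models, along with hands-on experience (large-model engineering and deployment of large-model applications).
FinGLM: dedicated to building an open, public-interest, long-lasting financial large-model project that uses open source and openness to advance "AI + finance".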
971946256 / Enterprise-Registration-Data-of-Chinese-Mainland
Forked from kinginsun/Enterprise-Registration-Data-of-Chinese-Mainland
This is the official repository for "DetFusion: A Detection-driven Infrared and Visible Image Fusion Network" (ACM MM 2022).
This is the official repository for “Drone-based RGB-Infrared Cross-Modality Vehicle Detection via Uncertainty-Aware Learning” (IEEE T-CSVT 2022).
All deep learning-based infrared and visible image fusion algorithms in a whole framework
RGB-T Crowd Counting from Drone: A Benchmark and MMCCN Network
✨✨Latest Advances on Multimodal Large Language Models
LLMDet is a text detection tool that can identify which source a text came from (e.g. a large language model or a human writer).
Detect AI-generated text [relatively] quickly via compression ratios
A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current issues and future directions.
[CVPR 2023] DynamicDet: A Unified Dynamic Architecture for Object Detection
FocalNet / FocalNet-DINO
Forked from IDEA-Research/DINO
This repo contains the code and configuration files for reproducing object detection results of FocalNets with DINO
A General NeRF Acceleration Toolbox in PyTorch.
OpenXRLab Neural Radiance Field (NeRF) Toolbox and Benchmark
WebUI extension for ControlNet
Basic data mining model, including feature importance display
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.