LaunchGemini
🎯 Focusing
  • CUHK
  • Hong Kong

Starred repositories

The next generation deep reinforcement learning toolkit

Python 3,418 903 Updated Jun 16, 2023

Bounty Board is a decentralized platform designed to streamline Web3 community activities.

TypeScript 905 8 Updated Jan 17, 2025

AML end-to-end system

Java 752 106 Updated Dec 7, 2024

Secure storage system based on cryptography

Java 634 92 Updated Jan 6, 2025

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 4,773 404 Updated Jul 30, 2024

Deforum extension for AUTOMATIC1111's Stable Diffusion webui

Python 2,753 397 Updated Aug 16, 2024

Text to image synthesis using thought vectors

Python 2,163 400 Updated Jan 30, 2018

[CVPR 2018] Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks

C++ 1,078 300 Updated May 11, 2023

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

C++ 2,427 218 Updated Dec 9, 2024

Mask Transfiner for High-Quality Instance Segmentation, CVPR 2022

Python 538 62 Updated Sep 22, 2022

Major Project - Sign Language To Text Conversion Using Python, Computer Vision and Machine Learning

Python 267 111 Updated Sep 11, 2024

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Python 1,006 73 Updated Jul 23, 2024

AwesomeBump is a free program written using Qt library designed to generate normal, height, specular or ambient occlusion textures from a single image. Since the image processing is done in 99% on …

C++ 1,680 178 Updated Jan 11, 2023

Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"

Python 991 75 Updated Nov 2, 2023

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 461 64 Updated Jan 8, 2025

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

Python 2,027 160 Updated Aug 20, 2022

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook 289 14 Updated Jan 23, 2025

Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.

Python 1,192 157 Updated Sep 14, 2021

SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)

Python 3,448 684 Updated Aug 30, 2024

The NASA Vision Workbench is a general purpose image processing and computer vision library developed by the Autonomous Systems and Robotics (ASR) Area in the Intelligent Systems Division at the NA…

C++ 473 229 Updated Jan 27, 2025

This repository contains the source code of our work on designing efficient CNNs for computer vision

Python 413 82 Updated Jul 15, 2024

open Multiple View Geometry library. Basis for 3D computer vision and Structure from Motion.

C++ 5,835 1,680 Updated Jan 17, 2025

Build fully-functioning computer vision models with PyTorch

Python 616 103 Updated Jul 25, 2024

Repository for PyImageSearch Crash Course on Computer Vision and Deep Learning

Python 309 196 Updated Dec 8, 2022

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 6,636 736 Updated Jan 27, 2025

Yet another vine copula package, using PyTorch.

Python 116 31 Updated Jan 7, 2025

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 11,362 1,569 Updated Jan 29, 2025

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…

Python 4,296 613 Updated Jan 29, 2025

The first open autoregressive foundational video AI model.

2,873 713 Updated Oct 14, 2024

3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

Python 872 26 Updated Oct 17, 2024