Skip to content
View Luhuanz's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report Luhuanz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Android application for super-resolution & interpolation. Contains RealSR-NCNN, SRMD-NCNN, RealCUGAN-NCNN, Real-ESRGAN-NCNN, Waifu2x-NCNN, Anime4kcpp, nearest, bilinear, bicubic, AVIR...

C++ 1,214 91 Updated May 23, 2024

The Pokémon API

Python 4,516 971 Updated Feb 4, 2025

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 4,946 313 Updated Jan 22, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,319 2,344 Updated Aug 12, 2024

Aligning LMMs with Factually Augmented RLHF

Python 343 24 Updated Nov 1, 2023

EVA Series: Visual Representation Fantasies from BAAI

Python 2,409 174 Updated Aug 1, 2024
Jupyter Notebook 1 1 Updated May 18, 2022

via->yolo, yolo->via

Python 10 3 Updated Nov 20, 2024

Custom ava dataset, Multi-Person Video Dataset Annotation Method of Spatio-Temporally Actions

Python 115 19 Updated Jun 7, 2022

Student Classroom Behavior dataset

Python 253 24 Updated Jan 16, 2025

Image Annotation Tools 图像标注工具用户指南

6 3 Updated Mar 14, 2019

记录大模型相关的一些知识和方法

Jupyter Notebook 613 100 Updated Feb 7, 2025

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 62,860 9,328 Updated Feb 7, 2025

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,717 701 Updated Jan 11, 2025

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 13,044 2,912 Updated Jan 16, 2025

快速入门RAG与私有化部署

Python 149 30 Updated Apr 17, 2024
C++ 9 Updated Mar 26, 2023

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

Jupyter Notebook 951 218 Updated Dec 7, 2020

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Python 2,088 419 Updated Jul 11, 2024

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Python 3,295 255 Updated Oct 18, 2024

The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)

Python 214 52 Updated Sep 21, 2022

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 5,234 687 Updated Jan 3, 2025

The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the state-of-art performance in LRW-1000 dataset.

Python 155 37 Updated May 15, 2023
HTML 2 Updated Jan 30, 2025

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…

Python 2,298 378 Updated Jan 8, 2025

Repository for OpenCV's extra modules

C++ 9,544 5,789 Updated Feb 5, 2025

Open Source Computer Vision Library

C++ 80,429 55,959 Updated Feb 7, 2025

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,221 238 Updated Feb 7, 2025

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 13,128 3,115 Updated Feb 7, 2025

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 2,230 531 Updated Jan 27, 2025
Next