Skip to content
View blissrat's full-sized avatar
🐀
seeking
🐀
seeking

Block or report blissrat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

VideoLLaMA 2: Improving Video-LLMs with Convolutional Spatial-Temporal Aggregation and Stronger Audio Capability

Python 4 Updated Jan 20, 2025

[cvpr2023] implementation of out-of-candidate rectification methods

16 1 Updated Feb 28, 2023

This project aim to reproducing Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

Python 2 Updated Mar 18, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 5 Updated Aug 19, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 905 40 Updated Sep 27, 2024

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,827 278 Updated Dec 21, 2024

[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective

Python 63 2 Updated Oct 31, 2024

[CVPR 2025] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training scheme.

Python 232 11 Updated Jan 16, 2025

Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.

Python 21 Updated Dec 30, 2024

✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Python 41 2 Updated Oct 17, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,940 269 Updated Jun 4, 2024

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 248 14 Updated Oct 7, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,101 72 Updated Jan 23, 2025

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 4 2 Updated Oct 15, 2021

2021 Fall Computer Vision (Jian Zhang)

Jupyter Notebook 3 Updated Nov 16, 2021

轻小说文库 epub 解析打包

Python 20 3 Updated May 3, 2020

The implementation of VectorNet. Done and Lose

Python 41 7 Updated Jun 21, 2020

Space Invaders game implemented with VHDL

VHDL 153 17 Updated Feb 10, 2016

Implementation of YOLO v3 object detector in Tensorflow (TF-Slim)

Python 887 349 Updated May 15, 2023

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

C 21,944 7,967 Updated Nov 6, 2024

This is a robot project for television live. System will tracking the host's face, making the face in the middle of the screen. Main algorithm is Yolov3, trained on WIDER FACE and tested on FDDB be…

Python 63 13 Updated Aug 2, 2018

Face detection with darknet on WIDER FACE

C 37 19 Updated Feb 27, 2020

LabelImgTool is a graphical image annotation tool which supports CLS,DET and SEG(semantic&instance )

Python 209 73 Updated Aug 26, 2019

darknet深度学习框架源码分析:详细中文注释,涵盖框架原理与实现语法分析

C 1,606 487 Updated Nov 7, 2018

Complete YOLO v3 TensorFlow implementation. Support training on your own dataset.

Python 1,555 578 Updated Sep 16, 2022

Reproduce MTCNN using Tensorflow

Python 1,508 710 Updated Dec 16, 2019

General code to convert a trained keras model into an inference tensorflow model

Python 1,661 538 Updated Nov 23, 2020

Deep learning-based Face detection using the YOLOv3 algorithm (https://github.com/sthanhng/yoloface)

Python 460 179 Updated Jun 23, 2022

🚀 😏 Near Real Time CPU Face detection using deep learning

Python 551 148 Updated Dec 22, 2019

(WARNING: This repository is NO LONGER maintained ) Real time face detection and recognition base on opencv/tensorflow/mtcnn/facenet

Python 896 416 Updated Jan 28, 2017
Next