View LAYDOWN-J's full-sized avatar


[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Python 307 17 Updated May 27, 2024

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

Python 3,215 432 Updated Jun 25, 2023

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,509 92 Updated Dec 11, 2024

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

Python 89 9 Updated Sep 13, 2024

[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

Python 115 11 Updated May 21, 2023

Code Release for MViTv2 on Image Recognition.

Python 412 47 Updated Nov 26, 2024

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

Python 703 86 Updated Aug 25, 2021

The official PyTorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Python 1,595 217 Updated Apr 9, 2024

Implementation of ViViT: A Video Vision Transformer

Python 520 66 Updated Jun 21, 2021

An effective multimodal representation and fusion method for multimodal intent recognition

Python 6 1 Updated Jun 7, 2024

Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.

Python 41 Updated Dec 14, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,078 486 Updated May 3, 2024

A course on aligning smol models.

Jupyter Notebook 3,774 1,225 Updated Dec 30, 2024

Instruction Tuning with GPT-4

HTML 4,251 301 Updated Jun 11, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,706 4,056 Updated Jul 17, 2024

A curated list of practical guides and resources for LLMs (LLMs Tree, Examples, Papers)

9,610 741 Updated May 31, 2024

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Python 1,286 195 Updated Jan 3, 2025

Tiny Kinetics-400 for testing

Python 86 10 Updated Feb 21, 2024

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 874 62 Updated Jul 6, 2024

The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"

Python 189 10 Updated Apr 10, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,969 2,305 Updated Aug 12, 2024

Learn OpenCV: C++ and Python Examples

Jupyter Notebook 21,483 11,642 Updated Jan 2, 2025

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 249 44 Updated Nov 20, 2024

Wrapper to expose Kinect for Windows v2 API in Python

Python 506 236 Updated Mar 7, 2023

An opinionated list of awesome Python frameworks, libraries, software and resources.

Python 229,411 25,076 Updated Aug 11, 2024

😎 Awesome lists about all kinds of interesting topics

340,190 28,195 Updated Dec 12, 2024

🪄 Create rich visualizations with AI

TypeScript 1,431 87 Updated Jan 2, 2025

Intel® RealSense™ SDK

C++ 7,684 4,831 Updated Jan 2, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,946 345 Updated Dec 24, 2024