Skip to content
View dboshardy's full-sized avatar

Block or report dboshardy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,313 489 Updated Dec 30, 2024

4 Axis 3D Printer

G-code 385 33 Updated Jan 19, 2025

GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.

Python 219 64 Updated Jan 24, 2023

Golf swing detection/extraction by computer vision and machine learning techniques. Using Roboflow's object detection model and RNNs in PyTorch

Jupyter Notebook 8 Updated Oct 6, 2024

Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,685 256 Updated Jan 4, 2025

AI Agents for Semi-Autonomous Public Goods Production

Python 7 1 Updated May 20, 2024

A small project to track and calculate the speed from a putt.

Python 14 9 Updated Oct 26, 2023

A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model

Python 549 34 Updated Dec 3, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 30,573 3,041 Updated Jan 7, 2025

[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.

29 1 Updated Jul 12, 2023

ControlNet++: All-in-one ControlNet for image generations and editing!

Python 1,841 46 Updated Sep 30, 2024

Recent LLM-based CV and related works. Welcome to comment/contribute!

851 36 Updated Jun 5, 2024

Your image is almost there!

Python 7,477 428 Updated Jul 26, 2024

A High-Quality Real Time Upscaler for Anime Video

Jupyter Notebook 18,728 1,359 Updated Aug 17, 2024
Python 41 2 Updated Apr 12, 2024

Implementation of Key-Locked Rank One Editing, from Nvidia AI

Python 231 9 Updated Sep 7, 2023

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,739 102 Updated Dec 24, 2024

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch

Python 1,946 184 Updated May 3, 2024

Tag manager and captioner for image datasets

Python 863 38 Updated Jan 21, 2025

Image composition toolbox: everything you want to know about image composition or object insertion

Python 572 37 Updated Jan 15, 2025

Adala: Autonomous DAta (Labeling) Agent framework

Python 1,039 82 Updated Jan 23, 2025

A 99% automatized pipeline to construct training set from anime and more for text-to-image model training

Jupyter Notebook 211 15 Updated May 1, 2024

πŸ† A ranked list of awesome Python open-source libraries and tools. Updated weekly.

3,781 261 Updated Aug 14, 2024

πŸ† A ranked list of awesome python developer tools and libraries. Updated weekly.

Python 1,076 50 Updated Jun 6, 2024

Improved AnimateDiff for ComfyUI and Advanced Sampling Support

Python 2,909 217 Updated Jan 23, 2025

πŸ‘· Build images with images

Python 5,955 425 Updated Oct 31, 2023

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Python 3,352 196 Updated Feb 29, 2024

GENIE: Higher-Order Denoising Diffusion Solvers

Python 89 5 Updated Oct 23, 2023
Next