Skip to content
View mnshukla95's full-sized avatar

Block or report mnshukla95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

360 20 Updated Apr 24, 2024

An iOS app that visually clones Spotify's app and consumes the official Spotify's Web API to show(and play) songs, podcasts, artists and more.

Swift 262 61 Updated Sep 4, 2023

Autonomous agents for everyone

TypeScript 14,995 4,839 Updated Mar 12, 2025

The fastest way to build robust AI agents

Python 1,710 152 Updated Mar 7, 2025

Make Discord your LLM frontend ● Supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more)

Python 491 93 Updated Mar 1, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,184 759 Updated Mar 12, 2025

A SwiftUI Mastodon client

Swift 5,898 575 Updated Feb 10, 2025

Run Segment Anything Model 2 on a live video stream

Jupyter Notebook 313 56 Updated Jan 24, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,810 173 Updated Dec 21, 2024

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Python 1,201 66 Updated Feb 27, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 31,289 3,151 Updated Jan 7, 2025

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

1,460 59 Updated Jan 7, 2025

zero-dependency browser-based video editor

JavaScript 957 76 Updated Jan 6, 2023

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 30,545 2,583 Updated Mar 11, 2025

a self-hosted webui for 30+ generative ai

Python 560 67 Updated Mar 11, 2025

real time face swap and one-click video deepfake with only a single image

Python 44,589 6,573 Updated Mar 6, 2025

Create Videos with Code

TypeScript 3,123 127 Updated Feb 24, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,462 1,499 Updated Dec 25, 2024

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Python 1,857 126 Updated Feb 23, 2024

Voice memo app for iOS with multi-level organization, built with Swift and SwiftUI

Swift 3 Updated Jul 21, 2024
Swift 1 Updated Aug 15, 2023

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 13,166 2,921 Updated Mar 10, 2025
Jupyter Notebook 31 3 Updated Dec 18, 2023

Fast Segment Anything

Python 7,758 721 Updated Jul 30, 2024

pytorch implementation for "Deep Flow-Guided Video Inpainting"(CVPR'19)

Python 2,363 450 Updated Dec 8, 2022
Python 691 47 Updated May 6, 2024

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

Python 762 67 Updated Jun 3, 2023

Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)

Python 231 18 Updated Oct 19, 2022

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,897 1,454 Updated Sep 5, 2024
Next