Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-spee…

Python 1,769 252 Updated Oct 7, 2024

ajayyy / SponsorBlock

Skip YouTube video sponsors (browser extension)

TypeScript 10,027 321 Updated Oct 2, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,566 5,759 Updated Aug 19, 2024

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,241 1,063 Updated May 23, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 65,908 9,464 Updated Oct 7, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 26,499 2,994 Updated Aug 12, 2024

myshell-ai / JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Python 959 79 Updated Jul 23, 2024

meilisearch / meilisearch

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

Rust 46,792 1,808 Updated Oct 3, 2024

motion-canvas / motion-canvas

Visualize Your Ideas With Code

TypeScript 15,946 600 Updated Oct 6, 2024

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,442 289 Updated Oct 3, 2024

benjamn / recast

JavaScript syntax tree transformer, nondestructive pretty-printer, and automatic source map generator

TypeScript 4,967 346 Updated Jul 16, 2024

tree-sitter / tree-sitter

An incremental parsing system for programming tools

Rust 18,267 1,403 Updated Oct 6, 2024

getgrit / gritql

GritQL is a query language for searching, linting, and modifying code.

Rust 3,034 71 Updated Oct 7, 2024

kyegomez / ScreenAI

Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"

Python 275 27 Updated Sep 23, 2024

LAION-AI / CLAP

Contrastive Language-Audio Pretraining

Python 1,363 133 Updated Jul 9, 2024

rmokady / CLIP_prefix_caption

Simple image captioning model

Jupyter Notebook 1,290 214 Updated Jun 9, 2024

Noeda / rllama

Rust+OpenCL+AVX2 implementation of LLaMA inference code

Rust 536 29 Updated Feb 12, 2024

GT-RIPL / Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

2,831 232 Updated Sep 9, 2024

tsujuifu / pytorch_empirical-mvm

A PyTorch implementation of EmpiricalMVM

Python 39 2 Updated Dec 18, 2023

liveseongho / Awesome-Video-Language-Understanding

A Survey on video and language understanding.

46 2 Updated Apr 21, 2023

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,917 4,121 Updated Oct 7, 2024

Rudrabha / Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 10,400 2,231 Updated Sep 24, 2024

nvpro-samples / gl_vk_simple_interop

Display an image created by Vulkan compute shader, with OpenGL

C++ 83 15 Updated Jun 28, 2024

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 3,685 278 Updated Aug 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Phineas yshuolu

Block or report yshuolu

Starred repositories

merveenoyan / siglip

rhysdg / vision-at-a-clip

mlfoundations / open_clip

ollama / ollama

facebookresearch / fairseq

suno-ai / bark

NexaAI / nexa-sdk