GitHub profile: yshuolu

Starred repositories

Projects based on SigLIP (Zhai et al., 2023) and Hugging Face transformers integration 🤗

Jupyter Notebook 130 10 Updated Jan 10, 2024

Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts

Jupyter Notebook 21 1 Updated Aug 31, 2024

An open source implementation of CLIP.

Python 9,937 959 Updated Aug 19, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 92,298 7,263 Updated Oct 7, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,266 6,386 Updated Oct 3, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,573 4,181 Updated Aug 19, 2024

Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-spee…

Python 1,769 252 Updated Oct 7, 2024

Skip YouTube video sponsors (browser extension)

TypeScript 10,027 321 Updated Oct 2, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,566 5,759 Updated Aug 19, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,241 1,063 Updated May 23, 2024

LLM inference in C/C++

C++ 65,908 9,464 Updated Oct 7, 2024

The official Meta Llama 3 GitHub site

Python 26,499 2,994 Updated Aug 12, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 959 79 Updated Jul 23, 2024

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

Rust 46,792 1,808 Updated Oct 3, 2024

Visualize Your Ideas With Code

TypeScript 15,946 600 Updated Oct 6, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,442 289 Updated Oct 3, 2024

JavaScript syntax tree transformer, nondestructive pretty-printer, and automatic source map generator

TypeScript 4,967 346 Updated Jul 16, 2024

An incremental parsing system for programming tools

Rust 18,267 1,403 Updated Oct 6, 2024

GritQL is a query language for searching, linting, and modifying code.

Rust 3,034 71 Updated Oct 7, 2024

Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"

Python 275 27 Updated Sep 23, 2024

Contrastive Language-Audio Pretraining

Python 1,363 133 Updated Jul 9, 2024

Simple image captioning model

Jupyter Notebook 1,290 214 Updated Jun 9, 2024

Rust+OpenCL+AVX2 implementation of LLaMA inference code

Rust 536 29 Updated Feb 12, 2024

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, code, and related websites

2,831 232 Updated Sep 9, 2024

A PyTorch implementation of EmpiricalMVM

Python 39 2 Updated Dec 18, 2023

A Survey on video and language understanding.

46 2 Updated Apr 21, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,917 4,121 Updated Oct 7, 2024

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs

Python 10,400 2,231 Updated Sep 24, 2024

Display an image created by Vulkan compute shader, with OpenGL

C++ 83 15 Updated Jun 28, 2024

An open-source framework for training large multimodal models.

Python 3,685 278 Updated Aug 31, 2024