Skip to content
View simonzfei's full-sized avatar

Block or report simonzfei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

476 30 Updated Jan 28, 2025

A python package to build AI-powered real-time audio applications

Python 1,175 92 Updated Feb 11, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,826 825 Updated Feb 10, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 76,017 9,085 Updated Jan 4, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,027 831 Updated Feb 11, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,520 1,276 Updated Feb 9, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 15,125 1,025 Updated Feb 8, 2025

We write your reusable computer vision tools. 💜

Python 24,849 1,864 Updated Feb 10, 2025

收藏一些电子书

4,022 1,088 Updated Feb 22, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,987 1,417 Updated Dec 25, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,248 1,725 Updated Feb 10, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,257 708 Updated Dec 17, 2024

【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。

1,041 158 Updated Feb 11, 2025

CVNets: A library for training computer vision networks

Python 1,824 237 Updated Oct 30, 2023

Open-Set Grounded Text-to-Image Generation

Python 2,072 155 Updated Mar 6, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 46,327 7,989 Updated Feb 11, 2025

Official inference repo for FLUX.1 models

Python 20,074 1,399 Updated Feb 6, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,412 419 Updated Aug 7, 2024

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 46,990 3,656 Updated Feb 11, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Python 40,034 5,139 Updated Oct 10, 2024

The official Meta Llama 3 GitHub site

Python 28,257 3,267 Updated Jan 26, 2025

大模型基础: 一文了解大模型基础知识

3,801 340 Updated Feb 4, 2025

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,005 4,882 Updated Aug 1, 2024

Inference code for Llama models

Python 57,563 9,688 Updated Jan 26, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 23,274 2,299 Updated Jan 22, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,568 1,192 Updated Feb 1, 2025

Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)

Jupyter Notebook 394 61 Updated Mar 20, 2023

[CSUR] A Survey on Video Diffusion Models

1,935 97 Updated Dec 9, 2024

⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。

Python 6,397 1,157 Updated Jan 16, 2025
Next