Skip to content
View O1ieGao's full-sized avatar

Block or report O1ieGao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository give a guidline to learn CUDA and TensorRT from the beginning.

C++ 223 55 Updated Feb 17, 2025

🚀 The fast, Pythonic way to build MCP servers and clients

Python 5,120 245 Updated Apr 16, 2025

"PhD-Level AI Agents: Fully-Automated Scientific Discovery with Our AI-Researcher Powered by LLMs"

Python 1,383 171 Updated Apr 3, 2025

Build your own AI friend

C++ 11,516 2,204 Updated Apr 14, 2025

一个基于 WebRTC 和 Cloudflare Durable Objects 实现的简单高效的屏幕共享工具。通过 WebSocket 实现实时信令服务,配合 WebRTC 技术,实现低延迟的屏幕共享功能。只需输入投屏码,即可实现跨设备的屏幕分享。

TypeScript 208 23 Updated Jan 6, 2025

Multilingual Voice Understanding Model

Python 5,360 482 Updated Mar 23, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 13,567 949 Updated Apr 16, 2025

2018秋哈工大视听觉实验

Python 145 58 Updated Nov 18, 2019

J-Moshi: A Japanese Full-duplex Spoken Dialogue System

JavaScript 231 12 Updated Feb 13, 2025

first base model for full-duplex conversational audio

Python 1,731 111 Updated Jan 5, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 5,588 543 Updated Mar 24, 2025

On-device wake word detection powered by deep learning

Python 4,041 519 Updated Apr 16, 2025

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 18,673 2,568 Updated Apr 11, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 80,073 9,614 Updated Jan 4, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 137,523 11,438 Updated Apr 16, 2025

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 7,952 754 Updated Apr 15, 2025

WebRTC and ORTC implementation for Python using asyncio

Python 4,593 803 Updated Apr 6, 2025

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python 1,184 172 Updated Feb 5, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 49,184 4,641 Updated Apr 16, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 49,746 5,869 Updated Sep 18, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 12,393 1,389 Updated Apr 16, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 37,502 2,853 Updated Apr 16, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,305 505 Updated Feb 26, 2025

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Python 1,359 131 Updated Aug 1, 2024

Export Segment Anything Models to ONNX

Python 329 39 Updated Aug 3, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 16,132 1,475 Updated Sep 5, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,466 4,437 Updated Aug 19, 2024

Light-weight system monitor for X, Wayland, and other things, too

C++ 7,603 626 Updated Mar 17, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,862 789 Updated Aug 12, 2024

✯ 可直连访问的电视/广播图标库与相关工具项目 ✯ 🔕 永久免费 直连访问 完整开源 不断完善的台标 支持IPv4/IPv6双栈访问 🔕

JavaScript 25,312 3,849 Updated Apr 16, 2025
Next