Stars
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Track and Collaborate on ML & AI Experiments.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
SmartCodable is a data parsing library based on Codable. It is simple to use, with robust compatibility being one of its main features. SmartCodable 是基于Codable实现的数据解析库。简单易用,强悍的兼容性是SmartCodable的主要特点…
《AI 研发提效:构建 AI 辅助编码助手》 —— 介绍如何 DIY 一个端到端(从 IDE 插件、模型选型、数据集构建到模型微调)的 AI 辅助编程工具,类似于 GitHub Copilot、JetBrains AI Assistant、AutoDev 等。
A Blazing Fast AI Gateway with integrated Guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
ArchGuard is a architecture workbench, also for architecture governance, which can analysis architecture in container, component, code level, create architecure fitness functions, and anaysis syste…
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
A pure Unix shell script implementing ACME client protocol
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
Go DDD example application. Complete project to show how to apply DDD, Clean Architecture, and CQRS by practical refactoring.
Generate Go client and server boilerplate from OpenAPI 3 specifications
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Self-hosted version of OpenAI’s new stateful Assistants API
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
Web Speech API で音声認識した結果の字幕をWebカメラ映像に重ねて表示するWebページ
chrome netease music extension with mediaSession support
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Faster Whisper transcription with CTranslate2
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用