🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
A programming framework for agentic AI 🤖 PyPI: autogen-agentchat
A generative speech model for daily dialogue.
Easily train a good voice conversion (VC) model with 10 minutes of voice data or less!
Create Customized Software from a Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Open-Sora: Democratizing Efficient Video Production for All
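Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments.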
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
ChatGLM3 series: Open-Source Bilingual Chat LLMs
💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, OpenAI, and more.
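An open-source Python project, "The Road to Self-Taught Programming": step-by-step tutorials covering an AI lab, curated videos, data structures, study guides, hands-on machine learning, hands-on deep learning, web crawlers, interview experiences from major tech companies, programmer life, and shared resources.
Simple reinforcement learning tutorials: Chinese-language AI teaching from 莫烦Python (Morvan Python).
ImageBind: One Embedding Space to Bind Them All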
Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!
A fluent design widgets library based on C++ Qt/PyQt/PySide. Make Qt Great Again.
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Code and dataset for photorealistic Codec Avatars driven from audio
TensorFlow implementation of Human-Level Control through Deep Reinforcement Learning