
Lists (2)
Sort Name ascending (A-Z)
Starred repositories
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
Python library and CLI tool to interface with Google Translate's text-to-speech API
Train your AI self, amplify you, bridge the world
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
No fortress, purely open ground. OpenManus is Coming.
Official Firecrawl MCP Server - Adds powerful web scraping to Cursor, Claude and any other LLM clients.
Model Context Protocol (MCP) Server for dify workflows
Secure open source cloud runtime for AI apps & AI agents
Code for the manim-generated scenes used in 3blue1brown videos
Cost-efficient and pluggable Infrastructure components for GenAI inference
A simple screen parsing tool towards pure vision based GUI agent
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
Fully open reproduction of DeepSeek-R1
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages
This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fortune 500 companies.
Everything you need to build state-of-the-art foundation models, end-to-end.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
🦀 Small exercises to get you used to reading and writing Rust code!
Synchronized Translation for Videos. Video dubbing
Predict Fujifilm custom settings by image - powered by AI
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.