Stars
🦜🔗 Build context-aware reasoning applications
A latent text-to-image diffusion model
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A High-Quality Real Time Upscaler for Anime Video
📡 Simple and ready-to-use tutorials for TensorFlow
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术
A simple screen parsing tool towards pure vision based GUI agent
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
This is the official repository for the LENS (Large Language Models Enhanced to See) system.