- Baidu
- Beijing
- http://blog.csdn.net/dongdong230
Stars
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
AI magic meets an infinite drawing board.
A fast gigapixel processing system
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
Structure your STEM essay in several minutes with Generative AI.
Comprehensive Deep Learning Tutorial: From Zero To Hero
This project is the official implementation of "DiffIR: Efficient Diffusion Model for Image Restoration" (ICCV 2023).
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
Python tool to transpile a tf.keras model into a circom circuit
This implementation contains the application of GPlearn's symbolic transformer on a commodity futures sector of the financial market.
A code repository designed to show the best GitHub has to offer.
In this repository, you will learn how the code works in VITS (Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) in Jupyter Notebooks, including normalizing da…
AI solution for Patent Classification
A machine learning-driven solution designed to detect fraudulent activities in bank payment systems
4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions
WorldGPT: Empowering LLM as Multimodal World Model
An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator
Teaches you to implement a deep learning framework step by step, using only basic Python syntax and NumPy