- Baidu
- Beijing
- http://blog.csdn.net/dongdong230
Stars
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
AI magic meets an infinite drawing board.
A fast gigapixel processing system
Structure your STEM essay in several minutes with Generative AI.
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Comprehensive Deep Learning Tutorial: From Zero To Hero
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
This project is the official implementation of 'DiffIR: Efficient Diffusion Model for Image Restoration', ICCV 2023
Python tool to transpile a tf.keras model into a circom circuit
This implementation contains the application of GPlearn's symbolic transformer on a commodity futures sector of the financial market.
A code repository designed to show the best GitHub has to offer.
AI solution for Patent Classification
In this repository, you will learn how the code works in VITS (Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) in Jupyter Notebooks, including normalizing da…
A machine learning-driven solution designed to detect fraudulent activities in bank payment systems
4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions
WorldGPT: Empowering LLM as Multimodal World Model
An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator
Teaches you to build a deep learning framework step by step, using only basic Python syntax and NumPy