-
14:37
(UTC +08:00) - florinshen.github.io
Highlights
- Pro
Stars
[ECCV2024] FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally
[ECCV2024] RS-NeRF: Neural Radiance Fields from Rolling Shutter Images
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
The official repository for GStex: Per-Primitive Texturing of 2D Gaussian Splatting for Decoupled Appearance and Geometry Modeling
3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
[ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image
Kolmogorov-Arnold Transformer: A PyTorch Implementation with CUDA kernel
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
Official implementation of the SIGGRAPH 2024 paper "N-Dimensional Gaussians for Fitting of High Dimensional Functions"
Official code of "LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation"
We present Object Images (Omages): An homage to the classic Geometry Images.
Official inference repo for FLUX.1 models
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
✨✨Latest Advances on Multimodal Large Language Models
[ECCV 2024 - Oral] Analytic-Splatting Anti-Aliased 3D Gaussian Splatting via Analytic Integration
Gaga: Group Any Gaussians via 3D-aware Memory Bank
Official Implementation of Rethinking Score Distillation as a Bridge Between Image Distributions
A differentiable point-based rendering framework.