- Germany
Highlights
- Pro
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Context parallel attention that accelerates DiT model inference with dynamic caching
๐ฏ Queue background jobs inspector
lucasresck / gnome-shell-extension-alt-tab-scroll-workaround
Forked from buzztaiki/gnome-shell-extension-alt-tab-move-mouseQuick fix to the bug where scrolling in one application is repeated in another when switching between them using Alt+Tab (e.g., VS Code and Chrome)
A 3D Gaussian Splatting framework with various derived algorithms and an interactive web viewer
Open source Raspberry Pi 5 compatible Micro Four Thirds camera module based on IMX294
[NeurIPS 2024 Spotlight] Implementation of the paper "3D Gaussian Splatting as Markov Chain Monte Carlo"
PixArt-ฮฃ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Ring attention implementation with flash attention
Open-Sora: Democratizing Efficient Video Production for All
Implementation of ๐ Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
๐ The OpenAPI to TypeScript codegen. Generate clients, SDKs, validators, and more. Support: @mrlubos. Join the Hey API platform ๐ [email protected]
A Home Assistant Integration for Bambu Lab Printers
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
An unofficial cuda assembler, for all generations of SASS, hopefully ๏ผ๏ผ
Terraform Provider for Genesis Cloud
This is a Helm plugin which map deprecated or removed Kubernetes APIs in a release to supported APIs
Simple, safe way to store and distribute tensors
Legendary - A free and open-source replacement for the Epic Games Launcher
Visualize Your Ideas With Code
Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes
๐ค Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.