- University of Washington
- Seattle, WA
- xzhu27.me
- @xiangfeng_zhu
Stars
A Datacenter Scale Distributed Inference Serving Framework
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
A high-throughput and memory-efficient inference and serving engine for LLMs
Service Weaver port of the GCP Online Boutique demo application. See https://github.com/GoogleCloudPlatform/microservices-demo.
Expressive, Easy-to-build, and High-performance Application Networks
A curated list of awesome cloud native tools, software, and tutorials. - https://jimmysong.io/awesome-cloud-native/
WebAssembly for Proxies (Rust SDK)
Packet, where are you? -- eBPF-based Linux kernel networking debugger
Instant Kubernetes-Native Application Observability
gRPC proxy is a Go reverse proxy that allows for rich routing of gRPC calls with minimum overhead.
Compilation of P4 exercises, examples, documentation, and slides for learning or teaching
"How to Do Great Research" Course for Ph.D. Students
🐎 Benchmarks for Inter-Process-Communication Techniques
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Batfish is a network configuration analysis tool that can find bugs and guarantee the correctness of (planned or current) network configurations. It enables network engineers to rapidly and safely …
Programming framework for writing and deploying cloud applications.
A suite of gRPC debugging tools. Like Fiddler/Charles but for gRPC.