-
intel
- Los Angeles
Stars
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
SGLang is a fast serving framework for large language models and vision language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Examples and guides for using the OpenAI API
VQPy: An object-oriented approach to modern video analytics
A playbook for systematically maximizing the performance of deep learning models.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A curated list of awesome Machine Learning frameworks, libraries and software.
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
A curated list of automated machine learning papers, articles, tutorials, slides and projects
Systems for ML/AI & ML/AI for Systems paper reading list: A curated reading list of computer science research for work at the intersection of machine learning and systems. PR are welcome.
CS294; AI For Systems and Systems For AI
A smaller subset of 10 easily classified classes from Imagenet, and a little more French
Named Tensor implementation for Torch
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU su…
ZooKeeper client wrapper and rich ZooKeeper framework