Stars
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, …
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
Azure Machine Learning for Visual Studio Code, previously called Visual Studio Code Tools for AI, is an extension to easily build, train, and deploy machine learning models to the cloud or the edge…
A validation and profiling tool for AI infrastructure
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
nnScaler: Compiling DNN models for Parallel Training
Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.
FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of lists of statically-shaped tensors, referred to as a Fractal…