Funtowicz Morgan mfuntowicz

@huggingface | Head of Machine Learning Optimizations

290 followers · 17 following

@huggingface
Paris, France

Organizations

Lists (1)

🚀 My stack

1 repository

Starred repositories

NVIDIA / TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…

Python 716 55 Updated Feb 19, 2025

AI-Hypercomputer / maxtext

A simple, performant and scalable Jax LLM!

Python 1,626 321 Updated Feb 21, 2025

huggingface / optimum-quanto

A pytorch quantization backend for optimum

Python 883 70 Updated Jan 10, 2025

huggingface / optimum-nvidia

Python 930 96 Updated Feb 6, 2025

iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 2,999 663 Updated Feb 21, 2025

NVIDIA / FasterTransformer

Transformer related optimization, including BERT, GPT

C++ 6,031 898 Updated Mar 27, 2024

llvm / torch-mlir

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,433 533 Updated Feb 19, 2025

tensorflow / mlir-hlo

MLIR 405 72 Updated Feb 21, 2025

openxla / stablehlo

Backward compatible ML compute opset inspired by HLO/MHLO

MLIR 446 126 Updated Feb 21, 2025

huggingface / optimum-habana

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Python 171 233 Updated Feb 21, 2025

milvus-io / milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 32,603 3,038 Updated Feb 21, 2025

microsoft / windows-samples-rs

Rust 188 27 Updated Jan 25, 2022

UKPLab / sentence-transformers

State-of-the-Art Text Embeddings

Python 16,031 2,546 Updated Feb 14, 2025

huggingface / optimum-graphcore

Blazing fast training of 🤗 Transformers on Graphcore IPUs

Python 85 35 Updated Mar 11, 2024

tokio-rs / console

a debugger for async rust!

Rust 3,781 148 Updated Jan 22, 2025

google-research / deduplicate-text-datasets

Rust 1,176 116 Updated Jul 30, 2024

quic / aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,227 396 Updated Feb 21, 2025

facebookresearch / hydra

Hydra is a framework for elegantly configuring complex applications

Python 9,058 662 Updated Feb 18, 2025

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,329 263 Updated Feb 21, 2025

dapr / docs

Dapr user documentation, used to build docs.dapr.io

HTML 997 734 Updated Feb 20, 2025

NVIDIA / libcudacxx

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

C++ 2,297 186 Updated Feb 7, 2024

huggingface / hfapi

Simple Python client for the Hugging Face Inference API

Python 72 10 Updated Aug 18, 2020

PAIR-code / lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

TypeScript 3,521 356 Updated Feb 20, 2025

openvinotoolkit / openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ 7,839 2,433 Updated Feb 21, 2025

liquidctl / liquidctl

Cross-platform CLI and Python drivers for AIO liquid coolers and other devices

Python 2,294 229 Updated Jan 27, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,895 4,252 Updated Feb 21, 2025

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,423 2,852 Updated Feb 20, 2025

microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,690 3,052 Updated Feb 21, 2025

triton-inference-server / server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,772 1,527 Updated Feb 21, 2025

onnx / onnx

Open standard for machine learning interoperability

Python 18,480 3,721 Updated Feb 21, 2025
