- Santa Clara
- https://kaixih.github.io/
-
maxtext Public
Forked from AI-Hypercomputer/maxtextA simple, performant and scalable Jax LLM!
Python Apache License 2.0 UpdatedFeb 12, 2025 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedFeb 12, 2025 -
jax Public
Forked from jax-ml/jaxComposable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Python Apache License 2.0 UpdatedDec 19, 2024 -
xla Public
Forked from openxla/xlaA machine learning compiler for GPUs, CPUs, and ML accelerators
C++ Apache License 2.0 UpdatedNov 26, 2024 -
-
-
paxml Public
Forked from google/paxmlPax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…
Python Apache License 2.0 UpdatedJul 23, 2024 -
flax Public
Forked from google/flaxFlax is a neural network library for JAX that is designed for flexibility.
Python Apache License 2.0 UpdatedJul 11, 2024 -
JAX-Toolbox Public
Forked from NVIDIA/JAX-ToolboxJAX-Toolbox
Python Apache License 2.0 UpdatedMay 31, 2024 -
cudnn_frontend_test Public
A scalable code framework for CuDNN frontend APIs
-
-
tensorflow Public
Forked from tensorflow/tensorflowAn Open Source Machine Learning Framework for Everyone
C++ Apache License 2.0 UpdatedDec 16, 2023 -
atex Public
Forked from NVIDIA/atexA TensorFlow Extension: GPU performance tools for TensorFlow.
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 15, 2023 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in bot…
Cuda Apache License 2.0 UpdatedJun 8, 2023 -
-
-
-
-
-
dl_samples Public
Code samples to use deep learning frameworks, libraries.
-
tf_op_graph Public
A visualization tool to display TF-Grappler optimized op graph
-
-
-
-
cudnn_migration Public
Migrate your cuDNN v7 legacy APIs to cuDNN v8 frontend APIs
-
-
-
-
horovod Public
Forked from horovod/horovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
-
slate Public
Forked from pages-themes/slateSlate is a Jekyll theme for GitHub Pages
CSS Creative Commons Zero v1.0 Universal UpdatedDec 15, 2020