Stars
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Oryx is a library for probabilistic programming and deep learning built on top of Jax.
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more