pixqc/llm-inference: Llama 3.2 inference in MLX and PyTorch, with some evals too. For educational purposes ^_^ Heavily inspired by https://github.com/xjdr-alt/entropix