- Mountain View, California
- jovsa.github.io
- @JovanSardinha
-
mla-experiments Public
Forked from ambisinister/mla-experimentsExperiments on Multi-Head Latent Attention
Python UpdatedFeb 7, 2025 -
production-stack Public
Forked from vllm-project/production-stackPython Apache License 2.0 UpdatedFeb 7, 2025 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedFeb 2, 2025 -
tinygrad Public
Forked from tinygrad/tinygradYou like pytorch? You like micrograd? You love tinygrad! ❤️
Python MIT License UpdatedJan 10, 2025 -
openr Public
Forked from openreasoner/openrOpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Python MIT License UpdatedNov 15, 2024 -
cleanrl Public
Forked from vwxyzjn/cleanrlHigh-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Python Other UpdatedNov 14, 2024 -
systems Public
Forked from lethain/systemssystems is a set of tools for describing, running and visualizing systems diagrams.
HTML MIT License UpdatedOct 29, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedOct 8, 2024 -
meltingpot Public
Forked from google-deepmind/meltingpotA suite of test scenarios for multi-agent reinforcement learning.
Python Apache License 2.0 UpdatedOct 1, 2024 -
dreamerv3-torch Public
Forked from NM512/dreamerv3-torchImplementation of Dreamer v3 in pytorch.
Python MIT License UpdatedSep 27, 2024 -
open_spiel Public
Forked from google-deepmind/open_spielOpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
C++ Apache License 2.0 UpdatedSep 23, 2024 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedSep 15, 2024 -
stream-of-search Public
Forked from kanishkg/stream-of-searchRepository for the paper Stream of Search: Learning to Search in Language
Python Apache License 2.0 UpdatedAug 10, 2024 -
dreamerv3 Public
Forked from danijar/dreamerv3Mastering Diverse Domains through World Models
Python MIT License UpdatedJul 29, 2024 -
mctx Public
Forked from google-deepmind/mctxMonte Carlo tree search in JAX
Python Apache License 2.0 UpdatedJul 25, 2024 -
SWE-agent Public
Forked from SWE-agent/SWE-agentSWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Python MIT License UpdatedJul 10, 2024 -
carbs Public
Forked from imbue-ai/carbsCost aware hyperparameter tuning algorithm
Jupyter Notebook MIT License UpdatedJun 27, 2024 -
rebel Public
Forked from facebookresearch/rebelAn algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
C++ Apache License 2.0 UpdatedMar 22, 2024 -
kaggle-connect-x Public
https://www.kaggle.com/c/connectx
Python GNU General Public License v3.0 UpdatedMar 20, 2024 -
diplomacy_cicero Public
Forked from facebookresearch/diplomacy_ciceroCode for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
Python Other UpdatedMar 18, 2024 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedMar 15, 2024 -
-
-
mmd Public
Forked from ssokota/mmdCode for magnetic mirror descent.
Python MIT License UpdatedOct 5, 2023 -
-
alpha-zero-general Public
Forked from suragnair/alpha-zero-generalA clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Jupyter Notebook MIT License UpdatedJun 10, 2023 -
micrograd Public
Forked from karpathy/microgradA tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Jupyter Notebook MIT License UpdatedApr 16, 2023 -
Tensor-Puzzles Public
Forked from srush/Tensor-PuzzlesSolve puzzles. Improve your pytorch.
Jupyter Notebook MIT License UpdatedApr 15, 2023 -
minGPT Public
Forked from karpathy/minGPTA minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Python MIT License UpdatedApr 10, 2023 -
trlx Public
Forked from CarperAI/trlxA repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Python MIT License UpdatedFeb 19, 2023