jovsa

Jovan Sardinha jovsa

personal account

39 followers · 0 following

Mountain View, California
jovsa.github.io
@JovanSardinha

Achievements

mla-experiments Public
Forked from ambisinister/mla-experiments

Experiments on Multi-Head Latent Attention

Python Updated Feb 7, 2025
production-stack Public
Forked from vllm-project/production-stack

Python Apache License 2.0 Updated Feb 7, 2025
trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python Apache License 2.0 Updated Feb 2, 2025
tinygrad Public
Forked from tinygrad/tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python MIT License Updated Jan 10, 2025
openr Public
Forked from openreasoner/openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python MIT License Updated Nov 15, 2024
cleanrl Public
Forked from vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python Other Updated Nov 14, 2024
systems Public
Forked from lethain/systems

systems is a set of tools for describing, running and visualizing systems diagrams.

HTML MIT License Updated Oct 29, 2024
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Oct 8, 2024
meltingpot Public
Forked from google-deepmind/meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Python Apache License 2.0 Updated Oct 1, 2024
dreamerv3-torch Public
Forked from NM512/dreamerv3-torch

Implementation of Dreamer v3 in pytorch.

Python MIT License Updated Sep 27, 2024
open_spiel Public
Forked from google-deepmind/open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ Apache License 2.0 Updated Sep 23, 2024
triton Public
Forked from triton-lang/triton

Development repository for the Triton language and compiler

C++ MIT License Updated Sep 15, 2024
stream-of-search Public
Forked from kanishkg/stream-of-search

Repository for the paper Stream of Search: Learning to Search in Language

Python Apache License 2.0 Updated Aug 10, 2024
dreamerv3 Public
Forked from danijar/dreamerv3

Mastering Diverse Domains through World Models

Python MIT License Updated Jul 29, 2024
mctx Public
Forked from google-deepmind/mctx

Monte Carlo tree search in JAX

Python Apache License 2.0 Updated Jul 25, 2024
SWE-agent Public
Forked from SWE-agent/SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python MIT License Updated Jul 10, 2024
carbs Public
Forked from imbue-ai/carbs

Cost aware hyperparameter tuning algorithm

Jupyter Notebook MIT License Updated Jun 27, 2024
rebel Public
Forked from facebookresearch/rebel

An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

C++ Apache License 2.0 Updated Mar 22, 2024
kaggle-connect-x Public

https://www.kaggle.com/c/connectx

Python GNU General Public License v3.0 Updated Mar 20, 2024
diplomacy_cicero Public
Forked from facebookresearch/diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python Other Updated Mar 18, 2024
pytorch Public
Forked from pytorch/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python Other Updated Mar 15, 2024
jovsa.github.io Public

SCSS MIT License Updated Dec 4, 2023
safe-haven-reproduction Public

Jupyter Notebook MIT License Updated Oct 13, 2023
mmd Public
Forked from ssokota/mmd

Code for magnetic mirror descent.

Python MIT License Updated Oct 5, 2023
cut-the-knot-probability-riddles Public

MIT License Updated Sep 21, 2023
alpha-zero-general Public
Forked from suragnair/alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook MIT License Updated Jun 10, 2023
micrograd Public
Forked from karpathy/micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook MIT License Updated Apr 16, 2023
Tensor-Puzzles Public
Forked from srush/Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Jupyter Notebook MIT License Updated Apr 15, 2023
minGPT Public
Forked from karpathy/minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python MIT License Updated Apr 10, 2023
trlx Public
Forked from CarperAI/trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python MIT License Updated Feb 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jovan Sardinha jovsa

Achievements

Achievements

Block or report jovsa

mla-experiments Public

production-stack Public

trl Public

tinygrad Public

openr Public

cleanrl Public

systems Public

vllm Public

meltingpot Public

dreamerv3-torch Public

open_spiel Public

triton Public

stream-of-search Public

dreamerv3 Public

mctx Public

SWE-agent Public

carbs Public

rebel Public

kaggle-connect-x Public

diplomacy_cicero Public

pytorch Public

jovsa.github.io Public

safe-haven-reproduction Public

mmd Public

cut-the-knot-probability-riddles Public

alpha-zero-general Public

micrograd Public

Tensor-Puzzles Public

minGPT Public

trlx Public