Skip to content
View jovsa's full-sized avatar

Block or report jovsa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • Experiments on Multi-Head Latent Attention

    Python Updated Feb 7, 2025
  • Python Apache License 2.0 Updated Feb 7, 2025
  • trl Public

    Forked from huggingface/trl

    Train transformer language models with reinforcement learning.

    Python Apache License 2.0 Updated Feb 2, 2025
  • tinygrad Public

    Forked from tinygrad/tinygrad

    You like pytorch? You like micrograd? You love tinygrad! ❤️

    Python MIT License Updated Jan 10, 2025
  • openr Public

    Forked from openreasoner/openr

    OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

    Python MIT License Updated Nov 15, 2024
  • cleanrl Public

    Forked from vwxyzjn/cleanrl

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

    Python Other Updated Nov 14, 2024
  • systems Public

    Forked from lethain/systems

    systems is a set of tools for describing, running and visualizing systems diagrams.

    HTML MIT License Updated Oct 29, 2024
  • vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python Apache License 2.0 Updated Oct 8, 2024
  • A suite of test scenarios for multi-agent reinforcement learning.

    Python Apache License 2.0 Updated Oct 1, 2024
  • Implementation of Dreamer v3 in pytorch.

    Python MIT License Updated Sep 27, 2024
  • OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

    C++ Apache License 2.0 Updated Sep 23, 2024
  • triton Public

    Forked from triton-lang/triton

    Development repository for the Triton language and compiler

    C++ MIT License Updated Sep 15, 2024
  • Repository for the paper Stream of Search: Learning to Search in Language

    Python Apache License 2.0 Updated Aug 10, 2024
  • dreamerv3 Public

    Forked from danijar/dreamerv3

    Mastering Diverse Domains through World Models

    Python MIT License Updated Jul 29, 2024
  • mctx Public

    Forked from google-deepmind/mctx

    Monte Carlo tree search in JAX

    Python Apache License 2.0 Updated Jul 25, 2024
  • SWE-agent Public

    Forked from SWE-agent/SWE-agent

    SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

    Python MIT License Updated Jul 10, 2024
  • carbs Public

    Forked from imbue-ai/carbs

    Cost aware hyperparameter tuning algorithm

    Jupyter Notebook MIT License Updated Jun 27, 2024
  • rebel Public

    Forked from facebookresearch/rebel

    An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

    C++ Apache License 2.0 Updated Mar 22, 2024
  • https://www.kaggle.com/c/connectx

    Python GNU General Public License v3.0 Updated Mar 20, 2024
  • Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

    Python Other Updated Mar 18, 2024
  • pytorch Public

    Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python Other Updated Mar 15, 2024
  • SCSS MIT License Updated Dec 4, 2023
  • Jupyter Notebook MIT License Updated Oct 13, 2023
  • mmd Public

    Forked from ssokota/mmd

    Code for magnetic mirror descent.

    Python MIT License Updated Oct 5, 2023
  • MIT License Updated Sep 21, 2023
  • A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

    Jupyter Notebook MIT License Updated Jun 10, 2023
  • micrograd Public

    Forked from karpathy/micrograd

    A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

    Jupyter Notebook MIT License Updated Apr 16, 2023
  • Solve puzzles. Improve your pytorch.

    Jupyter Notebook MIT License Updated Apr 15, 2023
  • minGPT Public

    Forked from karpathy/minGPT

    A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

    Python MIT License Updated Apr 10, 2023
  • trlx Public

    Forked from CarperAI/trlx

    A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

    Python MIT License Updated Feb 19, 2023