
Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.

Jupyter Notebook 14 2 Updated May 5, 2022

Python 3.8+ toolbox for submitting jobs to Slurm

Python 1,371 131 Updated Sep 18, 2024

This is a fork of the awesome Joey-NMT with Reinforcement Learning algorithms like Policy Gradient, MRT and Advantage Actor Critic.

Python 27 7 Updated Feb 10, 2023

Boost LaTeX typesetting efficiency with preview, compile, autocomplete, colorize, and more.

TypeScript 10,946 539 Updated Feb 12, 2025

MLE-Guided Parameter Search (AAAI 2021)

Python 11 1 Updated Sep 16, 2021

Discontinuous Hamiltonian Monte Carlo in JAX

Jupyter Notebook 41 2 Updated Feb 24, 2020

Python 129 37 Updated Sep 17, 2023

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,900 495 Updated Feb 14, 2023

Unsupervised Statistical Machine Translation

Java 229 40 Updated Aug 30, 2020

The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)

Python 33 1 Updated Dec 2, 2021

Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019

Python 712 174 Updated May 29, 2022

Code to reproduce experiments in the paper "Task-Oriented Dialogue as Dataflow Synthesis" (TACL 2020).

Python 308 66 Updated Apr 30, 2024

From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction

Python 9 2 Updated May 5, 2018

Reinforcement Learning for Neural Machine Translation

Python 188 48 Updated Dec 29, 2024

Compositional generalization through meta sequence-to-sequence learning

Python 83 12 Updated Jan 7, 2020

Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for natural language modeling fail when such compositional generali…

Python 27 8 Updated Apr 23, 2020

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Python 348 80 Updated Apr 1, 2019

PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"

Python 51 13 Updated Dec 8, 2022

Python Multi-Agent Reinforcement Learning framework

Python 1,943 393 Updated Dec 8, 2022

EGG: Emergence of lanGuage in Games

Jupyter Notebook 295 106 Updated Apr 4, 2024

Code for Emergent Translation in Multi-Agent Communication

Python 80 14 Updated Jun 6, 2018

Code associated with the Don't Stop Pretraining ACL 2020 paper

Python 530 73 Updated Nov 15, 2021

A neural machine translation model in PyTorch

Python 118 24 Updated Jul 3, 2019

Implementation of Dual Learning NMT on PyTorch

Python 163 30 Updated Mar 13, 2018

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,527 940 Updated Feb 3, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,092 2,670 Updated Feb 12, 2025

Code for "Unsupervised State Representation Learning in Atari"

Python 245 51 Updated Nov 2, 2023

Graph-based Deep Q Network for Web Navigation

Python 47 10 Updated Jul 8, 2019

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

2,612 716 Updated May 19, 2023