Starred repositories
Democratizing Reinforcement Learning for LLMs
A Survey on Efficient Reasoning for LLMs
verl: Volcano Engine Reinforcement Learning for LLMs
Code accompanying the paper "Massive Activations in Large Language Models"
🌎💪 BrowserGym, a Gym environment for web task automation
Building a comprehensive and handy list of papers for GUI agents
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Code for the paper "Empowering Large Language Model Agents through Action Learning"
A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications
Statsmodels: statistical modeling and econometrics in Python
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
[BEA @ ACL 2023] General-purpose tool for linguistic feature extraction; tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Lean 3 material for Kevin Buzzard's 2021 TCC course on formalising mathematics. Lean 4 version available here: https://github.com/ImperialCollegeLondon/formalising-mathematics-2024
Companion webpage for the book "Bayesian Optimization" by Roman Garnett
Universal and Transferable Attacks on Aligned Language Models
The official Python library for the OpenAI API
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
A curated list of foundation models for vision and language tasks
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
A new adversarial purification method that uses the forward and reverse processes of diffusion models to remove adversarial perturbations.
Google Research