- London, UK
- www.jacquesthibodeau.com
- @JacquesThibs
-
Future prosaic AIs will likely shape their own development or that of successor AIs. We're trying to make sure they don't go insane.
-
-
-
uk-bio-bank-chat-app Public
This project creates a SQL database for the UK bio bank and then allows users to query the database via an LLM.
Python MIT License UpdatedOct 19, 2024 -
-
-
-
Exploring NLP weak supervision approaches to train text classification models. The project is also a prototype for a semi-automated text data labelling platform. Approaches: Snorkel and Zero-Shot L…
-
anti-misinfo-helper Public
An AI-aided tool to add to Community Notes to improve efficiency and help people write notes.
-
RLHI Public
Forked from daveshap/RLHIReinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition
Python MIT License UpdatedApr 27, 2023 -
elk Public
Forked from EleutherAI/elkKeeping language models honest by directly eliciting knowledge encoded in their activations. Building on "Discovering latent knowledge in language models without supervision" (Burns et al. 2022)
Python MIT License UpdatedApr 16, 2023 -
-
-
rome-experiments Public
Forked from kmeng01/romeLocating and editing factual associations in pre-trained transformers
Jupyter Notebook MIT License UpdatedDec 29, 2022 -
trl-textworld Public
Forked from MichaelEinhorn/trl-textworldPython Apache License 2.0 UpdatedDec 8, 2022 -
white-box-rome Public
Forked from AlignmentResearch/tuned-lensUsing tuned lens to better understand the properties being projected at a specific layer-token.
Jupyter Notebook MIT License UpdatedNov 22, 2022 -
gpt-experiments Public
This repository contains various experiments and prototypes to get use to working with GPT-like models and being creative with them.
-
This is the repository for an 8-week research project that was worked on while attending SERI MATS.
-
-
aligning-language-models Public template
This repository contains experiments on aligning language models.
Jupyter Notebook MIT License UpdatedJul 16, 2022 -
mesh-transformer-jax Public
Forked from kingoflolz/mesh-transformer-jaxModel parallel transformers in JAX and Haiku
Python Apache License 2.0 UpdatedJun 23, 2022 -
A collection of the software engineering best practices for data scientists and ML engineers.
Python UpdatedJun 18, 2022 -
alignment-research-dataset Public
Forked from moirage/alignment-research-datasetA dataset of alignment research and code to reproduce it
-
A tool to get all the latest AI alignment paers from arxiv.
Jupyter Notebook MIT License UpdatedJun 2, 2022 -
ai-safety-scrape Public
Scraping different AI Safety resources.
-
mlab Public
Forked from redwoodresearch/mlabMachine Learning for Alignment Bootcamp
Jupyter Notebook UpdatedApr 27, 2022 -
ai-safety-prize-challenge Public
A webapp for finding "bad" outputs of LLMs.
-
-
transformers-from-scratch Public
This is repository for learning about building transformer models from scratch.
Jupyter Notebook MIT License UpdatedJan 18, 2022 -
This repository focuses on training semantic segmentation models to predict the presence of floodwater for disaster prevention. Models were trained using SageMaker and Colab.