Skip to content
@AlignmentResearch

FAR.AI

Frontier alignment research to ensure the safe development and deployment of advanced AI systems.

Popular repositories Loading

  1. tuned-lens tuned-lens Public

    Tools for understanding how transformer predictions are built layer-by-layer

    Python 448 48

  2. go_attack go_attack Public

    Python 84 7

  3. vlmrm vlmrm Public

    Python 46 12

  4. gpt-4-novel-apis-attacks gpt-4-novel-apis-attacks Public

    17 1

  5. learned-planner learned-planner Public

    Interpretability tools for recurrent networks that play Sokoban

    Python 10 2

  6. KataGo-custom KataGo-custom Public

    Child repository of https://github.com/HumanCompatibleAI/go_attack.

    C++ 4 1

Repositories

Showing 10 of 30 repositories
  • train-learned-planner Public

    Experimenting with CleanRL for learned-planners

    AlignmentResearch/train-learned-planner’s past year of commit activity
    Python 4 0 0 0 Updated Dec 3, 2024
  • gym-sokoban Public

    Sokoban environment for Gym

    AlignmentResearch/gym-sokoban’s past year of commit activity
    Python 0 MIT 0 0 1 Updated Dec 3, 2024
  • AlignmentResearch/KataGo-custom’s past year of commit activity
    C++ 4 1 6 1 Updated Nov 27, 2024
  • kubespray Public Forked from kubernetes-sigs/kubespray

    Deploy a Production Ready Kubernetes Cluster

    AlignmentResearch/kubespray’s past year of commit activity
    Jinja 0 Apache-2.0 6,633 0 0 Updated Nov 22, 2024
  • AlignmentResearch/scaling-poisoning’s past year of commit activity
    Python 4 0 0 2 Updated Nov 18, 2024
  • learned-planner Public

    Interpretability tools for recurrent networks that play Sokoban

    AlignmentResearch/learned-planner’s past year of commit activity
    Python 10 Apache-2.0 2 0 0 Updated Oct 19, 2024
  • lp_sae Public Forked from jbloomAus/SAELens

    Training Sparse Autoencoders on DRC networks

    AlignmentResearch/lp_sae’s past year of commit activity
    HTML 0 MIT 132 0 0 Updated Oct 17, 2024
  • learned-planners-stable-baselines3 Public Forked from AlignmentResearch/stable-baselines3

    PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

    AlignmentResearch/learned-planners-stable-baselines3’s past year of commit activity
    Python 2 MIT 1,798 0 0 Updated Oct 17, 2024
  • envpool Public Forked from sail-sg/envpool

    C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

    AlignmentResearch/envpool’s past year of commit activity
    C++ 0 Apache-2.0 109 0 0 Updated Oct 17, 2024
  • azure-storage-fuse Public Forked from Azure/azure-storage-fuse

    A virtual file system adapter for Azure Blob storage

    AlignmentResearch/azure-storage-fuse’s past year of commit activity
    Go 0 218 0 0 Updated Oct 9, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…