Skip to content
@ai-fail-safe

ai-fail-safe

Popular repositories

  1. safe-reward safe-reward Public

    a prototype for an AI safety library that allows an agent to maximize its reward by solving a puzzle in order to prevent the worst-case outcomes of perverse instantiation

    Python 8

  2. life-span life-span Public

    a project to ensure an artificial agent will eventually reach the end of its existence

    1

  3. gene-drive gene-drive Public

    a project to ensure that all child processes created by an agent "inherit" the agent's safety controls

    1

  4. mulligan mulligan Public

    a library designed to shut down an agent exhibiting unexpected behavior providing a potential "mulligan" to human civilization; IN CASE OF FAILURE, DO NOT JUST REMOVE THIS CONSTRAINT AND START IT B…

    1

  5. honeypot honeypot Public

    a project to detect environment tampering on the part of an agent

    1

Repositories

Showing 5 of 5 repositories
  • safe-reward Public

    a prototype for an AI safety library that allows an agent to maximize its reward by solving a puzzle in order to prevent the worst-case outcomes of perverse instantiation

    Python 8 MIT 0 1 0 Updated Nov 8, 2022
  • honeypot Public

    a project to detect environment tampering on the part of an agent

    1 MIT 0 0 0 Updated Oct 31, 2022
  • mulligan Public

    a library designed to shut down an agent exhibiting unexpected behavior providing a potential "mulligan" to human civilization; IN CASE OF FAILURE, DO NOT JUST REMOVE THIS CONSTRAINT AND START IT BACK UP AGAIN

    1 MIT 0 0 0 Updated Oct 30, 2022
  • gene-drive Public

    a project to ensure that all child processes created by an agent "inherit" the agent's safety controls

    1 MIT 0 0 0 Updated Oct 29, 2022
  • life-span Public

    a project to ensure an artificial agent will eventually reach the end of its existence

    1 MIT 0 0 0 Updated Oct 29, 2022

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…