Faulty Reward Functions in the Wild

This is a Perma.cc record

Captured April 23, 2022 9:51 am

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we'll explore one failure mode: misspecifying your reward function.
