Show record details
This is a
Perma.cc
record
Captured
April 23, 2022 9:51 am
April 23, 2022 9:51 a.m.
View
Mode
:
Standard
Screenshot
View the live page
What is Perma.cc?
Source page URL
Title
Faulty Reward Functions in the Wild
Description
Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we'll explore one failure mode, which is where you misspecify your reward function.
Flag
as inappropriate