news · Reddit r/MachineLearning

A debugger for RL reward functions that detects reward hacking during training [P]

<table> <tr><td> <a href="https://www.reddit.com/r/MachineLearning/comments/1uga687/a_debugger_for_rl_reward_functions_that_detects/"> <img alt="A debugger for RL reward functions that detects reward hacking during training [P]" src="https://preview.redd.it/r5m95bf5cn9h1.gif?width=640&crop=smart&s=f9e1900b5e007ea3a72c74d4089c56fdeed22f49" title="A debugger for RL reward functions that detects reward hacking during training [P]" /> </a> </td><td> <div class="md"><p>While ex

Want the primary source?View original →

Research Reddit r/MachineLearning