Angestrom
news · Reddit r/MachineLearning

A debugger for RL reward functions that detects reward hacking during training [P]

<table> <tr><td> <a href="https://www.reddit.com/r/MachineLearning/comments/1uga687/a_debugger_for_rl_reward_functions_that_detects/"> <img alt="A debugger for RL reward functions that detects reward hacking during training [P]" src="https://preview.redd.it/r5m95bf5cn9h1.gif?width=640&amp;crop=smart&amp;s=f9e1900b5e007ea3a72c74d4089c56fdeed22f49" title="A debugger for RL reward functions that detects reward hacking during training [P]" /> </a> </td><td> <!-- SC_OFF --><div class="md"><p>While ex

Want the primary source?View original →