Carnegie Mellon Machine Learning Lunch seminar

Date: 2025-10-06
Abstract: One of the fundamental challenges in reinforcement learning (RL) is to guarantee that a newly proposed policy, one that has not yet been deployed, will improve upon the current policy; in other words, that the RL algorithm is "safe". Such an algorithm would...