Simulink的强化学习代理开始无输出结果的解决方法

Simulink进行强化学习代理训练时,出现如下错误:

1
2
3
Derivative of state '2' in block 'physicalModel/Nonlinear Second-Order Actuator/Integrator, Second-Order
Limited' at time 0.0 is not finite. The simulation will be stopped. There may be a singularity in the solution. If not, try reducing the step size
(either by reducing the fixed step size or by tightening the error tolerances)

经过分析,导致该错误的原因是产生强化学习代理初始输入信号的Memory模块的初始值设为inf导致的。解决方法是修改Memory模块的初始值。

参考链接

  1. Simulink扩展卡尔曼滤波输出无结果的解决,by 渣渣丰.