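Resource Description

MATLAB source code for Q-learning (reinforcement learning), with detailed comments. Run and tested personally by the uploader.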

Code Snippet and File Information


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Q learning of single agent move in N rooms
% Matlab Code companion of
% Q Learning by Example
%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
function q=ReinforcementLearning
clc;
format short
format compact

    % Two input: R and gamma
    % immediate reward matrix;
    % row and column = states; -Inf = no door between room
    R=[-inf,-inf,-inf,-inf,   0,-inf;
       -inf,-inf,-inf,   0,-inf, 100;
       -inf,-inf,-inf,   0,-inf,-inf;
       -inf,   0,   0,-inf,   0,-inf;
          0,-inf,-inf,   0,-inf, 100;
       -inf,   0,-inf,-inf,   0, 100];

    gamma=0.80;            % learning parameter

    q=zeros(size(R));      % initialize Q as zero; q has the same number of rows and columns as matrix R
    q1=ones(size(R))*inf;  % initialize previo
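
The snippet is cut off at this point, so the rest of the original function is not shown. As a rough illustration only, and not the uploader's code, the training loop for this kind of room-navigation example could look something like the sketch below. It assumes the usual deterministic update Q(s,a) = R(s,a) + gamma * max Q(s',:), where taking action a moves the agent into room a. The function name qLearningSketch and the nEpisodes parameter are hypothetical.

```matlab
% qLearningSketch.m -- hypothetical sketch, NOT the original author's code.
% Assumes R(s,a) = -Inf marks a missing door and that choosing action a
% from room s moves the agent to room a.
function q = qLearningSketch(R, gamma, nEpisodes)
    q = zeros(size(R));                      % Q table, same shape as R
    nStates = size(R, 1);
    for episode = 1:nEpisodes
        s = randi(nStates);                  % random starting room
        actions = find(R(s, :) >= 0);        % rooms reachable from s
        if isempty(actions)
            continue;                        % isolated room: nothing to update
        end
        a = actions(randi(numel(actions)));  % pick one reachable room at random
        % immediate reward plus discounted best value of the next room
        q(s, a) = R(s, a) + gamma * max(q(a, :));
    end
    g = max(q(:));
    if g > 0
        q = 100 * q / g;                     % normalize so the largest entry is 100
    end
end
```

Saved as its own file, this could be called as q = qLearningSketch(R, 0.8, 20000) with the R matrix defined above. Each episode here performs a single random update rather than walking to the goal, which keeps the sketch short but means many episodes are needed before the values settle.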
