上传者: wuyamonyx
|
上传时间:2025/10/5 9:16:40
|
文件大小:1.71MB
|
文件类型:pdf
Algorithmsforreinforcementlearning
主要责任者Szepesvári,Csaba.题名Algorithmsforreinforcementlearning[electronicresource]/CsabaSzepesvári.出版资料SanRafael,Calif.(1537FourthStreet,SanRafael,CA94901USA):Morgan&Claypool,c2010.摘要附注Reinforcementlearningisalearningparadigmconcernedwithlearningtocontrolasystemsoastomaximizeanumericalperformancemeasurethatexpressesalong-termobjective.Whatdistinguishesreinforcementlearningfromsupervisedlearningisthatonlypartialfeedbackisgiventothelearneraboutthelearner'spredictions.Further,thepredictionsmayhavelongtermeffectsthroughinfluencingthefuturestateofthecontrolledsystem.Thus,timeplaysaspecialrole.Thegoalinreinforcementlearningistodevelopefficientlearningalgorithms,aswellastounderstandthealgorithms'meritsandlimitations.Reinforcementlearningisofgreatinterestbecauseofthelargenumberofpracticalapplicationsthatitcanbeusedtoaddress,rangingfromproblemsinartificialintelligencetooperationsresearchorcontrolengineering.Inthisbook,wefocusonthosealgorithmsofreinforcementlearningthatbuildonthepowerfultheoryofdynamicprogramming.Wegiveafairlycomprehensivecatalogoflearningproblems,describethecoreideas,notealargenumberofstateoftheartalgorithms,followedbythediscussionoftheirtheoreticalpropertiesandlimitations.
本软件ID:10030934