学科分类
/ 1
1 个结果
  • 简介:Thispaperpresentsamodel-basedapproximateλ-policyiterationapproachusingtemporaldifferencesforoptimizingpathsonlineforapursuit-evasionproblem,whereanagentmustvisitseveraltargetpositionswithinaregionofinterestwhilesimultaneouslyavoidingoneormoreactivelypursuingadversaries.Thismethodisrelevanttoapplications,suchasroboticpathplanning,mobile-sensorapplications,andpathexposure.Themethodologydescribedutilizescelldecompositiontoconstructadecisiontreeand...

  • 标签: 近似动态编程 加强学习 路径计划 追求避免比赛