000 00954nam a2200277Ia 4500
003 OSt
005 20221205164205.0
008 211118s9999 xx 000 0 und d
020 _a978-1-60845-493-8
040 _cIITB
041 _aeng
100 _aNaumann, Felix
_921987
_eAuthor
245 0 _aAlgorithms for reinforcement learning (e-book)
260 _aSan Rafael
_bMorgan and Claypool /
_bIEEE Press /
_bSpringer
_c2010
440 _aSynthesis lectures on data management
_921986
500 _aIEEE Morgan and Claypool Computer and Information Science (CIS) collection
650 0 _91269
_aMachine learning
650 0 _912282
_aMarkov processes
650 0 _913744
_aStochastic approximation
650 0 _9705
_aMonte Carlo method
650 0 _94177
_aComputer simulation
650 0 _aNatural gradient
_923583
700 _aHerschel, Melanie
_eAuthor
_921989
856 _uhttps://ieeexplore.ieee.org/document/6813120
942 _cEB
_2udc
999 _c278336
_d278336