This book presents sequential decision theory from a novel algorithmic information theory perspective. It introduces both ideas and removes their individual limitations by unifying them into a single parameter-free theory of an optimal reinforcement learning agent embedded in an unknown environment.