Proceedings of the IEEE Conference on Decision and Control
IEEE
667 - 672
Yes
1424401712
45th IEEE Conference on Decision and Control
13-15 December 2006
San Diego
USA
[en] This paper addresses the problem of computing optimal structured treatment interruption strategies for HIV infected patients. We show that reinforcement learning may be useful to extract such strategies directly from clinical data, without the need of an accurate mathematical model of HIV infection dynamics. To support our claims, we report simulation results obtained by running a recently proposed batch-mode reinforcement learning algorithm, known as fitted Q iteration, on numerically generated data.