by Zlatan Ajanović, Bakir Lačević, and Jens Kober
Reference:
Zlatan Ajanović, Bakir Lačević, and Jens Kober. Value Function Learning via Prolonged Backward Heuristic Search. In ICAPS 2023 Wokshop: PRL Workshop – Bridging the Gap Between AI Planning and Reinforcement Learning, 2023.
Bibtex Entry:
@InProceedings{Ajanovic2023ICAPS_WS,
author = {Ajanovi\'{c}, Zlatan and La\v{c}evi\'{c}, Bakir and Kober, Jens},
booktitle = {ICAPS 2023 Wokshop: PRL Workshop – Bridging the Gap Between AI Planning and Reinforcement Learning},
title = {Value Function Learning via Prolonged Backward Heuristic Search},
year = {2023},
project = {TERI},
}