Sfoglia per Relatore
Safe policy iteration : a monotonically improving approximate policy iteration approach
2011/2012 PECORINO, ALESSIO
Safe policy optimization
2020/2021 Papini, Matteo
Sales funnel simulation and sales forecasting with Markov chains
2020/2021 Fontana, Fabio
Scalable power network control with reinforcement learning
2021/2022 Paletti, Daniele
Solving time-varying maze with deep reinforcement learning for tiny devices
2021/2022 Colella, Stefano
Stochastic multi-armed bandit with switching costs : an empirical analysis
2017/2018 SCANNAPIECO, LUCA
Stochastic variance reduced policy gradient
2017/2018 CANONACO, GIUSEPPE
Studio e analisi di algoritmi di apprendimento per rinforzo policy gradient per la risoluzione di problemi decisionali multiobiettivo
2012/2013 PARISI, SIMONE
TargExp: an Algorithm for Audience Expansion and Profit Maximization for Online Advertising
2022/2023 EL KHOURY, JANA
TargOpt: a targeting optimization algorithm for online advertising
2021/2022 Gentile, Nicole
Task-agnostic exploration via maximum state entropy policy optimization
2019/2020 Pratissoli, Lorenzo
Teaching a learner driver using reinforcement learning and planning strategies
2020/2021 VALERIANI, ANGELICA SOFIA
Time-variant distribution learning with importance sampling regularization: the Forex case study
2023/2024 Lunardi, Chiara
Time-variant variational transfer for value functions
2019/2020 Soprani, Andrea
Tourism analysis on a large scale using mobile location data
2018/2019 TULLII, FRANCESCO
Towards Automated Reinforcement Learning
2020/2021 Lombarda, Davide
Towards making importance sampling practical
2020/2021 Russo, Alessio
Towards robust machine learning applications : a framework for monitoring, retraining, and online model selection
2021/2022 Fabris, Matteo
Transfer in policy gradients via multiple importance sampling
2018/2019 SALVINI, MATTIA
Transfer learning for actor-critic methods in Lipschitz Markov decision processes
2016/2017 VACCA MANRIQUE, DANIEL FELIPE
Legenda icone accesso al fulltext
- File accessibili da tutti
- File accessibili dagli utenti autorizzati
- File accessibili da tutti o solo dagli utenti autorizzati, a partire dalla la data indicata nella scheda
- File non accessibili