Sfoglia per Relatore
Mostrati risultati da 1 a 8 di 8
Best model selection via stochastic rising bandits
2022/2023 MONTENEGRO, ALESSANDRO
Exploiting sub-optimal expert behaviors in inverse reinforcement learning
2022/2023 Curti, Gabriele
On the sample complexity of inverse reinforcement learning
2022/2023 Lazzati, Filippo
Online learning for PID controller tuning
2023/2024 Abbattista, Valentina
Sample complexity of inverse reinforcement learning in linear quadratic regulator
2023/2024 PESCE, LEONARDO
Smoothed OMD: an Algorithm for No-regret Learning in Adversarial MDPs with Revealed Transitions
2023/2024 Corso, Federico
A theory-driven approach to Large Language Models alignment with human feedback
2023/2024 SIMEONE, MICHELE
Towards fully-adaptive regret minimization in heavy-tailed bandits
2022/2023 Marsigli, Lupo
Mostrati risultati da 1 a 8 di 8
Legenda icone accesso al fulltext
- File accessibili da tutti
- File accessibili dagli utenti autorizzati
- File accessibili da tutti o solo dagli utenti autorizzati, a partire dalla la data indicata nella scheda
- File non accessibili