Browse by Supervisor
Showing results 1 to 10 of 10
Best model selection via stochastic rising bandits
2022/2023 MONTENEGRO, ALESSANDRO
Exploiting sub-optimal expert behaviors in inverse reinforcement learning
2022/2023 Curti, Gabriele
On the sample complexity of inverse reinforcement learning
2022/2023 Lazzati, Filippo
Online learning for PID controller tuning
2023/2024 Abbattista, Valentina
Sample complexity of inverse reinforcement learning in linear quadratic regulator
2023/2024 PESCE, LEONARDO
Smoothed OMD: an Algorithm for No-regret Learning in Adversarial MDPs with Revealed Transitions
2023/2024 Corso, Federico
A theory-driven approach to Large Language Models alignment with human feedback
2023/2024 SIMEONE, MICHELE
Towards closing the gap in Restless Rising Bandits
2024/2025 MIGALI, CRISTIANO
Towards fully-adaptive regret minimization in heavy-tailed bandits
2022/2023 Marsigli, Lupo
Trajectory reuse in policy gradients
2024/2025 MANSUTTI, FEDERICO