logo-polimi
Loading...
Manifesto
Struttura Corso di Studi
Cerca/Visualizza Manifesto
Regolamento didattico
Internazionalizzazione
Orario Personalizzato
Il tuo orario personalizzato è disabilitato
Abilita
Ricerche
Cerca Docenti
Attività docente
Cerca Insegnamenti
Cerca insegnamenti degli Ordinamenti precedenti al D.M.509
Erogati in lingua Inglese
Le informazioni sulla didattica, sulla ricerca e sui compiti istituzionali riportate in questa pagina sono certificate dall'Ateneo; ulteriori informazioni, redatte a cura del docente, sono disponibili sulla pagina web personale e nel curriculum vitae indicati nella scheda.
Informazioni sul docente
DocenteRestelli Marcello
QualificaProfessore associato a tempo pieno
Dipartimento d'afferenzaDipartimento di Elettronica, Informazione e Bioingegneria
Settore Scientifico DisciplinareING-INF/05 - Sistemi Di Elaborazione Delle Informazioni
Curriculum VitaeScarica il CV (357.26Kb - 02/12/2019)
OrcIDhttps://orcid.org/0000-0002-6322-1076

Contatti
Orario di ricevimento
DipartimentoPianoUfficioGiornoOrarioTelefonoFaxNote
DEI----MercoledìDalle 11:00
Alle 13:00
4015--Si consiglia di prendere appuntamento via email con il docente
E-mailmarcello.restelli@polimi.it
Pagina web redatta a cura del docentehttp://home.deib.polimi.it/restelli/

Fonte dati: RE.PUBLIC@POLIMI - Research Publications at Politecnico di Milano

Elenco delle pubblicazioni e dei prodotti della ricerca per l'anno 2022 (Mostra tutto | Nascondi tutto)
Tipologia Titolo Pubblicazione/Prodotto
Contributi su volumi (Capitolo o Saggio)
AI, Machine Learning e Data Mining (Mostra >>)
Articoli su riviste
Machine Learning Using Real-World and Translational Data to Improve Treatment Selection for NSCLC Patients Treated with Immunotherapy (Mostra >>)
Online joint bid/daily budget optimization of Internet advertising campaigns (Mostra >>)


Elenco delle pubblicazioni e dei prodotti della ricerca per l'anno 2021 (Mostra tutto | Nascondi tutto)
Tipologia Titolo Pubblicazione/Prodotto
Abstract in Rivista
Abstract PO-065: Artificial intelligence to improve selection for NSCLC patients treated with immunotherapy (Mostra >>)
Abstract in Atti di convegno
Advancing drought monitoring via feature extraction (Mostra >>)
The Human Nasal Cavity: Towards the Optimal Surgery with CFD and Machine Learning (Mostra >>)
Brevetti
Metodo implementato mediante computer per compilazione quantistica in tempo reale basato su intelligenza artificiale (Mostra >>)
Contributo in Atti di convegno
Conservative Online Convex Optimization (Mostra >>)
Exploiting History Data for Nonstationary Multi-armed Bandit (Mostra >>)
Inferring Functional Properties from Fluid Dynamics Features (Mostra >>)
Learning a Belief Representation for Delayed Reinforcement Learning (Mostra >>)
Learning in Non-Cooperative Configurable Markov Decision Processes (Mostra >>)
Leveraging Good Representations in Linear Contextual Bandits (Mostra >>)
Meta-Reinforcement Learning by Tracking Task Non-stationarity (Mostra >>)
Newton Optimization on Helmholtz Decomposition for Continuous Games (Mostra >>)
Policy Optimization as Online Learning with Mediator Feedback (Mostra >>)
Provably Efficient Learning of Transferable Rewards (Mostra >>)
Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning (Mostra >>)
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate (Mostra >>)
Time-Variant Variational Transfer for Value Functions (Mostra >>)
Articoli su riviste
A voltage dynamic-based state of charge estimation method for batteries storage systems (Mostra >>)
Data-driven indicators for the detection and prediction of stuck-pipe events in oil&gas drilling operations (Mostra >>)
Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems (Mostra >>)
Gaussian approximation for bias reduction in Q-learning (Mostra >>)
MushroomRL: Simplifying Reinforcement Learning Research (Mostra >>)
Policy space identification in configurable environments (Mostra >>)
Quantum compiling by deep reinforcement learning (Mostra >>)
Safe policy iteration: A monotonically improving approximate policy iteration approach (Mostra >>)


Elenco delle pubblicazioni e dei prodotti della ricerca per l'anno 2020 (Mostra tutto | Nascondi tutto)
Tipologia Titolo Pubblicazione/Prodotto
Contributo in Atti di convegno
A Data-Based Approach for the Prediction of Stuck-Pipe Events in Oil Drilling Operations (Mostra >>)
A Novel Confidence-Based Algorithm for Structured Bandits (Mostra >>)
An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits (Mostra >>)
An Intrinsically-Motivated Approach for Learning Highly Exploring and Fast Mixing Policies (Mostra >>)
Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration (Mostra >>)
Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning (Mostra >>)
Dealing with Transaction Costs in Portfolio Optimization: Online Gradient Descent with Momentum (Mostra >>)
Driving exploration by maximum distribution in gaussian process bandits (Mostra >>)
Fast direct calibration of interest rate derivatives pricing models (Mostra >>)
Foreign exchange trading: A risk-averse batch reinforcement learning approach (Mostra >>)
Gradient-Aware Model-Based Policy Search (Mostra >>)
Inverse Reinforcement Learning from a Gradient-based Learner (Mostra >>)
Model-Free Non-Stationarity Detection and Adaptation in Reinforcement Learning (Mostra >>)
Option Hedging with Risk Averse Reinforcement Learning (Mostra >>)
Risk-Averse Trust Region Optimization for Reward-Volatility Reduction (Mostra >>)
Sequential Transfer in Reinforcement Learning with a Generative Model (Mostra >>)
Sharing Knowledge in Multi-Task Deep Reinforcement Learning (Mostra >>)
Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions (Mostra >>)
Articoli su riviste
Combining reinforcement learning with rule-based controllers for transparent and general decision-making in autonomous driving (Mostra >>)
Importance Sampling Techniques for Policy Optimization (Mostra >>)
On the use of the policy gradient and Hessian in inverse reinforcement learning (Mostra >>)
Sliding-Window Thompson Sampling for Non-Stationary Settings (Mostra >>)


Elenco delle pubblicazioni e dei prodotti della ricerca per l'anno 2019 (Mostra tutto | Nascondi tutto)
Tipologia Titolo Pubblicazione/Prodotto
Contributo in Atti di convegno
Dealing with interdependencies and uncertainty in multi-channel advertising campaigns optimization (Mostra >>)
Exploiting Action-Value Uncertainty to Drive Exploration in Reinforcement Learning (Mostra >>)
Exploration Driven by an Optimistic Bellman Equation (Mostra >>)
Feature Selection via Mutual Information: New Theoretical Insights (Mostra >>)
IDIL: Exploiting Interdependence to Optimize Multi-Channel Advertising Campaigns (Mostra >>)
Optimistic Policy Optimization via Multiple Importance Sampling (Mostra >>)
Propagating Uncertainty in Reinforcement Learning via Wasserstein Barycenters (Mostra >>)
Reinforcement Learning Based Control of Coherent Transport by Adiabatic Passage of Spin Qubits (Mostra >>)
Reinforcement Learning in Configurable Continuous Environments (Mostra >>)
Transfer of Samples in Policy Search via Multiple Importance Sampling (Mostra >>)
Articoli su riviste
Coherent transport of quantum states by deep reinforcement learning (Mostra >>)


Elenco delle pubblicazioni e dei prodotti della ricerca per l'anno 2018 (Mostra tutto | Nascondi tutto)
Tipologia Titolo Pubblicazione/Prodotto
Contributo in Atti di convegno
A Combinatorial-Bandit Algorithm for the Online Joint Bid/Budget Optimization of Pay-per-Click Advertising Campaigns (Mostra >>)
An upper limb Functional Electrical Stimulation controller based on Reinforcement Learning: A feasibility case study. (Mostra >>)
Configurable Markov Decision Processes (Mostra >>)
Does Reinforcement Learning outperform PID in the control of FES-induced elbow flex-extension? (Mostra >>)
Importance Weighted Transfer of Samples in Reinforcement Learning (Mostra >>)
Improving Multi-Armed Bandit Algorithms for Pricing (Mostra >>)
Online Follower's Behaviour Identification in Leadership Games (Mostra >>)
Online Joint Bid/Budget Optimization of Pay-per-click Advertising Campaigns (Mostra >>)
Policy optimization via importance sampling (Mostra >>)
Reinforcement Learning Control of Functional Electrical Stimulation of the upper limb: a feasibility study. (Mostra >>)
Stochastic Variance-Reduced Policy Gradient (Mostra >>)
Targeting Optimization for Internet Advertising by Learning from Logged Bandit Feedback (Mostra >>)
Transfer of Value Functions via Variational Methods (Mostra >>)
When Gaussian Processes Meet Combinatorial Bandits: GCB (Mostra >>)
Articoli su riviste
Improving multi-armed bandit algorithms in online pricing settings (Mostra >>)
manifesti v. 3.4.22 / 3.4.22
Area Servizi ICT
23/05/2022