logo-polimi
Loading...
Degree programme
Programme Structure
Show/Search Programme
Degree Programme
International context
Customized Schedule
Your customized time schedule has been disabled
Enable
Search
Search a Professor
Professor's activities
Search a Course
Search a Course (system prior D.M. n. 509)
Search Lessons taught in English
Information on didactic, research and institutional assignments on this page are certified by the University; more information, prepared by the professor, are available on the personal web page and in the curriculum vitae indicated on this webpage.
Information on professor
ProfessorRestelli Marcello
QualificationAssociate professor
Belonging DepartmentDipartimento di Elettronica, Informazione e Bioingegneria
Scientific-Disciplinary SectorING-INF/05 - Information Processing Systems
Curriculum Vitae--
OrcIDhttps://orcid.org/0000-0002-6322-1076

Contacts
Professor's office hours
DepartmentFloorOfficeDayTimetableTelephoneFaxNotes
DEI----WednesdayFrom 11:00
To 13:00
4015--Si consiglia di prendere appuntamento via email con il docente
E-mailmarcello.restelli@polimi.it
Professor's personal website--

Data source: RE.PUBLIC@POLIMI - Research Publications at Politecnico di Milano

List of publications and reserach products for the year 2019
No product yet registered in the year 2019


List of publications and reserach products for the year 2018 (Show all details | Hide all details)
Type Title of the Publicaiton/Product
Journal Articles
Improving multi-armed bandit algorithms in online pricing settings (Show >>)
Conference proceedings
Importance Weighted Transfer of Samples in Reinforcement Learning (Show >>)
Configurable Markov Decision Processes (Show >>)
Stochastic Variance-Reduced Policy Gradient (Show >>)
A Combinatorial-Bandit Algorithm for the Online Joint Bid/Budget Optimization of Pay-per-Click Advertising Campaigns (Show >>)
Does Reinforcement Learning outperform PID in the control of FES-induced elbow flex-extension? (Show >>)
Reinforcement Learning Control of Functional Electrical Stimulation of the upper limb: a feasibility study. (Show >>)
An upper limb Functional Electrical Stimulation controller based on Reinforcement Learning: A feasibility case study. (Show >>)
Targeting Optimization for Internet Advertising by Learning from Logged Bandit Feedback (Show >>)


List of publications and reserach products for the year 2017 (Show all details | Hide all details)
Type Title of the Publicaiton/Product
Conference proceedings
Adaptive Batch Size for Safe Policy Gradients (Show >>)
Estimating the maximum expected value in continuous reinforcement learning problems (Show >>)
Compatible Reward Inverse Reinforcement Learning (Show >>)
Boosted Fitted Q-Iteration (Show >>)
Exploiting structure and uncertainty of Bellman updates in Markov decision processes (Show >>)
Gradient-based minimization for multi-expert Inverse Reinforcement Learning (Show >>)
Designing Learning Algorithms over the Sequence Form of an Extensive-Form Game (Show >>)
User context estimation for public travel assistance and intelligent service scheduling (Show >>)


List of publications and reserach products for the year 2016 (Show all details | Hide all details)
Type Title of the Publicaiton/Product
Journal Articles
Extensive-form games with heterogeneous populations: solution concepts, equilibria characterization, learning dynamics (Show >>)
Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation (Show >>)
Policy Search for the Optimal Control of Markov Decision Processes: A Novel Particle-Based Iterative Scheme (Show >>)
Conference proceedings
Sequence-Form and Evolutionary Dynamics: Realization Equivalence to Agent Form and Logit Dynamics (Show >>)
Inverse Reinforcement Learning through Policy Gradient Minimization (Show >>)
Estimating Maximum Expected Value through Gaussian Approximation (Show >>)


List of publications and reserach products for the year 2015 (Show all details | Hide all details)
Type Title of the Publicaiton/Product
Journal Articles
Policy gradient in Lipschitz Markov Decision Processes (Show >>)
Sparse multi-task reinforcement learning (Show >>)
Conference proceedings
Estimating a mean-path from a set of 2-D curves (Show >>)
Following Newton direction in Policy Gradient with parameter exploration (Show >>)
Multi-objective reinforcement learning with continuous pareto frontier approximation (Show >>)
manifesti v. 3.1.3 / 3.1.3
Area Servizi ICT
17/10/2019