Projekter pr. år
Abstract
This paper proposes an adaptive controller based on Reinforcement Learning (RL), which copes with HVAC-systems consisting of slow thermodynamics. Two different RL algorithms with Q-Networks (QNs) are investigated. The HVAC-system is in this study an underfloor heating system. Underfloor heating is of great interest because it is very common in Scandinavia, but this research can be applied to a wide range of HVAC-systems, industrial processes and other control applications that are dominated by very slow dynamics. The environments consist of one, two, and four zones within a house in a simulation environment meaning that agents will be exposed to gradually more complex environments separated into test levels. The novelty of this paper is the incorporation of two different RL algorithms for industrial process control; a QN and a QN + Eligibility Trace (QN+ET). The reason for using eligibility trace is that an underfloor heating environment is dominated by slow dynamics and by using eligibility trace the agent can find correlations between the reward and actions taken in earlier iterations.
Originalsprog | Engelsk |
---|---|
Titel | 29th International Conference on Flexible Automation and Intelligent Manufacturing : FAIM 2019 |
Antal sider | 8 |
Vol/bind | 38 |
Forlag | Elsevier |
Publikationsdato | 2019 |
Sider | 1308-1315 |
DOI | |
Status | Udgivet - 2019 |
Begivenhed | 29th International Conference on Flexible Automation and Intelligent Manufacturing (FAIM2019) - Limerick, Irland Varighed: 24 jun. 2019 → 28 jun. 2019 https://faim2019.org |
Konference
Konference | 29th International Conference on Flexible Automation and Intelligent Manufacturing (FAIM2019) |
---|---|
Land/Område | Irland |
By | Limerick |
Periode | 24/06/2019 → 28/06/2019 |
Internetadresse |
Navn | Procedia Manufacturing |
---|---|
ISSN | 2351-9789 |
Emneord
- Reinforcement Learning
- Artificial intelligence
- Sustainable Manufacturing Engineering
- HVAC Systems
Fingeraftryk
Dyk ned i forskningsemnerne om 'Control of HVAC-systems with Slow Thermodynamic Using Reinforcement Learning'. Sammen danner de et unikt fingeraftryk.Projekter
- 1 Afsluttet
-
Reinforcement Learning Baseret Styring til Gulvvarme Systemer
Bøgh, S. (PI (principal investigator)) & Blad, C. (Andet)
01/01/2019 → 31/12/2021
Projekter: Projekt › Forskning