Policy-based Reinforcement learning control for window opening and closing in an office building
Dalarna University, School of Technology and Business Studies, Microdata Analysis.
Dalarna University, School of Technology and Business Studies, Microdata Analysis.
2020 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

Occupants' window opening and closing behavior can strongly influence indoor comfort in an office building. If not properly managed, it affects not only the comfort level but also energy consumption. This behavior is hard to predict and control with conventional methods. For a system to be called smart, it must learn user behavior, which gives the controller valuable information. To control a window efficiently, we propose reinforcement learning (RL), which should be able to learn user behavior while maintaining an optimal indoor climate. The model-free nature of RL makes it simpler to develop an intelligent control system than with conventional techniques. The data in this thesis come from an office building in Beijing. Value-based RL has previously been applied to window control; in this thesis we apply policy-based RL (the REINFORCE algorithm) and compare the results with value-based RL (Q-learning), to understand which approach better suits the task and to explore how the two behave. We find that policy-based RL provides a favorable trade-off between maintaining an optimal indoor temperature and learning the occupant's behavior, which is important for a system to be called smart.
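To make the contrast between the two families concrete, the following is a minimal sketch of a tabular Q-learning update next to a REINFORCE update with a softmax policy. It is not the thesis's implementation. The five temperature bins, the target bin, the toy dynamics in `step`, the reward, and all hyperparameters are assumptions chosen only for illustration and do not reflect the Beijing data set.

```python
import math
import random

# Hypothetical toy setup: 5 indoor-temperature bins, bin 2 is "comfortable".
N_STATES, TARGET = 5, 2
CLOSE, OPEN = 0, 1  # the two window actions

def step(state, action):
    """Assumed dynamics: an open window cools by one bin, a closed one warms."""
    nxt = max(0, state - 1) if action == OPEN else min(N_STATES - 1, state + 1)
    return nxt, -abs(nxt - TARGET)  # reward penalises distance from the target

def softmax(prefs):
    """Turn action preferences into probabilities (numerically stable)."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]

def q_update(Q, s, a, r, s_next, alpha=0.5, gamma=0.9):
    """Value-based: one tabular Q-learning step toward the bootstrapped target."""
    Q[s][a] += alpha * (r + gamma * max(Q[s_next]) - Q[s][a])

def reinforce_update(theta, episode, alpha=0.01, gamma=0.9):
    """Policy-based: one REINFORCE update from a finished episode of (s, a, r).

    Uses the common simplification that omits the gamma**t factor in front
    of each step's gradient.
    """
    probs = [softmax(row) for row in theta]  # policy snapshot before updating
    G = 0.0
    for s, a, r in reversed(episode):
        G = r + gamma * G  # discounted return from this step onward
        for b in range(len(theta[s])):
            # Gradient of log softmax policy w.r.t. the preference theta[s][b].
            grad = (1.0 if b == a else 0.0) - probs[s][b]
            theta[s][b] += alpha * G * grad

def run_episode(theta, rng, length=10):
    """Sample one episode by following the current softmax policy."""
    s, episode = rng.randrange(N_STATES), []
    for _ in range(length):
        a = rng.choices([CLOSE, OPEN], weights=softmax(theta[s]))[0]
        s_next, r = step(s, a)
        episode.append((s, a, r))
        s = s_next
    return episode

if __name__ == "__main__":
    rng = random.Random(0)
    theta = [[0.0, 0.0] for _ in range(N_STATES)]
    Q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(2000):
        episode = run_episode(theta, rng)
        reinforce_update(theta, episode)       # learns from whole-episode returns
        for t, (s, a, r) in enumerate(episode[:-1]):
            q_update(Q, s, a, r, episode[t + 1][0])  # learns step by step
    print("policy P(open) per bin:", [round(softmax(row)[OPEN], 2) for row in theta])
    print("greedy Q action per bin:", [row.index(max(row)) for row in Q])
```

The structural difference is visible in the two update functions: Q-learning adjusts a value estimate after every transition and derives its behavior from those values, whereas REINFORCE waits for the full episode and adjusts the action probabilities directly in proportion to the observed return.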

Place, publisher, year, edition, pages
2020.
Keywords [en]
Markov decision processes, Policy-based Reinforcement learning, Value-based Reinforcement learning, Q-learning, REINFORCE, policy gradient, window control, indoor comfort level
National Category
Social Sciences
Identifiers
URN: urn:nbn:se:du-34420; OAI: oai:DiVA.org:du-34420; DiVA, id: diva2:1451039
Available from: 2020-07-02 Created: 2020-07-02

Open Access in DiVA

fulltext (877 kB), 438 downloads
File information
File name: FULLTEXT01.pdf; File size: 877 kB; Checksum (SHA-512):
655052f89f1a9c8c0532226515411fb069fd25e6b7aa5394def907bebc453294c585322ec36f35cbe4d26c3b3698b70bdbd20e9fa44d1236c1ba20d3088d9270
Type: fulltext; Mimetype: application/pdf
