On-line Building Energy Optimization using Deep Reinforcement Learning

Published 18 Jul 2017 in cs.LG, cs.AI, and math.OC | (1707.05878v1)

Abstract: Unprecedented volumes of data are becoming available with the growth of the advanced metering infrastructure. These are expected to benefit planning and operation of the future power system, and to help customers transition from a passive to an active role. In this paper, we explore for the first time in the smart grid context the benefits of using Deep Reinforcement Learning, a hybrid class of methods that combines Reinforcement Learning with Deep Learning, to perform on-line optimization of schedules for building energy management systems. The learning procedure was explored using two methods, Deep Q-learning and Deep Policy Gradient, both extended to perform multiple actions simultaneously. The proposed approach was validated on the large-scale Pecan Street Inc. database. This high-dimensional database includes information about photovoltaic power generation, electric vehicles, and building appliances. Moreover, these on-line energy scheduling strategies could be used to provide real-time feedback to consumers to encourage more efficient use of electricity.

Citations (439)

Summary

  • The paper demonstrates that DRL methods, particularly DQN and DPG, can effectively optimize building energy consumption.
  • The paper highlights significant reductions in peak demand (up to 26.3%) and operational costs (up to 27.4%) across various scenarios.
  • The paper shows that DRL models adapt to dynamic energy systems, proving their scalability across clusters of up to 48 buildings.

Deep Reinforcement Learning for Building Energy Optimization

The paper "On-line Building Energy Optimization using Deep Reinforcement Learning" explores the application of Deep Reinforcement Learning (DRL) to energy management in building systems within a smart grid context. The approach aims to minimize building energy consumption or cost by scheduling the activity of electrical devices. The authors leverage the Pecan Street Inc. database, a comprehensive high-dimensional dataset, to demonstrate the efficacy of the proposed methods. The implications for how buildings interact with the power grid are significant: such scheduling could smooth demand curves and ease the integration of renewable resources like solar and wind energy.

The core analytical focus of this paper rests on two DRL implementations: Deep Q-learning (DQN) and Deep Policy Gradient (DPG). Both are employed to improve decision-making in complex energy systems while processing vast streams of incoming data in real time. DQN is recognized for its strength in approximating value functions, whereas DPG handles large action spaces more efficiently, such as those arising from the simultaneous operation of multiple devices. Both algorithms are extended to select multiple concurrent actions, catering to the dynamic demands of building energy systems.
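To make the multi-action extension concrete, here is a minimal tabular sketch of Q-learning over a joint action space, where one "action" is a tuple of per-device on/off decisions. The device names, toy state labels, and hyperparameters are illustrative assumptions, not the paper's implementation; the paper uses deep networks rather than a table.

```python
import itertools

# Hypothetical device set; each device can be off (0) or on (1).
DEVICES = ["heat_pump", "ev_charger", "dishwasher"]
# Joint action space: every on/off combination across devices.
JOINT_ACTIONS = list(itertools.product([0, 1], repeat=len(DEVICES)))

ALPHA, GAMMA = 0.1, 0.95  # learning rate and discount (assumed values)
q_table = {}  # (state, joint_action) -> estimated value

def q(state, action):
    return q_table.get((state, action), 0.0)

def update(state, action, reward, next_state):
    """One Q-learning backup over the joint action tuple."""
    best_next = max(q(next_state, a) for a in JOINT_ACTIONS)
    q_table[(state, action)] = q(state, action) + ALPHA * (
        reward + GAMMA * best_next - q(state, action)
    )

def greedy_action(state):
    """Pick the joint action with the highest estimated value."""
    return max(JOINT_ACTIONS, key=lambda a: q(state, a))
```

A deep variant would replace `q_table` with a network, which is roughly how the paper's DQN extension handles multiple simultaneous device decisions; DPG instead parameterizes the policy directly.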

This study adopts a Markov Decision Process (MDP) framework to model the stochastic nature of energy flows and user behavior. The focus on demand-side management captures the stochastic characteristics of renewable energy sources and user interactions, positioning the research well within the current landscape of efforts to harness Artificial Intelligence for energy system optimization.
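The MDP framing can be sketched as follows. The state variables, the cost-based reward, and the assumption that PV generation simply offsets load are illustrative choices for this sketch, not the paper's exact formulation.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class State:
    hour: int          # time of day, 0-23
    price: float       # electricity price, $/kWh
    pv_kw: float       # current photovoltaic output, kW

def reward(state: State, device_load_kw: float) -> float:
    """Negative cost of energy drawn from the grid this step.
    PV generation offsets the load; surplus PV earns nothing here."""
    grid_kw = max(device_load_kw - state.pv_kw, 0.0)
    return -state.price * grid_kw

def transition(state: State, next_price: float, next_pv: float) -> State:
    """Exogenous dynamics: price and PV evolve stochastically;
    the agent controls device loads, not the environment."""
    return State(hour=(state.hour + 1) % 24, price=next_price, pv_kw=next_pv)
```

The stochasticity lives in how `next_price` and `next_pv` are drawn; the agent's scheduling policy only chooses `device_load_kw` at each step.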

Main Findings

  • Energy and Cost Optimization: The numerical results for both peak reduction and cost minimization highlight that optimized strategies can significantly lower building energy demand peaks and overall operational costs. Specifically, the DPG method is noted to achieve superior performance over DQN, resulting in a peak demand reduction ranging from 22.9% to 26.3% across different building scenarios, and cost reductions of 20.8% to 27.4%.
  • Scalability Across Multiple Buildings: When applied at scale, the proposed DRL techniques manage to maintain their optimization efficacy across clusters of up to 48 buildings. This scalability is crucial for advancing these methods from theoretical constructs to practicable deployments on smart grid infrastructure.
  • Learning and Adaptation Capabilities: The DRL models exhibit efficient learning capabilities by adapting to different building conditions and dynamic pricing signals over multiple episodes, highlighting an advantage in learning optimal strategies for diverse energy consumption patterns over time.
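The episodic learning and adaptation described above can be pictured as a simple epsilon-greedy loop over a time-varying price signal. The time-of-use tariff shape, decay schedule, and 1 kW load are assumptions for the sketch; the actual paper learns from the Pecan Street data with deep networks.

```python
import random

random.seed(0)

# Toy time-of-use tariff: expensive evening peak, cheap otherwise.
def price(hour):
    return 0.40 if 17 <= hour <= 20 else 0.10

EPS_START, EPS_END, DECAY = 1.0, 0.05, 0.99

def run_episodes(n_episodes, act, learn):
    """Generic episode loop: explore with decaying epsilon, then
    hand (state, action, reward) transitions to the learner."""
    eps = EPS_START
    returns = []
    for _ in range(n_episodes):
        total = 0.0
        for hour in range(24):
            if random.random() < eps:
                action = random.choice([0, 1])   # explore
            else:
                action = act(hour)               # exploit learned policy
            r = -price(hour) * action            # cost of running a 1 kW load
            learn(hour, action, r)
            total += r
        returns.append(total)
        eps = max(EPS_END, eps * DECAY)
    return returns, eps
```

Because the price signal re-enters the reward every step, any learner plugged in via `learn` adapts automatically when the tariff changes, which is the adaptation property the findings emphasize.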

Implications for AI and Future Research

The methods outlined in this paper underscore potential pathways for integrating AI in smart grids and energy management systems. DRL algorithms, with their capability to manage high-dimensional datasets in an adaptive, real-time manner, offer promising scalability and efficiency improvements over traditional optimization methods. Given the inherent complexity and variability of energy consumption patterns, advanced machine learning solutions provide a robust framework for enhancing decision-making in energy scheduling and reducing reliance on computationally expensive heuristic methods.

Future research directions might include extending this analysis to other types of energy-consuming devices and integrating broader external datasets, such as weather forecasts, to bolster prediction accuracy. Additionally, exploring how other AI frameworks could complement DRL approaches might further expand their optimization capabilities.

Overall, this research presents a forward-thinking exploration of DRL applications in building energy optimization, showcasing both its practical merits and its alignment with the broader goals of sustainable energy management and smart grid evolution.
