Authors | Olu-Ajayi, R., Alaka, H., Sulaimon, I., Balogun, H., Wusu, G., Yusuf, W. and Adegoke, M. |
---|
Abstract | The advancement of smart meters using evolving technologies such as the Internet of Things (IoT) is producing more data for the training of energy prediction models. Since many machine learning techniques were not premeditated to handle a large number of irrelevant features, it has engendered the search for optimal techniques to decrease the generated features and potentially identify the most relevant features that have an impact on building energy efficiency. Feature selection is considered one of the most suitable methods of pinpointing the best features combination. However, the fraction of studies that deliver comprehensive insights on the incorporation of feature selection with machine learning is still limited, notwithstanding the capabilities of feature selection to produce a good result in terms of accuracy and speed. To address this gap, this study investigates feature selection methods centred on building energy consumption prediction using machine learning. This study conducted a comparative analysis of 14 machine learning algorithms on 5 different data sizes and explored the effect of 7 feature selection methods on model performance for predicting energy consumption in buildings. Furthermore, this study identifies the most effective feature selection methods and machine learning models for energy use prediction. The experimental results demonstrate that feature selection can affect model’s performance positively or negatively, depending on the algorithm employed. Nevertheless, the filter method was noted as the most appropriate method for most Machine Learning (ML) classification algorithms. Moreover, Gradient Boosting (GB) was identified as the most effective model for predicting energy performance in buildings. Additionally, the reliability analysis confirms the hypothesis that “the larger the data, the more accurate the result” but only for specific algorithms such as Deep Neural Networks (DNN). This study also presents the theoretical and practical implications of this research. |
---|