Dalarna University's logo and link to the university's website

du.sePublications
Change search
Link to record
Permanent link

Direct link
Song, William Wei, ProfessorORCID iD iconorcid.org/0000-0003-3681-8173
Publications (10 of 56) Show all publications
Shu, S., Wang, M. & Song, W. W. (2025). An Approximation Computation Approach to the Enhancement of Big Data Analysis Methods for Time-Series Anomaly Detection of PV Systems. In: : . Paper presented at 10th International Conference on Cloud Computing and Big Data Analytics, Chengdu, China, 24-26 April 2025 (pp. 143-150). IEEE
Open this publication in new window or tab >>An Approximation Computation Approach to the Enhancement of Big Data Analysis Methods for Time-Series Anomaly Detection of PV Systems
2025 (English)Conference paper, Published paper (Refereed)
Abstract [en]

This paper follows the principles of the Big Data Analysis (BDA) framework to construct a set of experiments using a photovoltaic (PV) system dataset, providing an empirical evaluation of these principles. We propose a novel two-dimensional approach to improving time-series clustering and anomaly detection. Firstly, we enhance the DBSCAN clustering algorithm with LSTM-based residual forecasting (LSTM+DBSCAN) to capture temporal dependencies. Secondly, we incorporate incremental data ingestion to assess the scalability and adaptability of the framework. Our methodology builds upon the Approximation Computation Approach (ACA), which emphasizes a Pattern-Method-Assertion (PMA) framework for data analysis. By leveraging LSTM to extract meaningful temporal patterns and DBSCAN for clustering, we validate and extend the principles of ACA to the context of time-series anomaly detection. Our results demonstrate that the framework could provide guidelines for experiments in both the design of the methods and the datasets. © 2025 Elsevier B.V., All rights reserved.

Place, publisher, year, edition, pages
IEEE, 2025
Keywords
Anomaly Detection, Approximation Computation Approach, DBSCAN, Incremental Data Analysis, LSTM, Big data, Clustering algorithms, Data mining, Data reduction, Pattern recognition, Time series, Time series analysis, Analysis frameworks, Data analysis-methods, Incremental data, Incremental data analyze, Photovoltaic systems, Times series
National Category
Computer Sciences
Identifiers
urn:nbn:se:du-51440 (URN)10.1109/ICCCBDA64898.2025.11030498 (DOI)001600475300022 ()2-s2.0-105009411044 (Scopus ID)9798331530808 (ISBN)
Conference
10th International Conference on Cloud Computing and Big Data Analytics, Chengdu, China, 24-26 April 2025
Available from: 2025-10-13 Created: 2025-10-13 Last updated: 2026-01-13Bibliographically approved
Xu, Y., Liu, Z., Zha, Z., Wang, X. & Song, W. W. (2025). Short time step prediction of moisture content of construction materials based on VAR modeling. In: 2025 IEEE 17th International Conference on Computer Research and Development, ICCRD 2025: . Paper presented at 17th IEEE International Conference on Computer Research and Development, ICCRD 2025, Shangrao, China, 17-19 January 2025 (pp. 297-303). Institute of Electrical and Electronics Engineers Inc.
Open this publication in new window or tab >>Short time step prediction of moisture content of construction materials based on VAR modeling
Show others...
2025 (English)In: 2025 IEEE 17th International Conference on Computer Research and Development, ICCRD 2025, Institute of Electrical and Electronics Engineers Inc. , 2025, p. 297-303Conference paper, Published paper (Refereed)
Abstract [en]

The purpose of this paper is to explore the feasibility and effectiveness of utilizing Vector Autoregression (VAR) model to predict the moisture content of construction materials. The moisture content of construction materials has an important impact on their performance, durability and safety. Reasonable control of moisture content can prolong the service life of materials, improve the quality and efficiency of construction, reduce energy consumption, lower economic costs, and improve environmental adaptability. However, traditional water content prediction methods mostly rely on empirical data and simple statistical models, which are difficult to accurately capture the complex relationship between multiple variables. For this reason, this paper proposes a prediction method based on the VAR model in order to overcome the limitations of the traditional methods. Experimental results show that the VAR model exhibits high accuracy in predicting the water content of construction materials, and its MAE, RMSE, and MAPE are better than those of the commonly used deep learning models such as CNN and LSTM. This may be attributed to the lower computational complexity of the VAR model and its effective capture of stable relationships. © 2025 IEEE.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers Inc., 2025
Keywords
construction materials, Moisture content, prediction, time series, %moisture, Content of constructions, Economic costs, Material-based, Performance durability, Prediction methods, Reduce energy consumption, Time step, Times series, Vector autoregression models, Water content
National Category
Computer and Information Sciences Building Technologies
Identifiers
urn:nbn:se:du-50659 (URN)10.1109/ICCRD64588.2025.10962833 (DOI)2-s2.0-105004724130 (Scopus ID)9798331531881 (ISBN)
Conference
17th IEEE International Conference on Computer Research and Development, ICCRD 2025, Shangrao, China, 17-19 January 2025
Available from: 2025-05-27 Created: 2025-05-27 Last updated: 2025-10-09Bibliographically approved
Xu, Y., Ye, Q. & Song, W. W. (2024). A Random Forest Based Prediction Method for Moisture Content in Wood Materials. In: 2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA: . Paper presented at 2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2024, Chengdu, China, 25-27 April 2024 (pp. 89-94). IEEE conference proceedings
Open this publication in new window or tab >>A Random Forest Based Prediction Method for Moisture Content in Wood Materials
2024 (English)In: 2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA, IEEE conference proceedings, 2024, p. 89-94Conference paper, Published paper (Refereed)
Abstract [en]

Missing values is a crucial problem in the area of big data analysis, which hinders data integrity. Various regression methods have been employed for estimating missing values, but they exhibit significant prediction errors. To ensure the integrity of data collected from a wood sensor monitoring system and address the issue of data loss and anomaly, we propose a missing value estimation method based on the random forest regression model. This study focuses on the environmental data, including temperature, relative humidity, and absolute humidity surrounding the wood subjects. We simulate a number of methods on the data for comparison purpose. The experiment results in terms of prediction performance indicate that the random forest regression model algorithm we developed for estimating moisture content's missing values yields favourable outcomes with consistently low estimation errors. © 2024 IEEE.

Place, publisher, year, edition, pages
IEEE conference proceedings, 2024
Keywords
data analysis, missing value estimation, moisture content, random forest, Data handling, Forecasting, Forestry, Information analysis, Moisture determination, Random errors, Regression analysis, Data integrity, Missing values, Prediction errors, Prediction methods, Random forests, Regression method, Regression modelling, Sensor monitoring, Wood materials, Moisture
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:du-49382 (URN)10.1109/ICCCBDA61447.2024.10569555 (DOI)2-s2.0-85198460881 (Scopus ID)9798350373554 (ISBN)
Conference
2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2024, Chengdu, China, 25-27 April 2024
Available from: 2024-09-20 Created: 2024-09-20 Last updated: 2025-10-09Bibliographically approved
Nie, N., Guo, H. & Song, W. W. (2024). Authenticity Classification of WeChat Group Chat Messages Based on LDA and NLP. In: 2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA: . Paper presented at 2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2024, Chengdu, China, 25-27 April 2024 (pp. 313-319). Institute of Electrical and Electronics Engineers Inc.
Open this publication in new window or tab >>Authenticity Classification of WeChat Group Chat Messages Based on LDA and NLP
2024 (English)In: 2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA, Institute of Electrical and Electronics Engineers Inc. , 2024, p. 313-319Conference paper, Published paper (Refereed)
Abstract [en]

This study conducts an in-depth verification and analysis of the authenticity of information in WeChat group chats by integrating Latent Dirichlet Allocation (LDA) topic modeling with advanced Natural Language Processing (NLP) techniques such as XLNet and BERT. Leveraging LDA, the thematic structure of group chat content is revealed, and through the integration of NLP technologies like XLNet and BERT, a comprehensive analysis of the information is achieved. Experimental results demonstrate that our developed model performs exceptionally well in identifying the authenticity of information, confirming the effectiveness of this method in the domain of social media information verification. This research not only deepens our understanding of the authenticity of information in WeChat group chats but also provides a more effective tool for social media platforms to detect and prevent the spread of false information. It opens up a new perspective on social media information authentication research and points out future research directions. © 2024 IEEE.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers Inc., 2024
Keywords
information veracity, LDA (latent dirichlet allocation), NLP (natural language processing), WeChat group chats, Classification (of information), Modeling languages, Natural language processing systems, Social networking (online), Statistics, User profile, Language processing, Latent Dirichlet allocation, Natural language processing, Natural languages, Social media informations, Topic Modeling, Verification and analysis, Wechat group chat, Authentication
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:du-49383 (URN)10.1109/ICCCBDA61447.2024.10569975 (DOI)2-s2.0-85198443561 (Scopus ID)9798350373554 (ISBN)
Conference
2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2024, Chengdu, China, 25-27 April 2024
Available from: 2024-09-20 Created: 2024-09-20 Last updated: 2025-10-09Bibliographically approved
Zhu, Y., Wang, X., Zha, Z. & Song, W. W. (2024). Using Autoregressive Polynomial Regression Models to Study Moisture Content Dynamics in Wood. In: 2024 9th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA): . Paper presented at 2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2024, Chengdu, China, 25-27 April 2024 (pp. 21-27). IEEE conference proceedings
Open this publication in new window or tab >>Using Autoregressive Polynomial Regression Models to Study Moisture Content Dynamics in Wood
2024 (English)In: 2024 9th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA), IEEE conference proceedings, 2024, p. 21-27Conference paper, Published paper (Refereed)
Abstract [en]

This study explores the complex relationship between wood moisture content and environmental factors, temperature and relative humidity. Utilizing a novel Autoregressive Polynomial Regression Model (APRM), data from sensors placed in reconstituted bamboo and pine planks at various positions were analyzed. The APRM, adept at handling polynomial and interaction terms, revealed a nuanced, non-linear relationship between moisture content and environmental conditions. The research findings underscore significant material-specific differences in response to environmental changes. This study not only contributes to the understanding of wood-environment interactions but also demonstrates the efficacy of APRM in environmental science, providing a foundational approach for future research in this field. © 2024 IEEE.

Place, publisher, year, edition, pages
IEEE conference proceedings, 2024
Keywords
autoregressive polynomial regression model, data analysis, moisture content, pine planks, reconstituted bamboo, relative humidity, temperature, Moisture, Moisture determination, Polynomials, Regression analysis, Auto-regressive, Complex relationships, Environmental factors, Modeling data, Pine plank, Polynomial regression models, Temperature and relative humidity, Wood moisture content, Bamboo
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:du-49381 (URN)10.1109/ICCCBDA61447.2024.10569597 (DOI)2-s2.0-85198477425 (Scopus ID)9798350373554 (ISBN)
Conference
2024 9th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2024, Chengdu, China, 25-27 April 2024
Available from: 2024-09-20 Created: 2024-09-20 Last updated: 2025-10-09Bibliographically approved
Han, M., Zhu, Y., Song, W. W., Zhang, X., Shen, J., Zhao, J. & Chen, W. (2024). Using Genetic Algorithm to Control Ventilation Systems Based on Demand in a Single-Family House in Sweden. In: Encyclopedia of Sustainable Technologies, Second Edition: Volumes 1-4: (pp. 504-520). Elsevier, 1-4
Open this publication in new window or tab >>Using Genetic Algorithm to Control Ventilation Systems Based on Demand in a Single-Family House in Sweden
Show others...
2024 (English)In: Encyclopedia of Sustainable Technologies, Second Edition: Volumes 1-4, Elsevier , 2024, Vol. 1-4, p. 504-520Chapter in book (Other academic)
Abstract [en]

Building ventilation system needs to be controlled in a smart way to maintain indoor air quality while reducing energy use. Although many demand-controlled methods have been developed, the design of ventilation schedule has to be customized depending on local climate, occupant behavior and system capacity. This article introduces an easy-to-use control strategy based on mathematical modeling, clustering and genetic algorithm. Experimental results improve the performance of current system in an example house and provide a data-driven framework. © 2024 Elsevier Inc. All rights are reserved.

Place, publisher, year, edition, pages
Elsevier, 2024
Keywords
Clustering, Data-driven method, Demand-controlled ventilation, Energy efficiency, Genetic algorithm, Indoor air quality, Mathematical modeling, Occupancy, Optimization, Smart system
National Category
Building Technologies Energy Systems
Identifiers
urn:nbn:se:du-50635 (URN)10.1016/B978-0-323-90386-8.00003-6 (DOI)2-s2.0-105000576296 (Scopus ID)9780323903868 (ISBN)9780443222870 (ISBN)
Available from: 2025-05-20 Created: 2025-05-20 Last updated: 2025-10-09Bibliographically approved
Zhu, Y., Song, W. W., Wang, X., Rybarczyk, Y., Nyberg, R. G. & Fei, B. (2023). A Novel Approach to Discovering Hygrothermal Transfer Patterns in Wooden Building Exterior Walls. Buildings, 13(9), Article ID 2151.
Open this publication in new window or tab >>A Novel Approach to Discovering Hygrothermal Transfer Patterns in Wooden Building Exterior Walls
Show others...
2023 (English)In: Buildings, E-ISSN 2075-5309, Vol. 13, no 9, article id 2151Article in journal (Refereed) Published
Abstract [en]

To maintain the life of building materials, it is critical to understand the hygrothermal transfer mechanisms (HTM) between the walls and the layers inside the walls. Due to the extreme instability of weather data, the actual data models of the HTM—the data being collected for actual buildings using modern sensor technologies—would appear to be a great difference from any theoretical models, in particular, for wood building materials. In this paper, we aim to consider a variety of data analysis tools for hygrothermal transfer features. A novel approach for peak and valley detection is proposed based on the discrete differentiation of the original data. Not to be limited to the measure of peak and valley delays for HTM, we propose a cross-correlation analysis to obtain the general delay between two daily time series, which seems to be representative of the delay in the daily time series. Furthermore, the seasonal pattern of the hygrothermal transfer combined with the correlation analysis reveals a reasonable relationship between the delays and the indoor and outdoor climates. © 2023 by the authors.

Place, publisher, year, edition, pages
MDPI, 2023
Keywords
building exterior wall, data-driven approach, hygrothermal transfer mechanisms, transfer patterns
National Category
Building Technologies
Identifiers
urn:nbn:se:du-47087 (URN)10.3390/buildings13092151 (DOI)001076493300001 ()2-s2.0-85172805131 (Scopus ID)
Available from: 2023-10-09 Created: 2023-10-09 Last updated: 2025-10-09Bibliographically approved
Xu, Y., Wu, G. & Song, W. W. (2023). A Predictive Study of ARIMA Model Based on Multi-Bayesian Estimation Method Fused Data on Building Materials Environment. In: ACM International Conference Proceeding Series: . Paper presented at 6th International Conference on Big Data Technologies, ICBDT 2023 (pp. 22-27). ACM Digital Library
Open this publication in new window or tab >>A Predictive Study of ARIMA Model Based on Multi-Bayesian Estimation Method Fused Data on Building Materials Environment
2023 (English)In: ACM International Conference Proceeding Series, ACM Digital Library, 2023, p. 22-27Conference paper, Published paper (Refereed)
Abstract [en]

Due to the uncertainty and inconsistency of measurement data from multiple sensors in the same space, a multi-sensor data fusion algorithm is used to fuse the measurement data of multiple nodes. We propose a multi-Bayesian estimation method for fusing multi-sensor data, and combine Bayesian estimation with ARIMA model to predict the ambient temperature of bamboo and wood building materials. It can utilize the redundancy of data to reduce this uncertainty and improve the reliability of subsequent predictions. © 2023 ACM.

Place, publisher, year, edition, pages
ACM Digital Library, 2023
Keywords
ARIMA, Building material environment prediction, Data fusion, Multi-Bayesian estimation
National Category
Information Systems
Identifiers
urn:nbn:se:du-47621 (URN)10.1145/3627377.3627381 (DOI)2-s2.0-85180132052 (Scopus ID)
Conference
6th International Conference on Big Data Technologies, ICBDT 2023
Available from: 2024-01-02 Created: 2024-01-02 Last updated: 2025-10-09Bibliographically approved
Zhu, Y., Song, W. W., Nyberg, R. G., Rybarczyk, Y. & Wang, X. (2023). A Review on Data-driven Methods for Studying Hygrothermal Transfer in Building Exterior Walls. In: ICBDT '23: Proceedings of the 2023 6th International Conference on Big Data Technologies: . Paper presented at 6th International Conference on Big Data Technologies, ICBDT 2023 (pp. 33-41). ACM Press
Open this publication in new window or tab >>A Review on Data-driven Methods for Studying Hygrothermal Transfer in Building Exterior Walls
Show others...
2023 (English)In: ICBDT '23: Proceedings of the 2023 6th International Conference on Big Data Technologies, ACM Press, 2023, p. 33-41Conference paper, Published paper (Refereed)
Abstract [en]

This review aims to comprehensively assess and synthesize the existing literature on the use of data-driven methods for studying hygrothermal transfer in building exterior walls. The review is conducted by an exhaustive search strategy to identify relevant articles from Web of Science and Scopus databases. There are 20 eligible studies included in this review following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) protocol. The most used data-driven methods are traditional neural networks, such as Multi-Layer Perceptrons and 2D Convolutional Neural Networks. Results suggested that neural network models hold potential for accurately predicting hygrothermal attributes of building exteriors. However, a conspicuous gap in the literature is the absence of studies drawing direct comparisons between data-driven methodologies and conventional simulation techniques. © 2023 ACM.

Place, publisher, year, edition, pages
ACM Press, 2023
Series
ACM International Conference Proceeding Series
Keywords
Hygrothermal performance, Machine learning, Statistical learning, Systematic review
National Category
Energy Systems
Identifiers
urn:nbn:se:du-47617 (URN)10.1145/3627377.3627409 (DOI)2-s2.0-85180131187 (Scopus ID)
Conference
6th International Conference on Big Data Technologies, ICBDT 2023
Available from: 2024-01-02 Created: 2024-01-02 Last updated: 2025-10-09Bibliographically approved
Song, W. W. (2023). An Approximation Computation Approach to Big Data Analysis with a Case Analysis of PV System. In: 8th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2023: . Paper presented at 8th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2023, Chengdu, 26-28 April 2023 (pp. 44-52). Institute of Electrical and Electronics Engineers Inc.
Open this publication in new window or tab >>An Approximation Computation Approach to Big Data Analysis with a Case Analysis of PV System
2023 (English)In: 8th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2023, Institute of Electrical and Electronics Engineers Inc. , 2023, p. 44-52Conference paper, Published paper (Refereed)
Abstract [en]

In the era of big data, it is indispensable to apply data science and technology for big data analysis to solve the big data problems. With the advancement of the big data technologies, we are also facing many problems when dealing with the big data and their studies. It is obvious that big data become "bigger"and "bigger", more complex than before, with a good number of attributes and features in various formats and styles. On the other hand, many data analysis techniques have been proposed for various application domain problems in different purposes. This worsens the situation of choosing an appropriate method for a right problem of right data. In this paper, the author intends to propose an approximation approach toward this problem, through discussing the ways of identification of patterns of the original data, be they of data features or analysis methods. The author attempts to apply the idea to a case of fault detection of a household photovoltaic system. © 2023 IEEE.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers Inc., 2023
Keywords
approximation, big data, data analysis, fault detection, methodologies, photovoltaic systems, Data handling, Information analysis, Case analysis, Data analysis techniques, Data problems, Data technologies, Faults detection, Methodology, PV system, Science and Technology
National Category
Information Systems
Identifiers
urn:nbn:se:du-46631 (URN)10.1109/ICCCBDA56900.2023.10154583 (DOI)001021400000009 ()2-s2.0-85164675555 (Scopus ID)9781665455336 (ISBN)
Conference
8th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2023, Chengdu, 26-28 April 2023
Available from: 2023-08-04 Created: 2023-08-04 Last updated: 2025-10-09Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-3681-8173

Search in DiVA

Show all publications