Dalarna University's logo and link to the university's website

du.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • chicago-author-date
  • chicago-note-bibliography
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Deep Learning Approach for Assessing Air Quality During COVID-19 Lockdown in Quito
Dalarna University, School of Information and Engineering, Microdata Analysis.ORCID iD: 0000-0003-0525-7617
Grupo de Biodiversidad Medio Ambiente y Salud, Universidad de Las Américas, Quito, Ecuador.
Dalarna University, School of Information and Engineering, Microdata Analysis.
Dalarna University, School of Information and Engineering, Microdata Analysis.ORCID iD: 0000-0002-3650-9162
2022 (English)In: Frontiers in Big Data, ISSN 2624-909X, Vol. 5, article id 842455Article in journal (Refereed) Published
Abstract [en]

Weather Normalized Models (WNMs) are modeling methods used for assessing air contaminants under a business-as-usual (BAU) assumption. Therefore, WNMs are used to assess the impact of many events on urban pollution. Recently, different approaches have been implemented to develop WNMs and quantify the lockdown effects of COVID-19 on air quality, including Machine Learning (ML). However, more advanced methods, such as Deep Learning (DL), have never been applied for developing WNMs. In this study, we proposed WNMs based on DL algorithms, aiming to test five DL architectures and compare their performances to a recent ML approach, namely Gradient Boosting Machine (GBM). The concentrations of five air pollutants (CO, NO<sub>2</sub>, PM<sub>2.5</sub>, SO<sub>2</sub>, and O<sub>3</sub>) are studied in the city of Quito, Ecuador. The results show that Long-Short Term Memory (LSTM) and Bidirectional Recurrent Neural Network (BiRNN) outperform the other algorithms and, consequently, are recommended as appropriate WNMs to quantify the effects of the lockdowns on air pollution. Furthermore, examining the variable importance in the LSTM and BiRNN models, we identify that the most relevant temporal and meteorological features for predicting air quality are Hours (time of day), Index (1 is the first collected data and increases by one after each instance), Julian Day (day of the year), Relative Humidity, Wind Speed, and Solar Radiation. During the full lockdown, the concentration of most pollutants has decreased drastically: −48.75%, for CO, −45.76%, for SO<sub>2</sub>, −42.17%, for PM<sub>2.5</sub>, and −63.98%, for NO<sub>2</sub>. The reduction of this latter gas has induced an increase of O<sub>3</sub> by +26.54%.

Place, publisher, year, edition, pages
2022. Vol. 5, article id 842455
Keywords [en]
air pollution, machine learning, deep learning - artificial neural network (DL-ANN), data-driven modeling and optimization, COVID-19
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:du-41194DOI: 10.3389/fdata.2022.842455ISI: 000790572900001PubMedID: 35445191Scopus ID: 2-s2.0-85128467880OAI: oai:DiVA.org:du-41194DiVA, id: diva2:1649677
Available from: 2022-04-04 Created: 2022-04-04 Last updated: 2024-01-29Bibliographically approved
In thesis
1. Machine Learning Approaches to Develop Weather Normalize Models for Urban Air Quality
Open this publication in new window or tab >>Machine Learning Approaches to Develop Weather Normalize Models for Urban Air Quality
2024 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

According to the World Health Organization, almost all human population (99%) lives in 117 countries with over 6000 cities, where air pollutant concentration exceeds recommended thresholds. The most common, so-called criteria, air pollutants that affect human lives, are particulate matter (PM) and gas-phase (SO2, CO, NO2, O3 and others). Therefore, many countries or regions worldwide have imposed regulations or interventions to reduce these effects. Whenever an intervention occurs, air quality changes due to changes in ambient factors, such as weather characteristics and human activities. One approach for assessing the effects of interventions or events on air quality is through the use of the Weather Normalized Model (WNM). However, current deterministic models struggle to accurately capture the complex, non-linear relationship between pollutant concentrations and their emission sources. Hence, the primary objective of this thesis is to examine the power of machine learning (ML) and deep learning (DL) techniques to develop and improve WNMs. Subsequently, these enhanced WNMs are employed to assess the impact of events on air quality. Furthermore, these ML/DL-based WNMs can serve as valuable tools for conducting exploratory data analysis (EDA) to uncover the correlations between independent variables (meteorological and temporal features) and air pollutant concentrations within the models. 

It has been discovered that DL techniques demonstrated their efficiency and high performance in different fields, such as natural language processing, image processing, biology, and environment. Therefore, several appropriate DL architectures (Long Short-Term Memory - LSTM, Recurrent Neural Network - RNN, Bidirectional Recurrent Neural Network - BIRNN, Convolutional Neural Network - CNN, and Gated Recurrent Unit - GRU) were tested to develop the WNMs presented in Paper I. When comparing these DL architectures and Gradient Boosting Machine (GBM), LSTM-based methods (LSTM, BiRNN) have obtained superior results in developing WNMs. The study also showed that our WNMs (DL-based) could capture the correlations between input variables (meteorological and temporal variables) and five criteria contaminants (SO2, CO, NO2, O3 and PM2.5). This is because the SHapley Additive exPlanations (SHAP) library allowed us to discover the significant factors in DL-based WNMs. Additionally, these WNMs were used to assess the air quality changes during COVID-19 lockdown periods in Ecuador. The existing normalized models operate based on the original units of pollutants and are designed for assessing pollutant concentrations under “average” or consistent weather conditions. Predicting pollution peaks presents an even greater challenge because they often lack discernible patterns. To address this, we enhanced the Weather Normalized Models (WNMs) to boost their performance specifically during daily concentration peak conditions. In the second paper, we accomplished this by developing supervised learning techniques, including Ensemble Deep Learning methods, to distinguish between daily peak and non-peak pollutant concentrations. This approach offers flexibility in categorizing pollutant concentrations as either daily concentration peaks or non-daily concentration peaks. However, it is worth noting that this method may introduce potential bias when selecting non-peak values. In the third paper, WNMs are directly applied to daily concentration peaks to predict and analyse the correlations between meteorological, temporal features and daily concentration peaks of air pollutants.

Place, publisher, year, edition, pages
Borlänge: Högskolan Dalarna, 2024
Series
Dalarna Licentiate Theses ; 20
Keywords
Weather Normalized Models (WNMs), Air Pollution, Data-Driven Modeling and Optimization, Deep Learning - Artificial Neural Network (DL-ANN), Machine Learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:du-47290 (URN)978-91-88679-56-7 (ISBN)
Presentation
2024-02-20, room Clas Ohlson, Campus Borlänge, 14:00 (English)
Opponent
Supervisors
Available from: 2024-01-29 Created: 2023-11-21 Last updated: 2024-01-29Bibliographically approved

Open Access in DiVA

fulltext(1950 kB)170 downloads
File information
File name FULLTEXT01.pdfFile size 1950 kBChecksum SHA-512
3017a45a35fe7900edef888a2ece31ef626fc1bb0591d1accc3d4ec02286cb7bb3a513054a0017b1fd4288422ff3c4cb489400237b29e542669a9ff769accfd5
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Ngoc Phuong, ChauThomas, IliasRybarczyk, Yves

Search in DiVA

By author/editor
Ngoc Phuong, ChauThomas, IliasRybarczyk, Yves
By organisation
Microdata Analysis
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 170 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 394 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • chicago-author-date
  • chicago-note-bibliography
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf