TY - JOUR
T1 - What is the spatiotemporal pattern of benzene concentration spread over susceptible area surrounding the Hartman Park community, Houston, Texas?
AU - Asri, Aji Kusumaning
AU - Newman, Galen D.
AU - Tao, Zhihan
AU - Zhu, Rui
AU - Chen, Hsiu Ling
AU - Lung, Shih Chun Candice
AU - Wu, Chih Da
N1 - This study was funded by the National Science and Technology Council, R.O.C.- Taiwan (NSTC 112-2121-M-006-015-; NSTC 112-2123-M-001-008-; NSTC 112-2121-M-006-004-) and the National Institute of Environmental Health Sciences Superfund Grant #P42ES027704-01. This work was also supported partially by the Research Center for Precision Environmental Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE), Taiwan and by Kaohsiung Medical University Research Center Grant (KMU-TC113A01). The data availability was supported by the City of Houston Geographic Information System (COHGIS), Houston-Galveston Area Council (H-GAC); the National Aeronautics and Space Administration (NASA), and the United States Geological Survey (USGS) which provided satellite-derived data.
This study was granted by the National Science and Technology Council, Taiwan (MOST 110-2628-M-006-001-MY3; NSTC 112-2121-M-006-015-; NSTC 112-2123-M-001-008-; NSTC 112-2121-M-006-004-), and the \u201CInnovation and Development Center of Sustainable Agriculture\u201D from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan. Additional funding support was provided by the Research Center for Precision Environmental Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan and by Kaohsiung Medical University Research Center Grant (KMU-TC113A01). We also acknowledge support from the National Institute of Environmental Health Sciences Superfund Grant #P42ES027704\u201301. Data availability was supported by the City of Houston Geographic Information System (COHGIS), Houston-Galveston Area Council (H-GAC), the National Aeronautics and Space Administration (NASA), and the United States Geological Survey (USGS) for providing satellite-derived data.
PY - 2024/8/5
Y1 - 2024/8/5
N2 - The Hartman Park community in Houston, Texas-USA, is in a highly polluted area which poses significant risks to its predominantly Hispanic and lower-income residents. Surrounded by dense clustering of industrial facilities compounds health and safety hazards, exacerbating environmental and social inequalities. Such conditions emphasize the urgent need for environmental measures that focus on investigating ambient air quality. This study estimated benzene, one of the most reported pollutants in Hartman Park, using machine learning-based approaches. Benzene data was collected in residential areas in the neighborhood and analyzed using a combination of five machine-learning algorithms (i.e., XGBR, GBR, LGBMR, CBR, RFR) through a newly developed ensemble learning model. Evaluations on model robustness, overfitting tests, 10-fold cross-validation, internal and stratified validation were performed. We found that the ensemble model depicted about 98.7% spatial variability of benzene (Adj. R2 =0.987). Through rigorous validations, stability of model performance was confirmed. Several predictors that contribute to benzene were identified, including temperature, developed intensity areas, leaking petroleum storage tank, and traffic-related factors. Analyzing spatial patterns, we found high benzene spread over areas near industrial zones as well as in residential areas. Overall, our study area was exposed to high benzene levels and requires extra attention from relevant authorities.
AB - The Hartman Park community in Houston, Texas-USA, is in a highly polluted area which poses significant risks to its predominantly Hispanic and lower-income residents. Surrounded by dense clustering of industrial facilities compounds health and safety hazards, exacerbating environmental and social inequalities. Such conditions emphasize the urgent need for environmental measures that focus on investigating ambient air quality. This study estimated benzene, one of the most reported pollutants in Hartman Park, using machine learning-based approaches. Benzene data was collected in residential areas in the neighborhood and analyzed using a combination of five machine-learning algorithms (i.e., XGBR, GBR, LGBMR, CBR, RFR) through a newly developed ensemble learning model. Evaluations on model robustness, overfitting tests, 10-fold cross-validation, internal and stratified validation were performed. We found that the ensemble model depicted about 98.7% spatial variability of benzene (Adj. R2 =0.987). Through rigorous validations, stability of model performance was confirmed. Several predictors that contribute to benzene were identified, including temperature, developed intensity areas, leaking petroleum storage tank, and traffic-related factors. Analyzing spatial patterns, we found high benzene spread over areas near industrial zones as well as in residential areas. Overall, our study area was exposed to high benzene levels and requires extra attention from relevant authorities.
KW - Benzene concentration
KW - Ensemble learning model
KW - Hazardous pollution zone
KW - Machine-learning algorithms
KW - Vulnerable community
UR - http://www.scopus.com/inward/record.url?scp=85194366051&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85194366051&partnerID=8YFLogxK
U2 - 10.1016/j.jhazmat.2024.134666
DO - 10.1016/j.jhazmat.2024.134666
M3 - Article
C2 - 38815389
AN - SCOPUS:85194366051
SN - 0304-3894
VL - 474
JO - Journal of Hazardous Materials
JF - Journal of Hazardous Materials
M1 - 134666
ER -