Skip to main content
Journal cover image

Optimizing models for the prediction of one step ahead extreme flows to wastewater treatment plants using different synthetic sampling methods.

Publication ,  Journal Article
Musaazi, IG; Liu, L; Shaw, A; Zaniolo, M; Stadler, LB; Delgado Vela, J
Published in: Journal of environmental management
September 2025

High-flow events that significantly impact Water Resource Recovery Facility (WRRF) operations are rare, but accurately predicting these flows could improve treatment operations. Data-driven modeling approaches could be used; however, high flow events that impact operation are an infrequent occurrence, providing limited data from which to learn meaningful patterns. The performance of a statistical model (logistic regression) and two machine learning (ML) models (support vector machine and random forest) were evaluated to predict high flow events one-day-ahead to two plants located in different parts of the United States, Northern Virginia and the Gulf Coast of Texas, with combined and separate sewers, respectively. We compared baseline models (no synthetic data added) to models trained with synthetic data added from two different sampling techniques (SMOTE and ADASYN) that increased the representation of rare events in the training data. Both techniques enhanced the sample size of the very high-flow class, but ADASYN, which focused on generating synthetic samples near decision boundaries, led to greater improvements in model performance (reduced misclassification rates). Random forest combined with ADASYN achieved the best overall performance for both plants, demonstrating its robustness in identifying one-day-ahead extreme flow events to treatment plants. These results suggest that combining sampling techniques with ML has the potential to significantly improve the modeling of high-flow events at treatment plants. Our work will prove useful in building reliable predictive models that can inform management decisions needed for the better control of treatment operations.

Duke Scholars

Published In

Journal of environmental management

DOI

EISSN

1095-8630

ISSN

0301-4797

Publication Date

September 2025

Volume

392

Start / End Page

126592

Related Subject Headings

  • Water Purification
  • Wastewater
  • Waste Disposal, Fluid
  • Virginia
  • Texas
  • Models, Theoretical
  • Machine Learning
  • Environmental Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Musaazi, I. G., Liu, L., Shaw, A., Zaniolo, M., Stadler, L. B., & Delgado Vela, J. (2025). Optimizing models for the prediction of one step ahead extreme flows to wastewater treatment plants using different synthetic sampling methods. Journal of Environmental Management, 392, 126592. https://doi.org/10.1016/j.jenvman.2025.126592
Musaazi, Isaac G., Lu Liu, Andrew Shaw, Marta Zaniolo, Lauren B. Stadler, and Jeseth Delgado Vela. “Optimizing models for the prediction of one step ahead extreme flows to wastewater treatment plants using different synthetic sampling methods.Journal of Environmental Management 392 (September 2025): 126592. https://doi.org/10.1016/j.jenvman.2025.126592.
Musaazi IG, Liu L, Shaw A, Zaniolo M, Stadler LB, Delgado Vela J. Optimizing models for the prediction of one step ahead extreme flows to wastewater treatment plants using different synthetic sampling methods. Journal of environmental management. 2025 Sep;392:126592.
Musaazi, Isaac G., et al. “Optimizing models for the prediction of one step ahead extreme flows to wastewater treatment plants using different synthetic sampling methods.Journal of Environmental Management, vol. 392, Sept. 2025, p. 126592. Epmc, doi:10.1016/j.jenvman.2025.126592.
Musaazi IG, Liu L, Shaw A, Zaniolo M, Stadler LB, Delgado Vela J. Optimizing models for the prediction of one step ahead extreme flows to wastewater treatment plants using different synthetic sampling methods. Journal of environmental management. 2025 Sep;392:126592.
Journal cover image

Published In

Journal of environmental management

DOI

EISSN

1095-8630

ISSN

0301-4797

Publication Date

September 2025

Volume

392

Start / End Page

126592

Related Subject Headings

  • Water Purification
  • Wastewater
  • Waste Disposal, Fluid
  • Virginia
  • Texas
  • Models, Theoretical
  • Machine Learning
  • Environmental Sciences