10th International Congress on Information and Communication Technology in concurrent with ICT Excellence Awards (ICICT 2025) will be held at London, United Kingdom | February 18 - 21 2025.
Authors - Larissa de Lima, Priscila Capriles, Nathan Oliveira Abstract - This paper explores the use of machine learning (ML) with various physical, chemical, and biological parameter combinations to predict water quality, focusing on the Water Quality Index (WQI). We assess the performance of several regression algorithms across five different data combinations and examine the impact of inference and class balancing techniques on model outcomes. Our analysis reveals that LightGBM achieved the highest accuracy in WQI regression at 93%. This research introduces a novel approach to calculatingWQI by automating the traditional manual and complex parameter collection and calculation process. By streamlining water quality monitoring, our ML-based method offers a more efficient and innovative solution. Additionally, the study provides practical insights into handling data scarcity and using statistical inference for skewed sampling distributions.