Water-quality data imputation with a high percentage of missing values : A machine learning approach

Rodríguez Núñez, Rafael - Pastorini, Marcos - Etcheverry, Lorena - Chreties, Christian - Fossati, Mónica - Castro, Alberto - Gorgoglione, Angela

Resumen:

The monitoring of surface-water quality followed by water-quality modeling and analysis are essential for generating effective strategies in surface-water-resource management. However, worldwide, particularly in developing countries, water-quality studies are limited due to the lack of a complete and reliable dataset of surface-water-quality variables. In this context, several statistical and machine-learning models were assessed for imputing water-quality data at six monitoring stations located in the Santa Lucía Chico river (Uruguay), a mixed lotic and lentic river system. The challenge of this study is represented by the high percentage of missing data (between 50% and 70%) and the high temporal and spatial variability that characterizes the water-quality variables. The competing algorithms implement univariate and multivariate imputation methods (inverse distance weighting (IDW), Random Forest Regressor (RFR), Ridge (R), Bayesian Ridge (BR), AdaBoost (AB), Hubber Regressor (HR), Support Vector Regressor (SVR) and K-nearest neighbors Regressor (KNNR)). According to the results, more than 76% of the imputation outcomes are considered “satisfactory” (NSE > 0.45). The imputation performance shows better results at the monitoring stations located inside the reservoir than those positioned along the mainstream. IDW was the model with the best imputation results, followed by RFR, HR and SVR. The approach proposed in this study is expected to aid water-resource researchers and managers in augmenting water-quality datasets and overcoming the missing data issue to increase the number of future studies related to the water-quality matter.


Detalles Bibliográficos
2021
Data scarcity
Water quality
Missing data
Univariate imputation
Multivariate imputation
Machine learning
Hydroinformatics
Inglés
Universidad de la República
COLIBRI
https://hdl.handle.net/20.500.12008/28284
Acceso abierto
Licencia Creative Commons Atribución (CC - By 4.0)
_version_ 1807522897211686912
author Rodríguez Núñez, Rafael
author2 Pastorini, Marcos
Etcheverry, Lorena
Chreties, Christian
Fossati, Mónica
Castro, Alberto
Gorgoglione, Angela
author2_role author
author
author
author
author
author
author_facet Rodríguez Núñez, Rafael
Pastorini, Marcos
Etcheverry, Lorena
Chreties, Christian
Fossati, Mónica
Castro, Alberto
Gorgoglione, Angela
author_role author
bitstream.checksum.fl_str_mv 6429389a7df7277b72b7924fdc7d47a9
a0ebbeafb9d2ec7cbb19d7137ebc392c
d606c60c5d78967c4ed7a729e5bb402f
9fdbed07f52437945402c4e70fa4773e
429a60f739a547ddab3f2c05da908c56
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
MD5
bitstream.url.fl_str_mv http://localhost:8080/xmlui/bitstream/20.500.12008/28284/5/license.txt
http://localhost:8080/xmlui/bitstream/20.500.12008/28284/2/license_url
http://localhost:8080/xmlui/bitstream/20.500.12008/28284/3/license_text
http://localhost:8080/xmlui/bitstream/20.500.12008/28284/4/license_rdf
http://localhost:8080/xmlui/bitstream/20.500.12008/28284/1/RPECFCG21.pdf
collection COLIBRI
dc.contributor.filiacion.none.fl_str_mv Rodríguez Núñez Rafael, Universidad de la República (Uruguay). Facultad de Ingeniería.
Pastorini Marcos, Universidad de la República (Uruguay). Facultad de Ingeniería.
Etcheverry Lorena, Universidad de la República (Uruguay). Facultad de Ingeniería.
Chreties Christian, Universidad de la República (Uruguay). Facultad de Ingeniería.
Fossati Mónica, Universidad de la República (Uruguay). Facultad de Ingeniería.
Castro Alberto, Universidad de la República (Uruguay). Facultad de Ingeniería.
Gorgoglione Angela, Universidad de la República (Uruguay). Facultad de Ingeniería.
dc.coverage.spatial.es.fl_str_mv Río Santa Lucía Chico, Departamento de Florida, Uruguay.
dc.creator.none.fl_str_mv Rodríguez Núñez, Rafael
Pastorini, Marcos
Etcheverry, Lorena
Chreties, Christian
Fossati, Mónica
Castro, Alberto
Gorgoglione, Angela
dc.date.accessioned.none.fl_str_mv 2021-06-21T19:18:16Z
dc.date.available.none.fl_str_mv 2021-06-21T19:18:16Z
dc.date.issued.none.fl_str_mv 2021
dc.description.abstract.none.fl_txt_mv The monitoring of surface-water quality followed by water-quality modeling and analysis are essential for generating effective strategies in surface-water-resource management. However, worldwide, particularly in developing countries, water-quality studies are limited due to the lack of a complete and reliable dataset of surface-water-quality variables. In this context, several statistical and machine-learning models were assessed for imputing water-quality data at six monitoring stations located in the Santa Lucía Chico river (Uruguay), a mixed lotic and lentic river system. The challenge of this study is represented by the high percentage of missing data (between 50% and 70%) and the high temporal and spatial variability that characterizes the water-quality variables. The competing algorithms implement univariate and multivariate imputation methods (inverse distance weighting (IDW), Random Forest Regressor (RFR), Ridge (R), Bayesian Ridge (BR), AdaBoost (AB), Hubber Regressor (HR), Support Vector Regressor (SVR) and K-nearest neighbors Regressor (KNNR)). According to the results, more than 76% of the imputation outcomes are considered “satisfactory” (NSE > 0.45). The imputation performance shows better results at the monitoring stations located inside the reservoir than those positioned along the mainstream. IDW was the model with the best imputation results, followed by RFR, HR and SVR. The approach proposed in this study is expected to aid water-resource researchers and managers in augmenting water-quality datasets and overcoming the missing data issue to increase the number of future studies related to the water-quality matter.
dc.description.es.fl_txt_mv Publicación producida a partir de un Proyecto financiado por la ANII
dc.format.extent.es.fl_str_mv 17 p.
dc.format.mimetype.es.fl_str_mv application/pdf
dc.identifier.citation.es.fl_str_mv Rodríguez Núñez, R., Pastorini, M., Etcheverry, L. y otros. "Water-quality data imputation with a high percentage of missing values : A machine learning approach". Sustainability. [en línea]. 2021 vol. 13, no 11, pp. 1-17. DOI: 10.3390/su13116318
dc.identifier.doi.none.fl_str_mv 10.3390/su13116318
dc.identifier.issn.none.fl_str_mv 2071-1050
dc.identifier.uri.none.fl_str_mv https://hdl.handle.net/20.500.12008/28284
dc.language.iso.none.fl_str_mv en
eng
dc.publisher.es.fl_str_mv MDPI
dc.relation.ispartof.es.fl_str_mv Sustainability, vol. 13, no 11, pp. 1-17, jun 2021
dc.rights.license.none.fl_str_mv Licencia Creative Commons Atribución (CC - By 4.0)
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
dc.source.none.fl_str_mv reponame:COLIBRI
instname:Universidad de la República
instacron:Universidad de la República
dc.subject.en.fl_str_mv Data scarcity
Water quality
Missing data
Univariate imputation
Multivariate imputation
Machine learning
Hydroinformatics
dc.title.none.fl_str_mv Water-quality data imputation with a high percentage of missing values : A machine learning approach
dc.type.es.fl_str_mv Artículo
dc.type.none.fl_str_mv info:eu-repo/semantics/article
dc.type.version.none.fl_str_mv info:eu-repo/semantics/publishedVersion
description Publicación producida a partir de un Proyecto financiado por la ANII
eu_rights_str_mv openAccess
format article
id COLIBRI_b929b0aea0b423f671db9a9ad703deca
identifier_str_mv Rodríguez Núñez, R., Pastorini, M., Etcheverry, L. y otros. "Water-quality data imputation with a high percentage of missing values : A machine learning approach". Sustainability. [en línea]. 2021 vol. 13, no 11, pp. 1-17. DOI: 10.3390/su13116318
2071-1050
10.3390/su13116318
instacron_str Universidad de la República
institution Universidad de la República
instname_str Universidad de la República
language eng
language_invalid_str_mv en
network_acronym_str COLIBRI
network_name_str COLIBRI
oai_identifier_str oai:colibri.udelar.edu.uy:20.500.12008/28284
publishDate 2021
reponame_str COLIBRI
repository.mail.fl_str_mv mabel.seroubian@seciu.edu.uy
repository.name.fl_str_mv COLIBRI - Universidad de la República
repository_id_str 4771
rights_invalid_str_mv Licencia Creative Commons Atribución (CC - By 4.0)
spelling Rodríguez Núñez Rafael, Universidad de la República (Uruguay). Facultad de Ingeniería.Pastorini Marcos, Universidad de la República (Uruguay). Facultad de Ingeniería.Etcheverry Lorena, Universidad de la República (Uruguay). Facultad de Ingeniería.Chreties Christian, Universidad de la República (Uruguay). Facultad de Ingeniería.Fossati Mónica, Universidad de la República (Uruguay). Facultad de Ingeniería.Castro Alberto, Universidad de la República (Uruguay). Facultad de Ingeniería.Gorgoglione Angela, Universidad de la República (Uruguay). Facultad de Ingeniería.Río Santa Lucía Chico, Departamento de Florida, Uruguay.2021-06-21T19:18:16Z2021-06-21T19:18:16Z2021Rodríguez Núñez, R., Pastorini, M., Etcheverry, L. y otros. "Water-quality data imputation with a high percentage of missing values : A machine learning approach". Sustainability. [en línea]. 2021 vol. 13, no 11, pp. 1-17. DOI: 10.3390/su131163182071-1050https://hdl.handle.net/20.500.12008/2828410.3390/su13116318Publicación producida a partir de un Proyecto financiado por la ANIIThe monitoring of surface-water quality followed by water-quality modeling and analysis are essential for generating effective strategies in surface-water-resource management. However, worldwide, particularly in developing countries, water-quality studies are limited due to the lack of a complete and reliable dataset of surface-water-quality variables. In this context, several statistical and machine-learning models were assessed for imputing water-quality data at six monitoring stations located in the Santa Lucía Chico river (Uruguay), a mixed lotic and lentic river system. The challenge of this study is represented by the high percentage of missing data (between 50% and 70%) and the high temporal and spatial variability that characterizes the water-quality variables. The competing algorithms implement univariate and multivariate imputation methods (inverse distance weighting (IDW), Random Forest Regressor (RFR), Ridge (R), Bayesian Ridge (BR), AdaBoost (AB), Hubber Regressor (HR), Support Vector Regressor (SVR) and K-nearest neighbors Regressor (KNNR)). According to the results, more than 76% of the imputation outcomes are considered “satisfactory” (NSE > 0.45). The imputation performance shows better results at the monitoring stations located inside the reservoir than those positioned along the mainstream. IDW was the model with the best imputation results, followed by RFR, HR and SVR. The approach proposed in this study is expected to aid water-resource researchers and managers in augmenting water-quality datasets and overcoming the missing data issue to increase the number of future studies related to the water-quality matter.Submitted by Ribeiro Jorge (jribeiro@fing.edu.uy) on 2021-06-21T17:06:34Z No. of bitstreams: 2 license_rdf: 19875 bytes, checksum: 9fdbed07f52437945402c4e70fa4773e (MD5) RPECFCG21.pdf: 3976341 bytes, checksum: 429a60f739a547ddab3f2c05da908c56 (MD5)Approved for entry into archive by Machado Jimena (jmachado@fing.edu.uy) on 2021-06-21T18:58:04Z (GMT) No. of bitstreams: 2 license_rdf: 19875 bytes, checksum: 9fdbed07f52437945402c4e70fa4773e (MD5) RPECFCG21.pdf: 3976341 bytes, checksum: 429a60f739a547ddab3f2c05da908c56 (MD5)Made available in DSpace by Luna Fabiana (fabiana.luna@seciu.edu.uy) on 2021-06-21T19:18:16Z (GMT). No. of bitstreams: 2 license_rdf: 19875 bytes, checksum: 9fdbed07f52437945402c4e70fa4773e (MD5) RPECFCG21.pdf: 3976341 bytes, checksum: 429a60f739a547ddab3f2c05da908c56 (MD5) Previous issue date: 202117 p.application/pdfenengMDPISustainability, vol. 13, no 11, pp. 1-17, jun 2021Las obras depositadas en el Repositorio se rigen por la Ordenanza de los Derechos de la Propiedad Intelectual de la Universidad de la República.(Res. Nº 91 de C.D.C. de 8/III/1994 – D.O. 7/IV/1994) y por la Ordenanza del Repositorio Abierto de la Universidad de la República (Res. Nº 16 de C.D.C. de 07/10/2014)info:eu-repo/semantics/openAccessLicencia Creative Commons Atribución (CC - By 4.0)Data scarcityWater qualityMissing dataUnivariate imputationMultivariate imputationMachine learningHydroinformaticsWater-quality data imputation with a high percentage of missing values : A machine learning approachArtículoinfo:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionreponame:COLIBRIinstname:Universidad de la Repúblicainstacron:Universidad de la RepúblicaRodríguez Núñez, RafaelPastorini, MarcosEtcheverry, LorenaChreties, ChristianFossati, MónicaCastro, AlbertoGorgoglione, AngelaLICENSElicense.txtlicense.txttext/plain; charset=utf-84267http://localhost:8080/xmlui/bitstream/20.500.12008/28284/5/license.txt6429389a7df7277b72b7924fdc7d47a9MD55CC-LICENSElicense_urllicense_urltext/plain; charset=utf-844http://localhost:8080/xmlui/bitstream/20.500.12008/28284/2/license_urla0ebbeafb9d2ec7cbb19d7137ebc392cMD52license_textlicense_texttext/html; charset=utf-838395http://localhost:8080/xmlui/bitstream/20.500.12008/28284/3/license_textd606c60c5d78967c4ed7a729e5bb402fMD53license_rdflicense_rdfapplication/rdf+xml; charset=utf-819875http://localhost:8080/xmlui/bitstream/20.500.12008/28284/4/license_rdf9fdbed07f52437945402c4e70fa4773eMD54ORIGINALRPECFCG21.pdfRPECFCG21.pdfapplication/pdf3976341http://localhost:8080/xmlui/bitstream/20.500.12008/28284/1/RPECFCG21.pdf429a60f739a547ddab3f2c05da908c56MD5120.500.12008/282842021-11-24 12:27:51.483oai:colibri.udelar.edu.uy:20.500.12008/28284VGVybWlub3MgeSBjb25kaWNpb25lcyByZWxhdGl2YXMgYWwgZGVwb3NpdG8gZGUgb2JyYXMKCgpMYXMgb2JyYXMgZGVwb3NpdGFkYXMgZW4gZWwgUmVwb3NpdG9yaW8gc2UgcmlnZW4gcG9yIGxhIE9yZGVuYW56YSBkZSBsb3MgRGVyZWNob3MgZGUgbGEgUHJvcGllZGFkIEludGVsZWN0dWFsICBkZSBsYSBVbml2ZXJzaWRhZCBEZSBMYSBSZXDDumJsaWNhLiAoUmVzLiBOwrogOTEgZGUgQy5ELkMuIGRlIDgvSUlJLzE5OTQg4oCTIEQuTy4gNy9JVi8xOTk0KSB5ICBwb3IgbGEgT3JkZW5hbnphIGRlbCBSZXBvc2l0b3JpbyBBYmllcnRvIGRlIGxhIFVuaXZlcnNpZGFkIGRlIGxhIFJlcMO6YmxpY2EgKFJlcy4gTsK6IDE2IGRlIEMuRC5DLiBkZSAwNy8xMC8yMDE0KS4gCgpBY2VwdGFuZG8gZWwgYXV0b3IgZXN0b3MgdMOpcm1pbm9zIHkgY29uZGljaW9uZXMgZGUgZGVww7NzaXRvIGVuIENPTElCUkksIGxhIFVuaXZlcnNpZGFkIGRlIFJlcMO6YmxpY2EgcHJvY2VkZXLDoSBhOiAgCgphKSBhcmNoaXZhciBtw6FzIGRlIHVuYSBjb3BpYSBkZSBsYSBvYnJhIGVuIGxvcyBzZXJ2aWRvcmVzIGRlIGxhIFVuaXZlcnNpZGFkIGEgbG9zIGVmZWN0b3MgZGUgZ2FyYW50aXphciBhY2Nlc28sIHNlZ3VyaWRhZCB5IHByZXNlcnZhY2nDs24KYikgY29udmVydGlyIGxhIG9icmEgYSBvdHJvcyBmb3JtYXRvcyBzaSBmdWVyYSBuZWNlc2FyaW8gIHBhcmEgZmFjaWxpdGFyIHN1IHByZXNlcnZhY2nDs24geSBhY2Nlc2liaWxpZGFkIHNpbiBhbHRlcmFyIHN1IGNvbnRlbmlkby4KYykgcmVhbGl6YXIgbGEgY29tdW5pY2FjacOzbiBww7pibGljYSB5IGRpc3BvbmVyIGVsIGFjY2VzbyBsaWJyZSB5IGdyYXR1aXRvIGEgdHJhdsOpcyBkZSBJbnRlcm5ldCBtZWRpYW50ZSBsYSBwdWJsaWNhY2nDs24gZGUgbGEgb2JyYSBiYWpvIGxhIGxpY2VuY2lhIENyZWF0aXZlIENvbW1vbnMgc2VsZWNjaW9uYWRhIHBvciBlbCBwcm9waW8gYXV0b3IuCgoKRW4gY2FzbyBxdWUgZWwgYXV0b3IgaGF5YSBkaWZ1bmRpZG8geSBkYWRvIGEgcHVibGljaWRhZCBhIGxhIG9icmEgZW4gZm9ybWEgcHJldmlhLCAgcG9kcsOhIHNvbGljaXRhciB1biBwZXLDrW9kbyBkZSBlbWJhcmdvIHNvYnJlIGxhIGRpc3BvbmliaWxpZGFkIHDDumJsaWNhIGRlIGxhIG1pc21hLCBlbCBjdWFsIGNvbWVuemFyw6EgYSBwYXJ0aXIgZGUgbGEgYWNlcHRhY2nDs24gZGUgZXN0ZSBkb2N1bWVudG8geSBoYXN0YSBsYSBmZWNoYSBxdWUgaW5kaXF1ZSAuCgpFbCBhdXRvciBhc2VndXJhIHF1ZSBsYSBvYnJhIG5vIGluZnJpZ2UgbmluZ8O6biBkZXJlY2hvIHNvYnJlIHRlcmNlcm9zLCB5YSBzZWEgZGUgcHJvcGllZGFkIGludGVsZWN0dWFsIG8gY3VhbHF1aWVyIG90cm8uCgpFbCBhdXRvciBnYXJhbnRpemEgcXVlIHNpIGVsIGRvY3VtZW50byBjb250aWVuZSBtYXRlcmlhbGVzIGRlIGxvcyBjdWFsZXMgbm8gdGllbmUgbG9zIGRlcmVjaG9zIGRlIGF1dG9yLCAgaGEgb2J0ZW5pZG8gZWwgcGVybWlzbyBkZWwgcHJvcGlldGFyaW8gZGUgbG9zIGRlcmVjaG9zIGRlIGF1dG9yLCB5IHF1ZSBlc2UgbWF0ZXJpYWwgY3V5b3MgZGVyZWNob3Mgc29uIGRlIHRlcmNlcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIHkgcmVjb25vY2lkbyBlbiBlbCB0ZXh0byBvIGNvbnRlbmlkbyBkZWwgZG9jdW1lbnRvIGRlcG9zaXRhZG8gZW4gZWwgUmVwb3NpdG9yaW8uCgpFbiBvYnJhcyBkZSBhdXRvcsOtYSBtw7psdGlwbGUgL3NlIHByZXN1bWUvIHF1ZSBlbCBhdXRvciBkZXBvc2l0YW50ZSBkZWNsYXJhIHF1ZSBoYSByZWNhYmFkbyBlbCBjb25zZW50aW1pZW50byBkZSB0b2RvcyBsb3MgYXV0b3JlcyBwYXJhIHB1YmxpY2FybGEgZW4gZWwgUmVwb3NpdG9yaW8sIHNpZW5kbyDDqXN0ZSBlbCDDum5pY28gcmVzcG9uc2FibGUgZnJlbnRlIGEgY3VhbHF1aWVyIHRpcG8gZGUgcmVjbGFtYWNpw7NuIGRlIGxvcyBvdHJvcyBjb2F1dG9yZXMuCgpFbCBhdXRvciBzZXLDoSByZXNwb25zYWJsZSBkZWwgY29udGVuaWRvIGRlIGxvcyBkb2N1bWVudG9zIHF1ZSBkZXBvc2l0YS4gTGEgVURFTEFSIG5vIHNlcsOhIHJlc3BvbnNhYmxlIHBvciBsYXMgZXZlbnR1YWxlcyB2aW9sYWNpb25lcyBhbCBkZXJlY2hvIGRlIHByb3BpZWRhZCBpbnRlbGVjdHVhbCBlbiBxdWUgcHVlZGEgaW5jdXJyaXIgZWwgYXV0b3IuCgpBbnRlIGN1YWxxdWllciBkZW51bmNpYSBkZSB2aW9sYWNpw7NuIGRlIGRlcmVjaG9zIGRlIHByb3BpZWRhZCBpbnRlbGVjdHVhbCwgbGEgVURFTEFSICBhZG9wdGFyw6EgdG9kYXMgbGFzIG1lZGlkYXMgbmVjZXNhcmlhcyBwYXJhIGV2aXRhciBsYSBjb250aW51YWNpw7NuIGRlIGRpY2hhIGluZnJhY2Npw7NuLCBsYXMgcXVlIHBvZHLDoW4gaW5jbHVpciBlbCByZXRpcm8gZGVsIGFjY2VzbyBhIGxvcyBjb250ZW5pZG9zIHkvbyBtZXRhZGF0b3MgZGVsIGRvY3VtZW50byByZXNwZWN0aXZvLgoKTGEgb2JyYSBzZSBwb25kcsOhIGEgZGlzcG9zaWNpw7NuIGRlbCBww7pibGljbyBhIHRyYXbDqXMgZGUgbGFzIGxpY2VuY2lhcyBDcmVhdGl2ZSBDb21tb25zLCBlbCBhdXRvciBwb2Ryw6Egc2VsZWNjaW9uYXIgdW5hIGRlIGxhcyA2IGxpY2VuY2lhcyBkaXNwb25pYmxlczoKCgpBdHJpYnVjacOzbiAoQ0MgLSBCeSk6IFBlcm1pdGUgdXNhciBsYSBvYnJhIHkgZ2VuZXJhciBvYnJhcyBkZXJpdmFkYXMsIGluY2x1c28gY29uIGZpbmVzIGNvbWVyY2lhbGVzLCBzaWVtcHJlIHF1ZSBzZSByZWNvbm96Y2EgYWwgYXV0b3IuCgpBdHJpYnVjacOzbiDigJMgQ29tcGFydGlyIElndWFsIChDQyAtIEJ5LVNBKTogUGVybWl0ZSB1c2FyIGxhIG9icmEgeSBnZW5lcmFyIG9icmFzIGRlcml2YWRhcywgaW5jbHVzbyBjb24gZmluZXMgY29tZXJjaWFsZXMsIHBlcm8gbGEgZGlzdHJpYnVjacOzbiBkZSBsYXMgb2JyYXMgZGVyaXZhZGFzIGRlYmUgaGFjZXJzZSBtZWRpYW50ZSB1bmEgbGljZW5jaWEgaWTDqW50aWNhIGEgbGEgZGUgbGEgb2JyYSBvcmlnaW5hbCwgcmVjb25vY2llbmRvIGEgbG9zIGF1dG9yZXMuCgpBdHJpYnVjacOzbiDigJMgTm8gQ29tZXJjaWFsIChDQyAtIEJ5LU5DKTogUGVybWl0ZSB1c2FyIGxhIG9icmEgeSBnZW5lcmFyIG9icmFzIGRlcml2YWRhcywgc2llbXByZSB5IGN1YW5kbyBlc29zIHVzb3Mgbm8gdGVuZ2FuIGZpbmVzIGNvbWVyY2lhbGVzLCByZWNvbm9jaWVuZG8gYWwgYXV0b3IuCgpBdHJpYnVjacOzbiDigJMgU2luIERlcml2YWRhcyAoQ0MgLSBCeS1ORCk6IFBlcm1pdGUgZWwgdXNvIGRlIGxhIG9icmEsIGluY2x1c28gY29uIGZpbmVzIGNvbWVyY2lhbGVzLCBwZXJvIG5vIHNlIHBlcm1pdGUgZ2VuZXJhciBvYnJhcyBkZXJpdmFkYXMsIGRlYmllbmRvIHJlY29ub2NlciBhbCBhdXRvci4KCkF0cmlidWNpw7NuIOKAkyBObyBDb21lcmNpYWwg4oCTIENvbXBhcnRpciBJZ3VhbCAoQ0Mg4oCTIEJ5LU5DLVNBKTogUGVybWl0ZSB1c2FyIGxhIG9icmEgeSBnZW5lcmFyIG9icmFzIGRlcml2YWRhcywgc2llbXByZSB5IGN1YW5kbyBlc29zIHVzb3Mgbm8gdGVuZ2FuIGZpbmVzIGNvbWVyY2lhbGVzIHkgbGEgZGlzdHJpYnVjacOzbiBkZSBsYXMgb2JyYXMgZGVyaXZhZGFzIHNlIGhhZ2EgbWVkaWFudGUgbGljZW5jaWEgaWTDqW50aWNhIGEgbGEgZGUgbGEgb2JyYSBvcmlnaW5hbCwgcmVjb25vY2llbmRvIGEgbG9zIGF1dG9yZXMuCgpBdHJpYnVjacOzbiDigJMgTm8gQ29tZXJjaWFsIOKAkyBTaW4gRGVyaXZhZGFzIChDQyAtIEJ5LU5DLU5EKTogUGVybWl0ZSB1c2FyIGxhIG9icmEsIHBlcm8gbm8gc2UgcGVybWl0ZSBnZW5lcmFyIG9icmFzIGRlcml2YWRhcyB5IG5vIHNlIHBlcm1pdGUgdXNvIGNvbiBmaW5lcyBjb21lcmNpYWxlcywgZGViaWVuZG8gcmVjb25vY2VyIGFsIGF1dG9yLgoKTG9zIHVzb3MgcHJldmlzdG9zIGVuIGxhcyBsaWNlbmNpYXMgaW5jbHV5ZW4gbGEgZW5hamVuYWNpw7NuLCByZXByb2R1Y2Npw7NuLCBjb211bmljYWNpw7NuLCBwdWJsaWNhY2nDs24sIGRpc3RyaWJ1Y2nDs24geSBwdWVzdGEgYSBkaXNwb3NpY2nDs24gZGVsIHDDumJsaWNvLiBMYSBjcmVhY2nDs24gZGUgb2JyYXMgZGVyaXZhZGFzIGluY2x1eWUgbGEgYWRhcHRhY2nDs24sIHRyYWR1Y2Npw7NuIHkgZWwgcmVtaXguCgpDdWFuZG8gc2Ugc2VsZWNjaW9uZSB1bmEgbGljZW5jaWEgcXVlIGhhYmlsaXRlIHVzb3MgY29tZXJjaWFsZXMsIGVsIGRlcMOzc2l0byBkZWJlcsOhIHNlciBhY29tcGHDsWFkbyBkZWwgYXZhbCBkZWwgamVyYXJjYSBtw6F4aW1vIGRlbCBTZXJ2aWNpbyBjb3JyZXNwb25kaWVudGUuCg==Universidadhttps://udelar.edu.uy/https://www.colibri.udelar.edu.uy/oai/requestmabel.seroubian@seciu.edu.uyUruguayopendoar:47712024-07-25T14:33:11.491242COLIBRI - Universidad de la Repúblicafalse
spellingShingle Water-quality data imputation with a high percentage of missing values : A machine learning approach
Rodríguez Núñez, Rafael
Data scarcity
Water quality
Missing data
Univariate imputation
Multivariate imputation
Machine learning
Hydroinformatics
status_str publishedVersion
title Water-quality data imputation with a high percentage of missing values : A machine learning approach
title_full Water-quality data imputation with a high percentage of missing values : A machine learning approach
title_fullStr Water-quality data imputation with a high percentage of missing values : A machine learning approach
title_full_unstemmed Water-quality data imputation with a high percentage of missing values : A machine learning approach
title_short Water-quality data imputation with a high percentage of missing values : A machine learning approach
title_sort Water-quality data imputation with a high percentage of missing values : A machine learning approach
topic Data scarcity
Water quality
Missing data
Univariate imputation
Multivariate imputation
Machine learning
Hydroinformatics
url https://hdl.handle.net/20.500.12008/28284