Exportar Publicação
A publicação pode ser exportada nos seguintes formatos: referência da APA (American Psychological Association), referência do IEEE (Institute of Electrical and Electronics Engineers), BibTeX e RIS.
Mariano, P., Almeida, S. M. & Santana, P. (N/A). On the automated learning of air pollution prediction models from data collected by mobile sensor networks. Energy Sources, Part A: Recovery, Utilization, and Environmental Effects. N/A, 1-17
P. L. Mariano et al., "On the automated learning of air pollution prediction models from data collected by mobile sensor networks", in Energy Sources, Part A: Recovery, Utilization, and Environmental Effects, vol. N/A, pp. 1-17, N/A
@article{marianoN/A_1732205748818, author = "Mariano, P. and Almeida, S. M. and Santana, P.", title = "On the automated learning of air pollution prediction models from data collected by mobile sensor networks", journal = "Energy Sources, Part A: Recovery, Utilization, and Environmental Effects", year = "N/A", volume = "N/A", number = "", doi = "10.1080/15567036.2021.1968076", pages = "1-17", url = "https://www.tandfonline.com/toc/ueso20/current" }
TY - JOUR TI - On the automated learning of air pollution prediction models from data collected by mobile sensor networks T2 - Energy Sources, Part A: Recovery, Utilization, and Environmental Effects VL - N/A AU - Mariano, P. AU - Almeida, S. M. AU - Santana, P. PY - N/A SP - 1-17 SN - 1556-7036 DO - 10.1080/15567036.2021.1968076 UR - https://www.tandfonline.com/toc/ueso20/current AB - This paper addresses the problem of automated learning of air pollution predictive models that were trained using information gathered by a set of mobile low-cost sensors. Concretely, fast to compute machine learning methods (Decision Trees and Support Vector Machines) were used to build regression models that predict air pollution levels for a given location. The models were trained using the data collected by the OpenSense project, in particular, number of particulate matter, particle diameter, and lung deposited surface area (LDSA). We examined two different sets of attributes: one based on a geographical description of the location under analysis (e.g. distribution of households and roads), and another based on a time series of past air pollution observations in that location. Overall, we have found out that past measures lead to better pollution predictions. The best R2 score was 0.751 obtained with the model that predicts LDSA and was trained with the data set with time series attributes, while the worst R2 was 0.009 obtained with the geographical data set to predict number of particles. The performance of the best model is on par with similar air pollution systems. Moreover it can be used in a production system that requires frequent updates. ER -