Human Factors

The emergence and spread of infectious diseases, like COVID-19, are on the rise. Can you identify patterns between population density and COVID-19 cases and identify factors that could help predict hotspots of disease spread?

AND - Artificial Neural Diagnostics

Summary

The project is an index to list the highest risk places and factors of COVID-19 contamination using artificial intelligence to measure the impact of human decisions on the spread of the disease. It will be possible to predict the hotspots and know which measures must be taken to reduce the illness spread.Therefor a neural network is applied to analyze the consequences of these pandemic factors. The database is taken from government organizations such as NASA.A future perspective is to expand this network, so that it can be applied at global level in order to prevent new pandemics.

How We Addressed This Challenge

The challenge was to identify patterns between human activity and COVID-19 cases, and to identify factors that can help to predict critical points in the spread of the disease. This has been accomplished. With the neural network created, it was possible to analyze the influence of human factors with the spread of the virus.

Once the critical factors for the spread of COVID-19 have been obtained and bearing in mind that data such as population density, infrastructure, CO2 emissions (to control the circulation of people) and the rate of adherence to social isolation are available from government organizations, as NASA, it was possible to predict the critical points.

For design issues, it was used data provided by IBGE (Brazilian Institute of Geography and Statistics) and websites of local state governments. However, this project aims to be expanded worldwide.

How We Developed This Project

The initial idea was to create an index to list places with higher risks of contagion of COVID-19 using a neural network, but during the process, we were also curious to know which neural network would have a better result (RBF or MLP).

To be able to compare the results of the networks it is necessary to use the same database in both and have a result for comparison that the two networks can generate.

The database we use was built by our team and contains Brazilian data, which we have already mentioned, but this database can be made anywhere in the world using similar metrics and / or adding others, like NASA data. In the case of comparison, we choose the terms "loss" and "r2" to compare the quality of the networks, where "loss" is the metric that measures the cost function (gradient) when calculating synaptic weights (the error ) and "r2" is the correlation coefficient, which determines how much the predicted model looks like the real one (ranges from 0 to 1, positive or negative)

The choice of data was made based on the availability of the data, here in Brazil we don't have data from so many cities, however, the data provided by NASA was not possible to be used due to the formats and how we should work with them, which would take a much longer time than expected in the hackathon just to learn to manipulate this available data. In order to test our idea and assemble the prototype, we chose Brazilian data because of the ease of manipulation.

The neural networks that have been programmed can be accessed through GitHub and the results are also available on the same link.

The results were not what we wanted but we understood that within the limitations that existed, they were acceptable. To improve the accuracy of the networks, it is necessary to make a database more complete than we have and perhaps put new entries in the neural networks (increasing the number of data types, creating a more complete and more accurate analysis).

For a better understanding of the networks used, the next paragraph is focused on their definition.

Radial Basis Function (RBF) is a linear neural network and has its supervised learning (where the network learns from the data set already defined and the inputs and outputs are known). In this type of architecture, there is only an intermediate layer, where the activation functions are of high dimensionality. RBF can be used as a base function for nonlinear regression models (linear or nonlinear) and can also be used as an activation function of any multilayer network type, such as multilayer perceptron (MLP). A Multi Layer Perceptron is a supervised algorithm that can be used for classification or regression. An MLP consists of a three layers: input layers, hidden layer and an output layer. It utilizes a supervised learning technique called backpropagation for training.

Tags
#artificialinteligence #fragilityindex #reducecovidspread #humansfactor #diseasecontrol #neuralnetwork #factorsweight #covid19 #predictimpactfactor
Global Judging
This project was submitted for consideration during the Space Apps Global Judging process.