11 minute read
3.2 DATA USED IN THE STUDY
from 105547
The present study uses primary data collected in an online questionnaire applied between April 7 and May 11, 2020. The questionnaire has 15 questions that aim to verify how the COVID-19 pandemic has affected people in the city of Rio de Janeiro. The data obtained address issues such as income, employment, activities that were affected by the start of the pandemic, people’s feelings about the pandemic, among others. Additionally, some geographical regions of the city will be used, particularly the planning regions, available at the public geographic database of the city hall (Instituto Pereira Passos – IPP). These data have been geoenriched with the ArcGIS demographic data from the year 2018. Of the variables present in the database, the number of men and women, age, and population density for each planning region in the city will be used.
Concerning the COVID-19 data, the public bases for mapping the pandemic will be used, especially the data provided by the city hall of Rio de Janeiro, considering that in this data information is included by the planning regions used in the present study. The database fed by the Ministério da Saúde (Ministry of Health), for example, does not reach the level of detail necessary to carry out the planned analyzes
Advertisement
Finally, to verify how the questions related to the questionnaire were searched over time, the Google Trends database will be used, as well as the official public databases related to the pandemic in order to situate it in time and space in the analysis performed. In the table below (Figure 11), it is possible to verify the data and sources used for the present study’s analyses.
Data Source
Answers to the questionnaire
Rio de Janeiro’s Planning
Regions Geoenriched
City Hall’s COVID-19
Dashboard
ArcGIS Survey123
(https://survey123.arcgis.com/surveys/a9ade036619c4f04b29d0e33b8849dfd/overview)
Data Rio – Instituto Pereira Passos
(https://www.data.rio/datasets/3033f988051a4ce38a396202bc205a2e_0)
Painel Rio COVID-19
(https://experience.arcgis.com/experience/38efc69787a346959c931568bd9e2cc4)
Google Trends Package gtrends in R Figure 11. Data Source
3.3 FLUXOGRAM
The methodology developed in the present study aimed to be implemented quickly due to the pandemic propagation speed. Thus, it was necessary to find tools that could be integrated easily, without great development effort and the need for programming experts. Since the beginning, as it was conceived to be a collaborative tool, it was fundamental that the adjustments between the first and second rounds could also be easily done.
In Figure 12, it is possible to verify the development of the steps of the methodology proposed in the present investigation. It can be assessed that the survey was developed to accept suggestions after a first round The definition of the study area occurred in the second round of application.
3.4 METHODOLOGY DESCRIPTION
3.4.1 AN OVERVIEW
The present study had as its initial problem the development of a collaborative mapping from an online survey, addressing issues with a quantitative and qualitative focus, related to COVID-19 within a GIS context, something that was not observed in the initial months of the pandemic. Thus, it is possible to interpret certain impacts related to the pandemic beyond the advance of the number of cases and deaths related to COVID-19 in space and time.
A first version of the mapping was carried out without defining a specific study area, testing the tool’s functionality, and collecting feedback from respondents, having been disseminated on the main social media platforms, such as WhatsApp, LinkedIn, Instagram, and Facebook.
3.4.2 ONLINE SURVEY ELABORATION
The primary data used in this research were obtained through independent and collaborative mapping. The steps in obtaining these data are described below.
As aforementioned, several platforms were monitoring the evolution of the pandemic in space and time as soon as the WHO declared it. However, there was no follow-up research focused on the impacts to people's daily lives. In this context, an online survey was developed on Survey123 Connect for ArcGIS, with questions structured in different possible answer formats, such as single choice or multiple-choice answers. An inserted question asked for the interviewee's location, which then generated a coordinate (x, y).
For later integration with a Dashboard for ArcGIS, a decision was made to develop the questionnaire in XLSForm, as explained in item 2.6.1, because it is a more sophisticated way to develop the questionnaire in Survey123. The main question that guided the questionnaire was: "How has COVID-19 affected you?", followed by the questions elaborated in XLSForm. Thus, some questions were asked about how the pandemic was affecting people's lives. In Figure 13a below, it is possible to view the survey creation environment in Survey123 Connect, as well as the XLSForm in which the questions were developed (Figure 13b).
The questionnaire was incorporated in a link into a Story Map, which provided some information about the pandemic (Silva, 2020a). The WHO dashboard related to the pandemic was embedded in the Story Map as a source of information about cases and deaths of COVID-19 worldwide.
At the end of the Story Map, a dashboard was presented, and it was being fed by the answered questionnaires in real-time. As previously mentioned, to be able to integrate the dashboard with the survey data, it was necessary to develop it in XLSForm, as two questions were multiple-choice, in which the interviewee could answer more than one option. Thus, if the questionnaire was created in Survey123 using the web designer, these fields would become text fields (esriFieldTypeString), making integration with the dashboard impossible without manipulating the data. Thus, in order not to require manual data update, the XLSForm was created. The dashboard created (Figure 15) addressed the questions inserted in the questionnaire, and the variables had filters between them, which could make the visualization more interesting for those who were manipulating the data.
The management of data obtained in the survey, story map, and the dashboard was carried out in ArcGIS Online.
As explained, the first round of the survey did not have a specific area of interest and was disclosed through social media. With the first data obtained, the initiative was described in an article published on a geotechnology portal (Silva, 2020b).
From the beginning, the research was open to contributions. Thus, after the first round and disclosure, feedback was given by them to improve some questions. There was also reinforcement in the dissemination part through a research group from Veiga de Almeida University (UVA), members of the Geospatial Studies Laboratory (LEGO) in Rio de Janeiro.
In this way, after some specific research adjustments, a new round was made, focusing on data collection in Rio de Janeiro. The research was then applied between April 07 and May 11, 2020. The 15 questions asked and the types of each question are shown in Figure 16 below. All questions were mandatory.
How has COVID-19 affected you?
Type (The objective of this survey is to map how the coronavirus has affected people's lives, what are the impacts and the needs that the population has felt)
How old are you?
Up to 25 years
Between 25 - 35 years
Between 35 - 45 years
Gender?
Female
Male
Between 45 - 60 years
More than 60 years
I prefer not to inform
How many people with whom you have had personal contact have contracted the coronavirus (including you)?
These may be unconfirmed suspected cases
I didn't get the disease and I don't know anyone who got it
One person
Two people
Three or more people
Are there deaths related to the disease in your personal circle?
These may be unconfirmed suspected cases
There are not
Yes, one person
Yes, two people
Do you find yourself in quarantine (it may be self-isolation)?
If so, since when?
What
Up to 1 minimum wage More than 10 minimum wages 1 to 5 minimum wages
Did the consequences of the pandemic impact you financially? If so, how much?
Do you contribute financially to services you used before the pandemic that you are no longer using? Domestic services (cleaning, gardening, others), gym, schools, therapies, etc.
Single choice from drop-down list
Single choice
Single choice
Single choice
Single choice
Would you say the pandemic has put you in a delicate financial position?
From the following items, select the ones which were hampered after the onset of the disease.
3.4.3 TREATMENT OF DATA OBTAINED
The primary data obtained in the online survey are already spatialized. However, it is necessary to verify the data obtained both exclusively in the municipality of Rio de Janeiro and in the period of interest between April 7 and May 11, 2020. It is essential to clarify that, from the beginning, it was not intended to constitute a representative sample of the city of Rio de Janeiro nor to impose a geographical limitation, which results in this first analysis and filtering of the data obtained.
These data will be segmented according to the geographic division of the city's planning areas, which is information provided by the city hall of Rio de Janeiro. Thus, some analyzes and mappings will be carried out with the results obtained in the research.
Besides, it is necessary to add some demographic and socioeconomic data to this segmentation (Urban & Nakada, 2020) and the official data related to COVID-19 to verify the adherence between the data obtained in the mapping and the variables related to these data observed in reality. For this analysis, mapping and a scatterplot matrix will be built in ArcGIS Pro in order to evaluate the relationships between variables.
As previously explained, the data used with the demographic information of the Rio de Janeiro population will be obtained on the geospatial basis of the 16 planning regions of the city that were geoenriched with data from 2018.
Finally, some qualitative questions in the questionnaire are not addressed by official data related to the pandemic and in another GIS visualization. Thus, this information can be verified through the interest of people over time, observed in the search for certain subjects on Google, via Google Trends (Biasi, 2020), seeking the gradual evolution of words related to mental health after the arrival of the pandemic For example, it can be a way to assess which feelings were arisen in the population due to COVID-19.
4 Results And Discussion
4.1 GENERAL RESULTS
The research's general results present the quantitative data obtained from responses to Survey123 after the definition of the application area in the city of Rio de Janeiro and dissemination in the main social media. It can also be seen how these answers were obtained according to the city’s planning regions and the relation between these variables and some demographic data of the city.
As a result of applying the survey, 250 responses were obtained for Rio de Janeiro’s municipality in the target period of application, between April 7 and May 11, 2020 (Figure 17). On the map below, it is possible to check the interviewees’ points regarding their location and the limits of the city’s planning regions. These locations were informed by the interviewees in the Survey123.
The region in which the highest number of responses were obtained was region 2.2 (Tijuca) with 65 interactions, followed by region 2.1 (Zona Sul), with 37 interactions, and region 1.1 (Centro), with 35 interactions. Region 5.4 (Guaratiba) was the only planning region in which no answers were collected. All answers compiled in the survey are consolidated by the planning region in the appendix section of this work.
Among these responses, 73.20% were given by women, with the remaining 26.80% being answered by men. The proportion found in the city, considering the geoenriched data obtained in the city hall website, is about 47% of men and 53% of women. Among the interviewees' age, groups up to 25 years (31%) and between 45 and 60 years (32%) stood out (Figure 18).
Up to 25 years
Between 25 - 35 years
Between 35 - 45 years
Between 45 - 60 years
More than 60 years
The city hall's enriched demographic data present a proportion of around 20% for each referenced age group, concentrating about 22% of the population between 15 and 29 years old and 23% in the range between 30 and 44 years old, being these groups a higher population proportion (Table 1)
It is a little challenging to compare the data obtained in Survey123 with the city hall's information, given that the adopted age groups are different. However, in the official database, it is possible to affirm that the ranges had more balanced figures considering that it was a representative sample, and significantly different percentages between the ranges can be perceived when comparing to the survey data Additionally, it is possible to verify that the two age groups that correspond (45-59/60 and 60+) have quite different percentages between what was found in the research and reality
Continuing the analysis of the data obtained in the survey, the following map (Figure 19) was built based on two pieces of information: population density in each planning region, based on the city hall’s data, and the number of responses obtained in the online questionnaire.
The analysis performed considers whether the mapped variables have a positive relationship with each other, with low-low, medium-medium, or high-high values in each one, or if the relationships between the variables have different results. This approach will be used in some of the mappings in the sequence.
Some regions have a positive relationship between the variables used, such as regions 5.3 (Santa Cruz), 5.2 (Campo Grande), and 4.2 (Barra da Tijuca) in which there are low values in both variables; in regions 5.1 (Bangu) and 4.1 (Jacarepaguá) the values are intermediate; in 2.1 (Zona Sul) they have high values. On the other hand, there are cases of high values of population density and few responses, as in 3.1 (Ramos), 3.6 (Pavuna), and 3.4 (Inhaúma), as well as a fact verified in 3.7 (Ilha do Governador), with intermediate population density and few responses. In 3.2 (Méier), 3.3 (Madureira), and 3.5 (Penha), there are intermediate values of responses, being places of high population density. Finally, in 2.2 (Tijuca) and 1.1 (Centro), there is a high incidence of answers to the questionnaire, being places of intermediate population density. The region with no answers in the questionnaire, 5.4 (Guraratiba), has the lowest population density in the city.
4.2 MAPPING DATA RELATED TO COVID-19
The data related to the official record of cases will be used to build a map that relates this information to the known cases reported by the interviewees in the survey. It is noteworthy that the official data regarding the number of COVID-19 cases in Rio de Janeiro were the cumulative data recorded up to the end of the research, on May 11, 2020. According to the official data, there were 22,115 accumulated cases of COVID-19 in the city of Rio de Janeiro until the end date of the survey.
Thus, performing a similar analysis that was made in Figure 19 for the sum of the cases reported in the questionnaire and the official data related to COVID-19 displayed in the city hall dashboard (Figure 20), a certain adherence is noted in the number of cases in the following regions: 3.4 (Inhaúma), 3.6 (Pavuna) and 3.7 (Ilha do Governador), which is low for both variables, and 1.1 (Centro), 3.2 (Méier) and 3.3 (Madureira), with intermediate values. The highest occurrence happens in places of accumulated cases at an intermediate level, and which had a low number of responses on this item in the questionnaire, representing six areas. There is a caveat here in the number of cases added to the questionnaire since the maximum number that the interviewee could answer was three or more known people who had contracted COVID-19 (as explained in the table shown in Figure 15). Thus, the number of cases obtained in the questionnaire may be underestimated.
The official data related to deaths in Rio de Janeiro will be used in the following map. As done before, the number of deaths considered is the accumulated number until May 11, 2020. According to the official data, there was a total of 5,459 deaths in Rio de Janeiro by the end of the survey.
Observing what happens with the numbers of deaths (Figure 21), there is an adherence between the official data and that collected in the survey in the following regions: 3.4 (Inhaúma), 3.7 (Ilha do Governador), 4.2 (Barra da Tijuca) and 5.3 (Santa Cruz), presenting low figures, 1.1 (Centro), 3.2 (Méier) and 3.6 (Pavuna), which have intermediate figures and 2.1 (Zona Sul), 3.3 (Madureira) and 5.1 (Bangu), presenting high figures. There are also cases where the accumulated deaths by COVID-19 from official data presented intermediate figures, but the questionnaire data added a high number of deaths, in 2.2 (Tijuca) and 3.5 (Penha), or a low number of deaths, in 3.1 (Ramos) and 5.2 (Campo Grande) values. Finally, in region 4.1 (Jacarepaguá), the official data values are high, while in the survey they are intermediate. In this research variable, the same caveat made in regard to case data is appropriate, as the option with the highest amount in the questionnaire was whether the interviewee knew 2 or more people who died by COVID-19 (as shown in the survey’s questions in Figure 16).