The dataset is about the people who were impacted due to covid 19 in different states and territories of the USA. They are the representative of the main characters of the story as this will help to understand the trend of covid -19 impact on different locations of the USA. The data does not contain any identifying information nor does it have risks of disclosing identifiable information, it is mostly anonymous geographical and medical information.
The dataset records the number of people tested positive, recovered, deceased, hospitalized in each and every state on each day.
Global data is available but we narrowed down the scope of our project to the USA hence we have collected data for all the states and territories of the USA. The state variable in the dataset geographically separates out the data.
The data is collected everyday from official sites of each state and placed at covidtracking.com. As part of the first and second increment, we collected the data in csv format but for next increment we will be consuming API to get the real time data. Covid-19 started spreading in the USA from March 2020, hence the data is available from March and gets updated everyday.
The data is very crucial to understand the ongoing pandemic and its effect on every sector. The data is collected to analyze, understand and identify the gaps in preventive measures taken.