This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
Johns Hopkins University CSSE released the data set that powers their dashboard on GitHub (https://github.com/CSSEGISandData/COVID-19). If you want to work with that data easily, I created the attached workflow and macro to import the daily data found within that repository. Install the package to the root of the folder that Git creates. This will import the daily data, parse the date fields that change formats halfway through the timeseries, update null latitude and longitude fields, and other general cleansing. With that done, you can experiment with daily worldwide COVID-19 confirmed/deaths/recovered case numbers at the country/region and province/state level, with geocoding available for about 99% of records.
If this workflow is useful, please let us know. If you need help or have improvements to the workflow, please share.
We'd also love for you to share what you create or discover by replying to this thread!
EDIT: The workflow has been updated to better clean and regularize the data. A lot of clean up is being now to country and state names, with merging of duplicates being done, and a locality field being parsed out of values such as "Chicago, IL". Review the new workflow for details. This should improve the quality of the output data significantly, although JHU is still working on upstream issues on their end.
JHU stopped publishing county level data on Mar 09. Starting with Mar 10, you'll only have state level data on both the daily report csvs and time series csvs. Just a heads up in case anyone is wondering why all the counties suddenly show "0" for all the county values as of Mar 10. To my knowledge, that only place to get county level stats is to search and find individual state Departments of Health and see if they publish them.
A new version of the workflow that handles all of the irregularities in country, region, and locality data has been uploaded. The time portion of the updated field was removed and a unique tool added. Also, cruise ships were assigned in various ways previously, and now are under "Cruise Ship" for country, as that seemed to be the general direction JHU was heading. Keep that in mind if you'd prefer to handle the disposition of cruise ship data in a different manner.
The country name is now changed to use the ISO-3166-2 name. I've also added the ISO-3166-2 sub-region and intermediate region as new fields. The CSV file containing that data is included in the YXZP if you'd like to bring in additional country-related fields from that.
Not necessarily bad form at all. I chose git because a) I'm a developer and use to it and b) the data only updates daily, so I chose to download it only daily using that method. Thanks for sharing your workflow!
oh this looks promising - thank you very much!!! do you jsut download the CSV? or have you figured out how to get a URL for it? that's what i'm trying to do now - look in the code of the page, but i'm not very experienced in that LOL