Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

new data for Canada #502

Open
nataliadgepi opened this issue Apr 11, 2020 · 3 comments
Open

new data for Canada #502

nataliadgepi opened this issue Apr 11, 2020 · 3 comments
Labels
help wanted Extra attention is needed s:data Scope: related to data retrieval, parsing, transformation, storage, update

Comments

@nataliadgepi
Copy link

Hi,
I'm not sure where to put this, the gov't of Canada has just updated their database and made hospitalizations/critical care etc public for all covid positive cases: https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=1310076601

I was wondering if this would be useful to update your database.

Thank you!
Natalia

🐛 Bug Report

How to reproduce

Steps to reproduce the issue:

  1. Open the application in a browser

😯 Current Behavior

🤔 Expected Behavior

💁 Possible Solution

🔦 Context

💻 Code Sample

🌍 Your Environment

Software Version(s)
Browser
Operating System

Related

@nataliadgepi nataliadgepi added help wanted Extra attention is needed needs triage Review this and assign labels t:bug Type: bug, error labels Apr 11, 2020
@nnoll nnoll added s:data Scope: related to data retrieval, parsing, transformation, storage, update and removed needs triage Review this and assign labels t:bug Type: bug, error labels Apr 12, 2020
@nnoll
Copy link
Collaborator

nnoll commented Apr 12, 2020

Looks like the Canadian government does provide an API for data downloads for developers, see here. Haven't been able to track down the specifics on how to just get this csv.

@noleti
Copy link
Collaborator

noleti commented Apr 12, 2020

The API is a little bit difficult to work with. As data is provided per case (https://www150.statcan.gc.ca/n1/tbl/csv/14100287-eng.zip), the .csv gets very big for all cases (~1GB before .zipping). So we likely don't want to download and parse full data in our scripts. There is an API to query data for ranges of dates for 'vectors', but each patient seems to be an individual 'vector' in the DB, and I don't see how to predict the IDs of new patients etc.

I can likely write a parser for this, if we want to have such a parser that downloads the entire set each time. Is that the case?

@nnoll
Copy link
Collaborator

nnoll commented Apr 13, 2020 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed s:data Scope: related to data retrieval, parsing, transformation, storage, update
Projects
None yet
Development

No branches or pull requests

3 participants