Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Calculate summary statistics #1

Closed
1 of 2 tasks
milanmlft opened this issue Aug 8, 2024 — with Linear · 3 comments
Closed
1 of 2 tasks

Calculate summary statistics #1

milanmlft opened this issue Aug 8, 2024 — with Linear · 3 comments
Labels

Comments

Copy link
Member

milanmlft commented Aug 8, 2024

Definition of done

Script that can be run taking an input OMOP dataset and producing summary statistics that will be used in the dashboard (uploading these to a different schema)

For now

  • monthly counts
  • for numeric data- mean and standard deviation (without any time filtering)
  • ignore categorical variables for now

Testing

Using in memory databases, parse example dataset, have tests for a subset of this output to ensure that the statistics are being calculated as expected

  • test using one of the synthae datasets available from CDMConnector
  • have a final summarised dataset against which we test the script, could potentially also be used for SAF-611

Dependencies

Details

Suggested structure for tables:

  • concept ID, concept type (e.g. observation), concept name, vocabulary
  • concept_id, date (10th day of month per year), patient count, readings per patient
  • concept id, summary_attribute (e.g. mean/sd for numeric or categorical variable 1/categorical variable 2…), value_as_string (if categorical, use readable name with ID in brackets), value_as_double
Copy link

linear bot commented Aug 8, 2024

@milanmlft milanmlft transferred this issue from another repository Aug 8, 2024
Copy link

linear bot commented Aug 8, 2024

@stefpiatek stefpiatek changed the title Calculate summary statistics depending on data type Calculate summary statistics Aug 12, 2024
@milanmlft
Copy link
Member Author

Closed by #22

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant