Skip to content

Latest commit

 

History

History
39 lines (28 loc) · 1.83 KB

README.md

File metadata and controls

39 lines (28 loc) · 1.83 KB

Build Status codecov

NOTE: This is a work in progress and the current version is only for demonstration purposes. Once these tools reach Alpha status, this note should be removed. This is a collaboration between Google and the OpenMRS community.

What is this?

This repository includes pipelines to transform data from an OpenMRS instance using the FHIR format into a data warehouse based on Apache Parquet files, or another FHIR server (e.g., a HAPI FHIR server or Google Cloud FHIR store). There is also a query library in Python to make working with FHIR based data warehouses simpler.

These tools are intended to be generic and eventually work with any FHIR-based data source and data warehouse. Here is the list of main directories with a brief description of their content:

  • pipelines/ Batch and streaming pipelines to transform data from a FHIR based source to an analytics friendly data warehouse or another FHIR store.

  • dwh/ Query library for working with distributed FHIR-based data warehouses.

  • bunsen/ A fork of a subset of the Bunsen project.

  • docker/ Docker configurations for various servers/pipelines.

  • doc/ Documentations

  • utils/ Various artifacts for setting up an initial database, running pipelines, etc.

  • e2e-tests/ Scripts for testing pipelines end-to-end.