This dbt package:
- Transforms and aggregates raw web event data collected from the Snowplow JavaScript tracker into a set of derived tables: page views, sessions and users.
- Derives a mapping between user identifiers, allowing for 'session stitching' and the development of a single customer view.
- Processes all web events incrementally. It is not just constrained to page view events - any custom events you are tracking will also be incrementally processed.
- Is designed in a modular manner, allowing you to easily integrate your own custom SQL into the incremental framework provided by the package.
Please refer to the doc site for a full breakdown of the package.
The snowplow-web v0.5.1 package currently supports BigQuery, Redshift, Snowflake & Postgres.
Warehouse | dbt versions | snowplow-web version |
---|---|---|
BigQuery, Redshift, Snowflake & Postgres | >=0.20.0 to <1.1.0 | 0.5.1 |
BigQuery, Redshift & Snowflake | >=0.18.0 to <0.20.0 | 0.4.1 |
Postgres | >=0.19.0 to <0.20.0 | 0.4.1 |
- A dataset of web events from the Snowplow JavaScript tracker must be available in the database.
- Have the
webPage
context enabled.
Check dbt Hub for the latest installation instructions, or read the dbt docs for more information on installing packages.
Please refer to the doc site for details on how to configure and run the package.
The package contains multiple staging models however the mart models are as follows:
Model | Description |
---|---|
snowplow_web_page_views | A table of page views, including engagement metrics such as scroll depth and engaged time. |
snowplow_web_sessions | An aggregated table of page views, grouped on domain_sessionid . |
snowplow_web_users | An aggregated table of sessions to a user level, grouped on domain_userid . |
snowplow_web_user_mapping | Provides a mapping between user identifiers, domain_userid and user_id . |
We welcome all ideas, questions and contributions!
For support requests, please use our community support Discourse forum.
If you find a bug, please report an issue on GitHub.
The snowplow-web package is Copyright 2021-2022 Snowplow Analytics Ltd.
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this software except in compliance with the License.
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.