Skip to content
This repository has been archived by the owner on Jul 13, 2023. It is now read-only.

Latest commit

 

History

History
245 lines (193 loc) · 5.77 KB

readme.md

File metadata and controls

245 lines (193 loc) · 5.77 KB

Cube Style Guide

Use this style guide when working on cube_analytics project, building demos, writing documentation. When working with customers, follow customer style guide if they have any, otherwise use this one.

  • Default to YAML for data modeling. Use JS data modeling when you need to have dynamic data models.
  • Use snake case, even with JS data models.
  • Cubes must remain private, set public: false for all cubes. Only views can be exposed to visualization tools.

Structure of our Cube project

Views and cubes folders structure should reflect business units structure.

Views should be designed for data consumers and optimized for consumption in the visualization tools.

cube_project
└── schema
    ├── cubes
    │   ├── finance
    │   │   ├── stripe_invoices.yml
    │   │   └── stripe_payments.yml
    │   └── sales
    │       └── base_opportunities.yml
    └── views
        ├── product
        │   └── cloud_tenants.yml
        └── sales
            └── opportunities.yml 

Cubes

  • A cube's name should represent business entiity and be plural. If cube's name may clash with view's name use prefix base_ for cube's name, e.g. base_opportunities.yml.
  • Use sql_table if possible. E.g. instead of sql: SELECT * FROM schema.table do sql_table: schema.table.
  • Use many_to_one, one_to_many, one_to_one relationship type names instead of belongs_to, has_many, has_one.
  • Applicable cube parameters should be ordered as:
    • name
    • sql_alias
    • extends
    • data_source
    • sql
    • sql_table
    • rewrite_queries
    • title
    • description
    • public
    • refresh_key
    • pre_aggregations
    • joins
    • measures
    • dimensions
  • Primary keys for the cube should be the first dimension listed.

Dimensions & measures

  • Applicable dimensions and measures parameters should be ordered as:
    • name
    • title
    • description
    • sql
    • type
    • primary_key
    • sub_query
    • public
    • format
    • filters
    • drill_members
  • Use description if name is not intuitive.

Example cube

cubes:
  - name: line_items
    sql_table: public.line_items
      
    joins:
      - name: products
        sql: "{CUBE}.product_id = {products}.id"
        relationship: many_to_one
        
      - name: orders
        sql: "{CUBE}.order_id = {orders}.id"
        relationship: many_to_one

    measures:
      - name: count
        type: count

      - name: total_amount
        sql: price
        type: sum

    dimensions:
      - name: id
        sql: id
        type: number
        primary_key: true
        
      - name: created_date
        sql: created_at
        type: time

Views

  • Applicable views parameters should be ordered as
    • name
    • description
    • cubes

Example view

views:
  - name: orders

    cubes:
      - cube: base_orders
        includes:
          # dimensions
          - status
          - created_date

          # measures
          - total_amount
          - total_amlunt_shipped
          - count
          - average_order_value
      
      - cube: base_orders.line_items.products
        includes: 
          - member: name
            name: product

      - cube: base_orders.line_items.products.product_categories
        includes: 
          - member: name
            name: product_category

      - cube: base_orders.users
        prefix: true
        includes: 
          - city

SQL style guide

  • Use trailing commas.
  • Keywords and function names should all be uppercase.
  • Use != instead of <>
  • Always use the AS keyword when aliasing columns, expressions, and tables.
  • Unless SQL query is SELECT * FROM table_name start SQL query from new line.
  • Use new lines, optimize for readability and maintainability
  • Use CTEs rather than subqueries.
  • When joining multiple tables, always prefix the column names with the table name/alias.
  • Use single quotes for strings.
  • Avoid initialisms and unnecessary table aliases.
  • If there's only one thing, put it on the same line as the opening keyword.
  • If there are multiple things, put each one on its own line (including the first one), indented one level more than the opening keyword.
  • Indents should be 2 spaces.

Example SQL

cubes:
  - name: california_users
    sql: 
      SELECT 
        id,
        first_name,
        last_name
      FROM public.users
      WHERE state = 'CA'

    measures:
      - name: count
        type: count

    dimensions:
      - name: id
        sql: id
        type: number
        primary_key: true

      - name: first_name
        sql: first_name
        type: string

      - name: last_name
        sql: last_name
        type: string

YAML style guide

  • Use .yml extension instead of .yaml
  • Indents should use two spaces.
  • List items should be indented.
  • Use a new line to separate list items that are dictionaries, where appropriate.
  • Lines of YAML should be no longer than 80 characters.
  • If quotes are needed around a string, use double quotes

Example YAML

cubes:
  - name: users
    sql_table: public.users

    measures:
      - name: count
        type: count

      - name: total_orders_amount
        sql: "{lifetime_value}"
        type: sum

    dimensions:
      - name: id
        sql: id
        type: number
        primary_key: true

      - name: city
        sql: city
        type: string
        
      - name: lifetime_value
        sql: "{line_items.total_amount}"
        type: number
        sub_query: true

Credits

This style guide was inspired in part by: