diff --git a/_site.yml b/_site.yml index 9936eac..e0b5776 100644 --- a/_site.yml +++ b/_site.yml @@ -33,6 +33,8 @@ navbar: href: notes/08_model-checking-and-cv_bda3-6-7.html - text: "Section 9. Model comparison and selection" href: notes/09_model-selection_bda3-7.html + - text: "Section 10. Decision analysis" + href: notes/10_decision-analysis_bda3-9.html - text: "Exercises" menu: - text: "Chapter 1" @@ -59,6 +61,8 @@ navbar: href: assignments/jhcook-assignment-07.html - text: "Assignment 8" href: assignments/jhcook-assignment-08.html + - text: "Assignment 9" + href: assignments/jhcook-assignment-09.html - text: "About" href: about.html output: distill::distill_article diff --git a/assignments/assignment-09.pdf b/assignments/assignment-09.pdf new file mode 100644 index 0000000..494d99f Binary files /dev/null and b/assignments/assignment-09.pdf differ diff --git a/assignments/jhcook-assignment-09.Rmd b/assignments/jhcook-assignment-09.Rmd new file mode 100644 index 0000000..a987f3a --- /dev/null +++ b/assignments/jhcook-assignment-09.Rmd @@ -0,0 +1,200 @@ +--- +title: "Assignment 9" +date: "2021-11-18" +output: distill::distill_article +--- + +## Setup + +```{r setup} +knitr::opts_chunk$set(echo = TRUE, comment = "#>", dpi = 300) + +library(glue) +library(rstan) +library(tidybayes) +library(tidyverse) + +for (f in list.files(here::here("src"), pattern = "R$", full.names = TRUE)) { + source(f) +} + +rstan_options(auto_write = TRUE) +options(mc.cores = 2) + +theme_set(theme_classic() + theme(strip.background = element_blank())) + +factory <- aaltobda::factory +set.seed(678) +``` + +**[Assignment 9](assignments/assignment-09.pdf)** + +## Exercise 1. Decision analysis for the factory data + +**Your task is to decide whether or not to buy a new (7th) machine for the company.** +**The decision should be based on our best knowledge about the machines.** + +**The following is known about the production process:** + +- **The given data contains quality measurements of single products from the six machines that are ordered from the same seller. (columns: different machines, rows: measurements)** +- **Customers pay 200 euros for each product.** + – **If the quality of the product is below 85, the product cannot be sold** + – **All the products that have sufficient quality are sold.** +- **Raw-materials, the salary of the machine user and the usage cost of the machine for each product cost 106 euros in total.** + – **Usage cost of the machine also involves all investment and repair costs divided by the number of products a machine can create. So there is no need to take the investment cost into account as a separate factor.** +- **The only thing the company owner cares about is money. Thus, as a utility function, use the profit of a new product from a machine.** + +**a) For each of the six machines, compute and report the expected utility of one product of that machine.** + +```{r} +PURCHASE_RPICE <- 200 +MIN_QUALITY_TO_SELL <- 85 +COST_TO_PRODUCE <- 106 + +utility <- function(draws) { + purchased <- PURCHASE_RPICE * sum(draws >= MIN_QUALITY_TO_SELL) + cost_to_produce <- -1 * COST_TO_PRODUCE * length(draws) + u <- (purchased + cost_to_produce) / length(draws) + return(u) +} + +# Test case given in the assignment. +test_y_pred <- c(123.80, 85.23, 70.16, 80.57, 84.91) +test_res <- utility(draws = test_y_pred) +stop_if_not_close_to(test_res, -26) +``` + +Fit the hierarchical model and gather posterior predictions from each machine. + +```{r} +hierarchical_model_code <- here::here( + "models", "assignment07_factories_hierarchical.stan" +) + +hierarchical_model_data <- list( + y = factory, + N = nrow(factory), + J = ncol(factory) +) + +hierarchical_model <- rstan::stan( + hierarchical_model_code, + data = hierarchical_model_data, + verbose = FALSE, + refresh = 0 +) + +print(hierarchical_model, pars = c("alpha", "tau", "mu", "sigma")) +``` + +Extract the posterior predictions for each machine and compare them to the observations. + +```{r} +factory_ypred <- rstan::extract(hierarchical_model, pars = "ypred")$ypred + +tidy_factory_measure_matrix <- function(factory_mat) { + as.data.frame(factory_mat) %>% + set_names(glue("machine {seq(ncol(factory_mat))}")) %>% + pivot_longer(-c(), names_to = "machine", values_to = "quality_measurement") +} + +factory_long <- tidy_factory_measure_matrix(factory) + +tidy_factory_measure_matrix(factory_ypred) %>% + ggplot(aes(x = quality_measurement)) + + facet_wrap(vars(machine), nrow = 2, scales = "free") + + geom_density(fill = "black", alpha = 0.1) + + geom_rug(data = factory_long, color = "blue") + + scale_x_continuous(expand = expansion(c(0, 0))) + + scale_y_continuous(expand = expansion(c(0, 0.02))) + + labs( + x = "quality measurements", + y = "density", + title = "Posterior predictions on current machines" + ) +``` + +Calculate the expected utility for each current machine. + +```{r} +machine_utilities <- apply(factory_ypred, 2, utility) +tibble( + machine = glue("machine {seq(length(machine_utilities))}"), + expected_utility = machine_utilities +) %>% + kableExtra::kbl() +``` + +**b) Rank the machines based on the expected utilities.** +**In other words order the machines from worst to best.** +**Also briefly explain what the utility values tell about the quality of these machines.** +**E.g. Tell which machines are profitable and which are not (if any).** + +Based on their expected utility, the rankings of the machines from worst to best is: 1, 6, 3, 5, 2, 4. +Machine 1 has a negative utility, indicating that it is expected to be unprofitable. + +**c) Compute and report the expected utility of the products of a new (7th) machine.** + +```{r} +machine7_pred <- rstan::extract(hierarchical_model, pars = "y7pred")$y7pred + +ggplot(tibble(x = unlist(machine7_pred)), aes(x = x)) + + geom_density(fill = "black", alpha = 0.1) + + scale_x_continuous(expand = expansion(c(0, 0))) + + scale_y_continuous(expand = expansion(c(0, 0.02))) + + labs( + x = "quality measurements", + y = "density", + title = "Posterior predictions on hypothetical machine 7" + ) +``` + +```{r} +# Expected utility from machine 7. +utility(machine7_pred) +``` + +The expected utility of hypothetical machine 7 is **`r utility(machine7_pred)`**. + +**d) Based on your analysis, discuss briefly whether the company owner should buy a new (7th) machine.** + +Based on this analysis, purchasing another machine would be expected to be profitable. +It might be worth replacing machine 1 with this new machine. + +**e) As usual, remember to include the source code for both Stan and R (or Python).** + +The model is available here ["assignment07_factories_hierarchical.stan"](../models/assignment07_factories_hierarchical.stan). + +The only changes were made in the `generated quantities` block: + +``` +... +generated quantities { + // Compute the predictive distribution for the sixth machine. + real y6pred; // Leave for compatibility with earlier assignments. + vector[J] ypred; + real mu7pred; + real y7pred; + vector[J] log_lik[N]; + + y6pred = normal_rng(mu[6], sigma); + for (j in 1:J) { + ypred[j] = normal_rng(mu[j], sigma); + } + + mu7pred = normal_rng(alpha, tau); + y7pred = normal_rng(mu7pred, sigma); + + for (j in 1:J) { + for (n in 1:N) { + log_lik[n,j] = normal_lpdf(y[n,j] | mu[j], sigma); + } + } +} +``` + +--- + +```{r} +sessionInfo() +``` diff --git a/assignments/jhcook-assignment-09.html b/assignments/jhcook-assignment-09.html new file mode 100644 index 0000000..6f93239 --- /dev/null +++ b/assignments/jhcook-assignment-09.html @@ -0,0 +1,3867 @@ + + + + + + + + + + + + + + + + + + + + Assignment 9 + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

Assignment 9

+ + + +
+ + +
+

Setup

+
+
+
knitr::opts_chunk$set(echo = TRUE, comment = "#>", dpi = 300)
+
+library(glue)
+library(rstan)
+library(tidybayes)
+library(tidyverse)
+
+for (f in list.files(here::here("src"), pattern = "R$", full.names = TRUE)) {
+  source(f)
+}
+
+rstan_options(auto_write = TRUE)
+options(mc.cores = 2)
+
+theme_set(theme_classic() + theme(strip.background = element_blank()))
+
+factory <- aaltobda::factory
+set.seed(678)
+
+
+
+

Assignment 9

+

Exercise 1. Decision analysis for the factory data

+

Your task is to decide whether or not to buy a new (7th) machine for the company. The decision should be based on our best knowledge about the machines.

+

The following is known about the production process:

+ +

a) For each of the six machines, compute and report the expected utility of one product of that machine.

+
+
+
PURCHASE_RPICE <- 200
+MIN_QUALITY_TO_SELL <- 85
+COST_TO_PRODUCE <- 106
+
+utility <- function(draws) {
+  purchased <- PURCHASE_RPICE * sum(draws >= MIN_QUALITY_TO_SELL)
+  cost_to_produce <- -1 * COST_TO_PRODUCE * length(draws)
+  u <- (purchased + cost_to_produce) / length(draws)
+  return(u)
+}
+
+# Test case given in the assignment.
+test_y_pred <- c(123.80, 85.23, 70.16, 80.57, 84.91)
+test_res <- utility(draws = test_y_pred)
+stop_if_not_close_to(test_res, -26)
+
+
+
+

Fit the hierarchical model and gather posterior predictions from each machine.

+
+
+
hierarchical_model_code <- here::here(
+  "models", "assignment07_factories_hierarchical.stan"
+)
+
+hierarchical_model_data <- list(
+  y = factory,
+  N = nrow(factory),
+  J = ncol(factory)
+)
+
+hierarchical_model <- rstan::stan(
+  hierarchical_model_code,
+  data = hierarchical_model_data,
+  verbose = FALSE,
+  refresh = 0
+)
+
+print(hierarchical_model, pars = c("alpha", "tau", "mu", "sigma"))
+
+
+
#> Inference for Stan model: assignment07_factories_hierarchical.
+#> 4 chains, each with iter=2000; warmup=1000; thin=1;
+#> post-warmup draws per chain=1000, total post-warmup draws=4000.
+#>
+#>         mean se_mean   sd  2.5%    25%    50%    75%  97.5% n_eff
+#> alpha  94.55    0.08 4.83 85.43  91.32  94.37  97.72 104.49  3501
+#> tau    11.10    0.12 4.16  4.08   8.23  10.66  13.54  20.47  1212
+#> mu[1]  81.53    0.16 6.34 69.02  77.32  81.42  85.70  94.91  1618
+#> mu[2] 102.52    0.11 5.86 91.11  98.64 102.57 106.40 113.99  3076
+#> mu[3]  89.82    0.09 5.55 79.06  86.07  89.76  93.49 100.91  4114
+#> mu[4] 106.41    0.13 6.13 93.57 102.51 106.57 110.50 118.39  2285
+#> mu[5]  91.29    0.08 5.37 80.40  87.93  91.28  94.86 102.05  4113
+#> mu[6]  88.51    0.10 5.47 78.00  84.76  88.50  92.29  99.23  3143
+#> sigma  14.27    0.04 2.08 10.87  12.79  14.02  15.50  18.81  2940
+#>       Rhat
+#> alpha    1
+#> tau      1
+#> mu[1]    1
+#> mu[2]    1
+#> mu[3]    1
+#> mu[4]    1
+#> mu[5]    1
+#> mu[6]    1
+#> sigma    1
+#>
+#> Samples were drawn using NUTS(diag_e) at Thu Nov 18 07:08:39 2021.
+#> For each parameter, n_eff is a crude measure of effective sample size,
+#> and Rhat is the potential scale reduction factor on split chains (at
+#> convergence, Rhat=1).
+
+

Extract the posterior predictions for each machine and compare them to the observations.

+
+
+
factory_ypred <- rstan::extract(hierarchical_model, pars = "ypred")$ypred
+
+tidy_factory_measure_matrix <- function(factory_mat) {
+  as.data.frame(factory_mat) %>%
+    set_names(glue("machine {seq(ncol(factory_mat))}")) %>%
+    pivot_longer(-c(), names_to = "machine", values_to = "quality_measurement")
+}
+
+factory_long <- tidy_factory_measure_matrix(factory)
+
+tidy_factory_measure_matrix(factory_ypred) %>%
+  ggplot(aes(x = quality_measurement)) +
+  facet_wrap(vars(machine), nrow = 2, scales = "free") +
+  geom_density(fill = "black", alpha = 0.1) +
+  geom_rug(data = factory_long, color = "blue") +
+  scale_x_continuous(expand = expansion(c(0, 0))) +
+  scale_y_continuous(expand = expansion(c(0, 0.02))) +
+  labs(
+    x = "quality measurements",
+    y = "density",
+    title = "Posterior predictions on current machines"
+  )
+
+
+

+
+

Calculate the expected utility for each current machine.

+
+
+
machine_utilities <- apply(factory_ypred, 2, utility)
+tibble(
+  machine = glue("machine {seq(length(machine_utilities))}"),
+  expected_utility = machine_utilities
+) %>%
+  kableExtra::kbl()
+
+
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
+machine + +expected_utility +
+machine 1 + +-26.15 +
+machine 2 + +70.15 +
+machine 3 + +17.50 +
+machine 4 + +76.45 +
+machine 5 + +27.45 +
+machine 6 + +11.05 +
+
+

b) Rank the machines based on the expected utilities. In other words order the machines from worst to best. Also briefly explain what the utility values tell about the quality of these machines. E.g. Tell which machines are profitable and which are not (if any).

+

Based on their expected utility, the rankings of the machines from worst to best is: 1, 6, 3, 5, 2, 4. Machine 1 has a negative utility, indicating that it is expected to be unprofitable.

+

c) Compute and report the expected utility of the products of a new (7th) machine.

+
+
+
machine7_pred <- rstan::extract(hierarchical_model, pars = "y7pred")$y7pred
+
+ggplot(tibble(x = unlist(machine7_pred)), aes(x = x)) +
+  geom_density(fill = "black", alpha = 0.1) +
+  scale_x_continuous(expand = expansion(c(0, 0))) +
+  scale_y_continuous(expand = expansion(c(0, 0.02))) +
+  labs(
+    x = "quality measurements",
+    y = "density",
+    title = "Posterior predictions on hypothetical machine 7"
+  )
+
+
+

+
+
+
+
# Expected utility from machine 7.
+utility(machine7_pred)
+
+
+
#> [1] 30.7
+
+

The expected utility of hypothetical machine 7 is 30.7.

+

d) Based on your analysis, discuss briefly whether the company owner should buy a new (7th) machine.

+

Based on this analysis, purchasing another machine would be expected to be profitable. It might be worth replacing machine 1 with this new machine.

+

e) As usual, remember to include the source code for both Stan and R (or Python).

+

The model is available here “assignment07_factories_hierarchical.stan”.

+

The only changes were made in the generated quantities block:

+
...
+generated quantities {
+  // Compute the predictive distribution for the sixth machine.
+  real y6pred;  // Leave for compatibility with earlier assignments.
+  vector[J] ypred;
+  real mu7pred;
+  real y7pred;
+  vector[J] log_lik[N];
+
+  y6pred = normal_rng(mu[6], sigma);
+  for (j in 1:J) {
+    ypred[j] = normal_rng(mu[j], sigma);
+  }
+
+  mu7pred = normal_rng(alpha, tau);
+  y7pred = normal_rng(mu7pred, sigma);
+
+  for (j in 1:J) {
+    for (n in 1:N) {
+      log_lik[n,j] = normal_lpdf(y[n,j] | mu[j], sigma);
+    }
+  }
+}
+
+
+
+
sessionInfo()
+
+
+
#> R version 4.1.2 (2021-11-01)
+#> Platform: x86_64-apple-darwin17.0 (64-bit)
+#> Running under: macOS Big Sur 10.16
+#>
+#> Matrix products: default
+#> BLAS:   /Library/Frameworks/R.framework/Versions/4.1/Resources/lib/libRblas.0.dylib
+#> LAPACK: /Library/Frameworks/R.framework/Versions/4.1/Resources/lib/libRlapack.dylib
+#>
+#> locale:
+#> [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
+#>
+#> attached base packages:
+#> [1] stats     graphics  grDevices datasets  utils     methods
+#> [7] base
+#>
+#> other attached packages:
+#>  [1] forcats_0.5.1        stringr_1.4.0        dplyr_1.0.7
+#>  [4] purrr_0.3.4          readr_2.0.1          tidyr_1.1.3
+#>  [7] tibble_3.1.3         tidyverse_1.3.1      tidybayes_3.0.1
+#> [10] rstan_2.21.2         ggplot2_3.3.5        StanHeaders_2.21.0-7
+#> [13] glue_1.4.2
+#>
+#> loaded via a namespace (and not attached):
+#>  [1] matrixStats_0.61.0   fs_1.5.0             lubridate_1.7.10
+#>  [4] webshot_0.5.2        httr_1.4.2           rprojroot_2.0.2
+#>  [7] tensorA_0.36.2       tools_4.1.2          backports_1.2.1
+#> [10] bslib_0.2.5.1        utf8_1.2.2           R6_2.5.0
+#> [13] DBI_1.1.1            colorspace_2.0-2     ggdist_3.0.0
+#> [16] withr_2.4.2          tidyselect_1.1.1     gridExtra_2.3
+#> [19] prettyunits_1.1.1    processx_3.5.2       downlit_0.2.1
+#> [22] curl_4.3.2           compiler_4.1.2       cli_3.0.1
+#> [25] rvest_1.0.1          arrayhelpers_1.1-0   xml2_1.3.2
+#> [28] labeling_0.4.2       posterior_1.1.0      sass_0.4.0
+#> [31] scales_1.1.1         checkmate_2.0.0      aaltobda_0.3.1
+#> [34] callr_3.7.0          systemfonts_1.0.3    digest_0.6.27
+#> [37] svglite_2.0.0        rmarkdown_2.10       pkgconfig_2.0.3
+#> [40] htmltools_0.5.1.1    highr_0.9            dbplyr_2.1.1
+#> [43] rlang_0.4.11         readxl_1.3.1         rstudioapi_0.13
+#> [46] jquerylib_0.1.4      farver_2.1.0         generics_0.1.0
+#> [49] svUnit_1.0.6         jsonlite_1.7.2       distill_1.2
+#> [52] distributional_0.2.2 inline_0.3.19        magrittr_2.0.1
+#> [55] kableExtra_1.3.4     loo_2.4.1            Rcpp_1.0.7
+#> [58] munsell_0.5.0        fansi_0.5.0          abind_1.4-5
+#> [61] lifecycle_1.0.0      stringi_1.7.3        yaml_2.2.1
+#> [64] pkgbuild_1.2.0       grid_4.1.2           parallel_4.1.2
+#> [67] crayon_1.4.1         lattice_0.20-45      haven_2.4.3
+#> [70] hms_1.1.0            knitr_1.33           ps_1.6.0
+#> [73] pillar_1.6.2         codetools_0.2-18     clisymbols_1.2.0
+#> [76] stats4_4.1.2         reprex_2.0.1         evaluate_0.14
+#> [79] V8_3.4.2             renv_0.14.0          RcppParallel_5.1.4
+#> [82] modelr_0.1.8         vctrs_0.3.8          tzdb_0.1.2
+#> [85] cellranger_1.1.0     gtable_0.3.0         assertthat_0.2.1
+#> [88] xfun_0.25            broom_0.7.9          coda_0.19-4
+#> [91] viridisLite_0.4.0    ellipsis_0.3.2       here_1.0.1
+
+
+ + +
+ +
+
+ + + + + +
+ + + + + + + diff --git a/docs/about.html b/docs/about.html index a4f0b2a..f884857 100644 --- a/docs/about.html +++ b/docs/about.html @@ -2393,6 +2393,7 @@

${suggestion.title}

Section 7. Hierarchical models and exchangeability Section 8. Model checking & Cross-validation Section 9. Model comparison and selection +Section 10. Decision analysis About diff --git a/docs/assignments/assignment-09.pdf b/docs/assignments/assignment-09.pdf new file mode 100644 index 0000000..494d99f Binary files /dev/null and b/docs/assignments/assignment-09.pdf differ diff --git a/docs/assignments/jhcook-assignment-09.Rmd b/docs/assignments/jhcook-assignment-09.Rmd new file mode 100644 index 0000000..a987f3a --- /dev/null +++ b/docs/assignments/jhcook-assignment-09.Rmd @@ -0,0 +1,200 @@ +--- +title: "Assignment 9" +date: "2021-11-18" +output: distill::distill_article +--- + +## Setup + +```{r setup} +knitr::opts_chunk$set(echo = TRUE, comment = "#>", dpi = 300) + +library(glue) +library(rstan) +library(tidybayes) +library(tidyverse) + +for (f in list.files(here::here("src"), pattern = "R$", full.names = TRUE)) { + source(f) +} + +rstan_options(auto_write = TRUE) +options(mc.cores = 2) + +theme_set(theme_classic() + theme(strip.background = element_blank())) + +factory <- aaltobda::factory +set.seed(678) +``` + +**[Assignment 9](assignments/assignment-09.pdf)** + +## Exercise 1. Decision analysis for the factory data + +**Your task is to decide whether or not to buy a new (7th) machine for the company.** +**The decision should be based on our best knowledge about the machines.** + +**The following is known about the production process:** + +- **The given data contains quality measurements of single products from the six machines that are ordered from the same seller. (columns: different machines, rows: measurements)** +- **Customers pay 200 euros for each product.** + – **If the quality of the product is below 85, the product cannot be sold** + – **All the products that have sufficient quality are sold.** +- **Raw-materials, the salary of the machine user and the usage cost of the machine for each product cost 106 euros in total.** + – **Usage cost of the machine also involves all investment and repair costs divided by the number of products a machine can create. So there is no need to take the investment cost into account as a separate factor.** +- **The only thing the company owner cares about is money. Thus, as a utility function, use the profit of a new product from a machine.** + +**a) For each of the six machines, compute and report the expected utility of one product of that machine.** + +```{r} +PURCHASE_RPICE <- 200 +MIN_QUALITY_TO_SELL <- 85 +COST_TO_PRODUCE <- 106 + +utility <- function(draws) { + purchased <- PURCHASE_RPICE * sum(draws >= MIN_QUALITY_TO_SELL) + cost_to_produce <- -1 * COST_TO_PRODUCE * length(draws) + u <- (purchased + cost_to_produce) / length(draws) + return(u) +} + +# Test case given in the assignment. +test_y_pred <- c(123.80, 85.23, 70.16, 80.57, 84.91) +test_res <- utility(draws = test_y_pred) +stop_if_not_close_to(test_res, -26) +``` + +Fit the hierarchical model and gather posterior predictions from each machine. + +```{r} +hierarchical_model_code <- here::here( + "models", "assignment07_factories_hierarchical.stan" +) + +hierarchical_model_data <- list( + y = factory, + N = nrow(factory), + J = ncol(factory) +) + +hierarchical_model <- rstan::stan( + hierarchical_model_code, + data = hierarchical_model_data, + verbose = FALSE, + refresh = 0 +) + +print(hierarchical_model, pars = c("alpha", "tau", "mu", "sigma")) +``` + +Extract the posterior predictions for each machine and compare them to the observations. + +```{r} +factory_ypred <- rstan::extract(hierarchical_model, pars = "ypred")$ypred + +tidy_factory_measure_matrix <- function(factory_mat) { + as.data.frame(factory_mat) %>% + set_names(glue("machine {seq(ncol(factory_mat))}")) %>% + pivot_longer(-c(), names_to = "machine", values_to = "quality_measurement") +} + +factory_long <- tidy_factory_measure_matrix(factory) + +tidy_factory_measure_matrix(factory_ypred) %>% + ggplot(aes(x = quality_measurement)) + + facet_wrap(vars(machine), nrow = 2, scales = "free") + + geom_density(fill = "black", alpha = 0.1) + + geom_rug(data = factory_long, color = "blue") + + scale_x_continuous(expand = expansion(c(0, 0))) + + scale_y_continuous(expand = expansion(c(0, 0.02))) + + labs( + x = "quality measurements", + y = "density", + title = "Posterior predictions on current machines" + ) +``` + +Calculate the expected utility for each current machine. + +```{r} +machine_utilities <- apply(factory_ypred, 2, utility) +tibble( + machine = glue("machine {seq(length(machine_utilities))}"), + expected_utility = machine_utilities +) %>% + kableExtra::kbl() +``` + +**b) Rank the machines based on the expected utilities.** +**In other words order the machines from worst to best.** +**Also briefly explain what the utility values tell about the quality of these machines.** +**E.g. Tell which machines are profitable and which are not (if any).** + +Based on their expected utility, the rankings of the machines from worst to best is: 1, 6, 3, 5, 2, 4. +Machine 1 has a negative utility, indicating that it is expected to be unprofitable. + +**c) Compute and report the expected utility of the products of a new (7th) machine.** + +```{r} +machine7_pred <- rstan::extract(hierarchical_model, pars = "y7pred")$y7pred + +ggplot(tibble(x = unlist(machine7_pred)), aes(x = x)) + + geom_density(fill = "black", alpha = 0.1) + + scale_x_continuous(expand = expansion(c(0, 0))) + + scale_y_continuous(expand = expansion(c(0, 0.02))) + + labs( + x = "quality measurements", + y = "density", + title = "Posterior predictions on hypothetical machine 7" + ) +``` + +```{r} +# Expected utility from machine 7. +utility(machine7_pred) +``` + +The expected utility of hypothetical machine 7 is **`r utility(machine7_pred)`**. + +**d) Based on your analysis, discuss briefly whether the company owner should buy a new (7th) machine.** + +Based on this analysis, purchasing another machine would be expected to be profitable. +It might be worth replacing machine 1 with this new machine. + +**e) As usual, remember to include the source code for both Stan and R (or Python).** + +The model is available here ["assignment07_factories_hierarchical.stan"](../models/assignment07_factories_hierarchical.stan). + +The only changes were made in the `generated quantities` block: + +``` +... +generated quantities { + // Compute the predictive distribution for the sixth machine. + real y6pred; // Leave for compatibility with earlier assignments. + vector[J] ypred; + real mu7pred; + real y7pred; + vector[J] log_lik[N]; + + y6pred = normal_rng(mu[6], sigma); + for (j in 1:J) { + ypred[j] = normal_rng(mu[j], sigma); + } + + mu7pred = normal_rng(alpha, tau); + y7pred = normal_rng(mu7pred, sigma); + + for (j in 1:J) { + for (n in 1:N) { + log_lik[n,j] = normal_lpdf(y[n,j] | mu[j], sigma); + } + } +} +``` + +--- + +```{r} +sessionInfo() +``` diff --git a/docs/assignments/jhcook-assignment-09.html b/docs/assignments/jhcook-assignment-09.html new file mode 100644 index 0000000..6f93239 --- /dev/null +++ b/docs/assignments/jhcook-assignment-09.html @@ -0,0 +1,3867 @@ + + + + + + + + + + + + + + + + + + + + Assignment 9 + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

Assignment 9

+ + + +
+ + +
+

Setup

+
+
+
knitr::opts_chunk$set(echo = TRUE, comment = "#>", dpi = 300)
+
+library(glue)
+library(rstan)
+library(tidybayes)
+library(tidyverse)
+
+for (f in list.files(here::here("src"), pattern = "R$", full.names = TRUE)) {
+  source(f)
+}
+
+rstan_options(auto_write = TRUE)
+options(mc.cores = 2)
+
+theme_set(theme_classic() + theme(strip.background = element_blank()))
+
+factory <- aaltobda::factory
+set.seed(678)
+
+
+
+

Assignment 9

+

Exercise 1. Decision analysis for the factory data

+

Your task is to decide whether or not to buy a new (7th) machine for the company. The decision should be based on our best knowledge about the machines.

+

The following is known about the production process:

+ +

a) For each of the six machines, compute and report the expected utility of one product of that machine.

+
+
+
PURCHASE_RPICE <- 200
+MIN_QUALITY_TO_SELL <- 85
+COST_TO_PRODUCE <- 106
+
+utility <- function(draws) {
+  purchased <- PURCHASE_RPICE * sum(draws >= MIN_QUALITY_TO_SELL)
+  cost_to_produce <- -1 * COST_TO_PRODUCE * length(draws)
+  u <- (purchased + cost_to_produce) / length(draws)
+  return(u)
+}
+
+# Test case given in the assignment.
+test_y_pred <- c(123.80, 85.23, 70.16, 80.57, 84.91)
+test_res <- utility(draws = test_y_pred)
+stop_if_not_close_to(test_res, -26)
+
+
+
+

Fit the hierarchical model and gather posterior predictions from each machine.

+
+
+
hierarchical_model_code <- here::here(
+  "models", "assignment07_factories_hierarchical.stan"
+)
+
+hierarchical_model_data <- list(
+  y = factory,
+  N = nrow(factory),
+  J = ncol(factory)
+)
+
+hierarchical_model <- rstan::stan(
+  hierarchical_model_code,
+  data = hierarchical_model_data,
+  verbose = FALSE,
+  refresh = 0
+)
+
+print(hierarchical_model, pars = c("alpha", "tau", "mu", "sigma"))
+
+
+
#> Inference for Stan model: assignment07_factories_hierarchical.
+#> 4 chains, each with iter=2000; warmup=1000; thin=1;
+#> post-warmup draws per chain=1000, total post-warmup draws=4000.
+#>
+#>         mean se_mean   sd  2.5%    25%    50%    75%  97.5% n_eff
+#> alpha  94.55    0.08 4.83 85.43  91.32  94.37  97.72 104.49  3501
+#> tau    11.10    0.12 4.16  4.08   8.23  10.66  13.54  20.47  1212
+#> mu[1]  81.53    0.16 6.34 69.02  77.32  81.42  85.70  94.91  1618
+#> mu[2] 102.52    0.11 5.86 91.11  98.64 102.57 106.40 113.99  3076
+#> mu[3]  89.82    0.09 5.55 79.06  86.07  89.76  93.49 100.91  4114
+#> mu[4] 106.41    0.13 6.13 93.57 102.51 106.57 110.50 118.39  2285
+#> mu[5]  91.29    0.08 5.37 80.40  87.93  91.28  94.86 102.05  4113
+#> mu[6]  88.51    0.10 5.47 78.00  84.76  88.50  92.29  99.23  3143
+#> sigma  14.27    0.04 2.08 10.87  12.79  14.02  15.50  18.81  2940
+#>       Rhat
+#> alpha    1
+#> tau      1
+#> mu[1]    1
+#> mu[2]    1
+#> mu[3]    1
+#> mu[4]    1
+#> mu[5]    1
+#> mu[6]    1
+#> sigma    1
+#>
+#> Samples were drawn using NUTS(diag_e) at Thu Nov 18 07:08:39 2021.
+#> For each parameter, n_eff is a crude measure of effective sample size,
+#> and Rhat is the potential scale reduction factor on split chains (at
+#> convergence, Rhat=1).
+
+

Extract the posterior predictions for each machine and compare them to the observations.

+
+
+
factory_ypred <- rstan::extract(hierarchical_model, pars = "ypred")$ypred
+
+tidy_factory_measure_matrix <- function(factory_mat) {
+  as.data.frame(factory_mat) %>%
+    set_names(glue("machine {seq(ncol(factory_mat))}")) %>%
+    pivot_longer(-c(), names_to = "machine", values_to = "quality_measurement")
+}
+
+factory_long <- tidy_factory_measure_matrix(factory)
+
+tidy_factory_measure_matrix(factory_ypred) %>%
+  ggplot(aes(x = quality_measurement)) +
+  facet_wrap(vars(machine), nrow = 2, scales = "free") +
+  geom_density(fill = "black", alpha = 0.1) +
+  geom_rug(data = factory_long, color = "blue") +
+  scale_x_continuous(expand = expansion(c(0, 0))) +
+  scale_y_continuous(expand = expansion(c(0, 0.02))) +
+  labs(
+    x = "quality measurements",
+    y = "density",
+    title = "Posterior predictions on current machines"
+  )
+
+
+

+
+

Calculate the expected utility for each current machine.

+
+
+
machine_utilities <- apply(factory_ypred, 2, utility)
+tibble(
+  machine = glue("machine {seq(length(machine_utilities))}"),
+  expected_utility = machine_utilities
+) %>%
+  kableExtra::kbl()
+
+
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
+machine + +expected_utility +
+machine 1 + +-26.15 +
+machine 2 + +70.15 +
+machine 3 + +17.50 +
+machine 4 + +76.45 +
+machine 5 + +27.45 +
+machine 6 + +11.05 +
+
+

b) Rank the machines based on the expected utilities. In other words order the machines from worst to best. Also briefly explain what the utility values tell about the quality of these machines. E.g. Tell which machines are profitable and which are not (if any).

+

Based on their expected utility, the rankings of the machines from worst to best is: 1, 6, 3, 5, 2, 4. Machine 1 has a negative utility, indicating that it is expected to be unprofitable.

+

c) Compute and report the expected utility of the products of a new (7th) machine.

+
+
+
machine7_pred <- rstan::extract(hierarchical_model, pars = "y7pred")$y7pred
+
+ggplot(tibble(x = unlist(machine7_pred)), aes(x = x)) +
+  geom_density(fill = "black", alpha = 0.1) +
+  scale_x_continuous(expand = expansion(c(0, 0))) +
+  scale_y_continuous(expand = expansion(c(0, 0.02))) +
+  labs(
+    x = "quality measurements",
+    y = "density",
+    title = "Posterior predictions on hypothetical machine 7"
+  )
+
+
+

+
+
+
+
# Expected utility from machine 7.
+utility(machine7_pred)
+
+
+
#> [1] 30.7
+
+

The expected utility of hypothetical machine 7 is 30.7.

+

d) Based on your analysis, discuss briefly whether the company owner should buy a new (7th) machine.

+

Based on this analysis, purchasing another machine would be expected to be profitable. It might be worth replacing machine 1 with this new machine.

+

e) As usual, remember to include the source code for both Stan and R (or Python).

+

The model is available here “assignment07_factories_hierarchical.stan”.

+

The only changes were made in the generated quantities block:

+
...
+generated quantities {
+  // Compute the predictive distribution for the sixth machine.
+  real y6pred;  // Leave for compatibility with earlier assignments.
+  vector[J] ypred;
+  real mu7pred;
+  real y7pred;
+  vector[J] log_lik[N];
+
+  y6pred = normal_rng(mu[6], sigma);
+  for (j in 1:J) {
+    ypred[j] = normal_rng(mu[j], sigma);
+  }
+
+  mu7pred = normal_rng(alpha, tau);
+  y7pred = normal_rng(mu7pred, sigma);
+
+  for (j in 1:J) {
+    for (n in 1:N) {
+      log_lik[n,j] = normal_lpdf(y[n,j] | mu[j], sigma);
+    }
+  }
+}
+
+
+
+
sessionInfo()
+
+
+
#> R version 4.1.2 (2021-11-01)
+#> Platform: x86_64-apple-darwin17.0 (64-bit)
+#> Running under: macOS Big Sur 10.16
+#>
+#> Matrix products: default
+#> BLAS:   /Library/Frameworks/R.framework/Versions/4.1/Resources/lib/libRblas.0.dylib
+#> LAPACK: /Library/Frameworks/R.framework/Versions/4.1/Resources/lib/libRlapack.dylib
+#>
+#> locale:
+#> [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
+#>
+#> attached base packages:
+#> [1] stats     graphics  grDevices datasets  utils     methods
+#> [7] base
+#>
+#> other attached packages:
+#>  [1] forcats_0.5.1        stringr_1.4.0        dplyr_1.0.7
+#>  [4] purrr_0.3.4          readr_2.0.1          tidyr_1.1.3
+#>  [7] tibble_3.1.3         tidyverse_1.3.1      tidybayes_3.0.1
+#> [10] rstan_2.21.2         ggplot2_3.3.5        StanHeaders_2.21.0-7
+#> [13] glue_1.4.2
+#>
+#> loaded via a namespace (and not attached):
+#>  [1] matrixStats_0.61.0   fs_1.5.0             lubridate_1.7.10
+#>  [4] webshot_0.5.2        httr_1.4.2           rprojroot_2.0.2
+#>  [7] tensorA_0.36.2       tools_4.1.2          backports_1.2.1
+#> [10] bslib_0.2.5.1        utf8_1.2.2           R6_2.5.0
+#> [13] DBI_1.1.1            colorspace_2.0-2     ggdist_3.0.0
+#> [16] withr_2.4.2          tidyselect_1.1.1     gridExtra_2.3
+#> [19] prettyunits_1.1.1    processx_3.5.2       downlit_0.2.1
+#> [22] curl_4.3.2           compiler_4.1.2       cli_3.0.1
+#> [25] rvest_1.0.1          arrayhelpers_1.1-0   xml2_1.3.2
+#> [28] labeling_0.4.2       posterior_1.1.0      sass_0.4.0
+#> [31] scales_1.1.1         checkmate_2.0.0      aaltobda_0.3.1
+#> [34] callr_3.7.0          systemfonts_1.0.3    digest_0.6.27
+#> [37] svglite_2.0.0        rmarkdown_2.10       pkgconfig_2.0.3
+#> [40] htmltools_0.5.1.1    highr_0.9            dbplyr_2.1.1
+#> [43] rlang_0.4.11         readxl_1.3.1         rstudioapi_0.13
+#> [46] jquerylib_0.1.4      farver_2.1.0         generics_0.1.0
+#> [49] svUnit_1.0.6         jsonlite_1.7.2       distill_1.2
+#> [52] distributional_0.2.2 inline_0.3.19        magrittr_2.0.1
+#> [55] kableExtra_1.3.4     loo_2.4.1            Rcpp_1.0.7
+#> [58] munsell_0.5.0        fansi_0.5.0          abind_1.4-5
+#> [61] lifecycle_1.0.0      stringi_1.7.3        yaml_2.2.1
+#> [64] pkgbuild_1.2.0       grid_4.1.2           parallel_4.1.2
+#> [67] crayon_1.4.1         lattice_0.20-45      haven_2.4.3
+#> [70] hms_1.1.0            knitr_1.33           ps_1.6.0
+#> [73] pillar_1.6.2         codetools_0.2-18     clisymbols_1.2.0
+#> [76] stats4_4.1.2         reprex_2.0.1         evaluate_0.14
+#> [79] V8_3.4.2             renv_0.14.0          RcppParallel_5.1.4
+#> [82] modelr_0.1.8         vctrs_0.3.8          tzdb_0.1.2
+#> [85] cellranger_1.1.0     gtable_0.3.0         assertthat_0.2.1
+#> [88] xfun_0.25            broom_0.7.9          coda_0.19-4
+#> [91] viridisLite_0.4.0    ellipsis_0.3.2       here_1.0.1
+
+
+ + +
+ +
+
+ + + + + +
+ + + + + + + diff --git a/docs/index.html b/docs/index.html index 76dbf3e..7edb52b 100644 --- a/docs/index.html +++ b/docs/index.html @@ -2394,6 +2394,7 @@

${suggestion.title}

Section 7. Hierarchical models and exchangeability Section 8. Model checking & Cross-validation Section 9. Model comparison and selection +Section 10. Decision analysis About @@ -2541,6 +2543,12 @@

Sections

(none) assignment 8 + +10. Decision analysis +notes +(none) +assignment 9 +

Stan models

@@ -2552,7 +2560,7 @@

Stan models

diff --git a/docs/models/assignment07_factories_hierarchical.rds b/docs/models/assignment07_factories_hierarchical.rds index 3a754f2..67c8997 100644 Binary files a/docs/models/assignment07_factories_hierarchical.rds and b/docs/models/assignment07_factories_hierarchical.rds differ diff --git a/docs/models/assignment07_factories_hierarchical.stan b/docs/models/assignment07_factories_hierarchical.stan index 2a5c15f..d01f9a0 100644 --- a/docs/models/assignment07_factories_hierarchical.stan +++ b/docs/models/assignment07_factories_hierarchical.stan @@ -28,12 +28,19 @@ model { generated quantities { // Compute the predictive distribution for the sixth machine. - real y6pred; + real y6pred; // Leave for compatibility with earlier assignments. + vector[J] ypred; real mu7pred; + real y7pred; vector[J] log_lik[N]; y6pred = normal_rng(mu[6], sigma); + for (j in 1:J) { + ypred[j] = normal_rng(mu[j], sigma); + } + mu7pred = normal_rng(alpha, tau); + y7pred = normal_rng(mu7pred, sigma); for (j in 1:J) { for (n in 1:N) { diff --git a/docs/notes/10_decision-analysis_bda3-9.Rmd b/docs/notes/10_decision-analysis_bda3-9.Rmd new file mode 100644 index 0000000..dab9ef3 --- /dev/null +++ b/docs/notes/10_decision-analysis_bda3-9.Rmd @@ -0,0 +1,67 @@ +--- +title: "10. Decision analysis" +date: "2021-11-15" +output: distill::distill_article +--- + +```{r setup, include=FALSE} +knitr::opts_chunk$set(echo = TRUE, dpi = 300, comment = "#>") +``` + +## Resources + +- reading + - BDA3 chapter 9 + - [reading instructions](../reading-instructions/BDA3_ch09_reading-instructions.pdf) +- lectures: + - ['10.1 Decision analysis'](https://aalto.cloud.panopto.eu/Panopto/Pages/Viewer.aspx?id=82943720-de0f-4195-8639-ab0900ca2085) +- slides: + - [Lecture 10.1](../slides/slides_ch9.pdf) +- [Assignment 9](assignments/assignment-09.pdf) + +## Notes + +### Reading instructions + +- outline of chapter 9 + - 9.1 Context and basic steps (most important part) + - 9.2 Example + - 9.3 Multistage decision analysis (you may skip this example) + - 9.4 Hierarchical decision analysis (you may skip this example) + - 9.5 Personal vs. institutional decision analysis (important) +- the lectures have simpler examples and discuss some challenges in selecting utilities or costs +- ch 7 discusses how model selection con be considered as a decision problem + +### Chapter 9. Decision analysis + +- how can inferences be used in decision making? +- examples in this chapter: + 1. section 9.2: simple example with hierarchical model on how incentives affect survey response rates + - compare expected response rates of various incentive structures to their expected cost + 2. section 9.3: option of performing a diagnostic test before deciding on a treatment for cancer + - example of "value of information" and balancing risks of the screening test against the information it would provide + 3. section 9.4: decision and utility analysis of the risk of radon exposure + - cost of measurement and fixing high exposure + - example of a full integration if inference with decision analysis + +#### 9.1 Bayesian decision theory in difference contexts + +- use Bayesian inference in two ways when balancing costs and benefits of decision options under uncertainty: + 1. a decision depends on the predicted quantities which depend on the parameters of the model and type of data + 2. use Bayesian inference within a decisions analysis to estimate outcomes conditional on information from previous decisions + +##### Bayesian inference and decision trees + +- decision analysis involves optimization over decisions and uncertainties +- **Bayesian decision analysis** is defined as the following steps: + 1. Enumerate the space of all possible decisions $d$ and outcomes $x$. + 2. Determine the probability distribution of $x$ for each decision option $d$. + 3. Define a *utility function* $U(x)$ mapping outcomes onto real numbers (values of interest). + 4. Compute the expected utility $\text{E}(U(x)|d)$ as a function of the decision $d$ and choose the decision with the highest expected utility. +- often, we only do the first two steps and the rest is left to the "decision makers" + +### Lecture notes + +#### 10.1 Decision analysis + +(no new notes) diff --git a/docs/notes/10_decision-analysis_bda3-9.html b/docs/notes/10_decision-analysis_bda3-9.html new file mode 100644 index 0000000..5c31f92 --- /dev/null +++ b/docs/notes/10_decision-analysis_bda3-9.html @@ -0,0 +1,2596 @@ + + + + + + + + + + + + + + + + + + + + 10. Decision analysis + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

10. Decision analysis

+ + + +
+ + +
+

Resources

+ +

Notes

+

Reading instructions

+ +

Chapter 9. Decision analysis

+ +

9.1 Bayesian decision theory in difference contexts

+ +
Bayesian inference and decision trees
+ +

Lecture notes

+

10.1 Decision analysis

+

(no new notes)

+
+ + +
+ +
+
+ + + + + + + +
+ + + + + + + diff --git a/docs/project-guidelines.html b/docs/project-guidelines.html index 5db2374..11891ba 100644 --- a/docs/project-guidelines.html +++ b/docs/project-guidelines.html @@ -2316,6 +2316,7 @@

${suggestion.title}

Section 7. Hierarchical models and exchangeability Section 8. Model checking & Cross-validation Section 9. Model comparison and selection +Section 10. Decision analysis About diff --git a/docs/reading-instructions/BDA3_ch09_reading-instructions.pdf b/docs/reading-instructions/BDA3_ch09_reading-instructions.pdf new file mode 100644 index 0000000..993a0e2 Binary files /dev/null and b/docs/reading-instructions/BDA3_ch09_reading-instructions.pdf differ diff --git a/docs/search.json b/docs/search.json index 52a92e2..4029a82 100644 --- a/docs/search.json +++ b/docs/search.json @@ -6,7 +6,7 @@ "description": "Some additional details about the website", "author": [], "contents": "\n\n\n\n", - "last_modified": "2021-11-15T06:52:21-05:00" + "last_modified": "2021-11-18T07:09:12-05:00" }, { "path": "index.html", @@ -18,14 +18,14 @@ "url": "https://joshuacook.netlify.app" } ], - "contents": "\nResources\nCourse website\n2021 Schedule\nGitHub repo (my fork)\nBayesian Data Analysis (3e) (BDA3) (exercise solutions)\nChapter Notes\nVideo lectures or individually lists here\nLecture slides\nHow to study\n\nThe following are recommendations from the course creators on how to take the course.\n\nThe recommended way to go through the material is:\nRead the reading instructions for a chapter in the chapter notes.\nRead the chapter in BDA3 and check that you find the terms listed in the reading instructions.\nWatch the corresponding video lecture to get explanations for most important parts.\nRead corresponding additional information in the chapter notes.\nRun the corresponding demos in R demos or Python demos.\nRead the exercise instructions and make the corresponding assignments. Demo codes in R demos and Python demos have a lot of useful examples for handling data and plotting figures. If you have problems, visit TA sessions or ask in course slack channel.\nIf you want to learn more, make also self study exercises listed below.\nSections\nSection\nNotes\nBook exercises\nAssignments\n1. Course Introduction\nnotes\nexercises\nassignment 1\n2. Basics of Bayesian Inference\nnotes\nexercises\nassignment 2\n3. Multidimensional Posterior\nnotes\nexercises\nassignment 3\n4. Monte Carlo\nnotes\nexercises\nassignment 4\n5. Markov chain Monte Carlo\nnotes\nexercises\nassignment 5\n6. HMC, NUTS, and Stan\nnotes\nexercises\nassignment 6\n7. Hierarchical models and exchangeability\nnotes\nexercises\nassignment 7\n8. Model checking & Cross-validation\nnotes\n(none)\n(none)\n9. Model comparison and selection\nnotes\n(none)\nassignment 8\nStan models\nDrug bioassay model for Assignment 6\n8 school SAT model\nDrownings for Assignment 7\nFactory machine measurements for Assignments 7 & 8:\npooled\nseparate\nhierarchical\n\n", - "last_modified": "2021-11-15T06:52:22-05:00" + "contents": "\nResources\nCourse website\n2021 Schedule\nGitHub repo (my fork)\nBayesian Data Analysis (3e) (BDA3) (exercise solutions)\nChapter Notes\nVideo lectures or individually lists here\nLecture slides\nHow to study\n\nThe following are recommendations from the course creators on how to take the course.\n\nThe recommended way to go through the material is:\nRead the reading instructions for a chapter in the chapter notes.\nRead the chapter in BDA3 and check that you find the terms listed in the reading instructions.\nWatch the corresponding video lecture to get explanations for most important parts.\nRead corresponding additional information in the chapter notes.\nRun the corresponding demos in R demos or Python demos.\nRead the exercise instructions and make the corresponding assignments. Demo codes in R demos and Python demos have a lot of useful examples for handling data and plotting figures. If you have problems, visit TA sessions or ask in course slack channel.\nIf you want to learn more, make also self study exercises listed below.\nSections\nSection\nNotes\nBook exercises\nAssignments\n1. Course Introduction\nnotes\nexercises\nassignment 1\n2. Basics of Bayesian Inference\nnotes\nexercises\nassignment 2\n3. Multidimensional Posterior\nnotes\nexercises\nassignment 3\n4. Monte Carlo\nnotes\nexercises\nassignment 4\n5. Markov chain Monte Carlo\nnotes\nexercises\nassignment 5\n6. HMC, NUTS, and Stan\nnotes\nexercises\nassignment 6\n7. Hierarchical models and exchangeability\nnotes\nexercises\nassignment 7\n8. Model checking & Cross-validation\nnotes\n(none)\n(none)\n9. Model comparison and selection\nnotes\n(none)\nassignment 8\n10. Decision analysis\nnotes\n(none)\nassignment 9\nStan models\nDrug bioassay model for Assignment 6\n8 school SAT model\nDrownings for Assignment 7\nFactory machine measurements for Assignments 7 & 8:\npooled\nseparate\nhierarchical (also used in Assignment 9)\n\n", + "last_modified": "2021-11-18T07:09:13-05:00" }, { "path": "project-guidelines.html", "author": [], "contents": "\nProject work details\nProject work involves choosing a data set and performing a whole analysis according to all the parts of Bayesian workflow studied along the course.\nThe project work is meant to be done in period II.\nIn the beginning of the period II\nForm a group. We prefer groups of 3, but the project can be done in groups of 1-2.\nSelect a topic. You may ask in the course chat channel #project for opinion whether it’s a good topic and a good dataset. You can change the topic later.\nStart planning.\n\n\nThe main work for the project and the presentation will be done in the second half of the period II after all the workflow parts have been discussed in the course.\nThe online presentations will be made on the evaluation week after period II.\nAll suspected plagiarism will be reported and investigated. See more about the Aalto University Code of Academic Integrity and Handling Violations Thereof.\nProject schedule\nForm a group and pick a topic. Register the group before end of 8th Nov, 2021.\nGroups of 3 can reserve a presentation slot starting TBA.\nGroups of 2 can reserve a presentation slot starting TBA.\nGroups of 1 can reserve a presentation slot starting TBA.\nGroups that register late can reserve a presentation slot starting TBA.\nWork on the project. TA session queue is also for project questions.\nProject report deadline December 6, 2021. Submit in peergrade (separate “class”, the link will be added to MyCourses).\nProject report peer grading December 7-9, 2021 (so that you’ll get feedback for the report before the presentations).\nProject presentations December 13-17, 2021.\nGroups\nProject work is done in groups of 1-3 persons. Preferred group size is 3, because you learn more when you talk about the project with someone else.\nIf you don’t have a group, you can ask other students in the group chat channel #project. Tell what kind of data you are interested in (e.g. medicine, health, biological, engineering, political, business), whether you prefer R or Python, and whether you have already more concrete idea for the topic.\nGroups of 3 students can choose their presentation time slot before 1-2 student groups. 3 person group is expected to do a bit more work than 1-2 person groups.\nYou can do the project alone, but the amount of work is expected to the same for 2 person groups.\nTA sessions\nThe groups will get help for the project work in TA sessions. When there are no weekly assignments, the TA sessions are still organized for helping in the project work.\nEvaluation\nThe project work’s evaluation consists of:\npeer-graded project report (40%) (within peergrade submission 80% and feedack 20%)\npresentation and oral exam graded by the course staff (60%)\nclarity of slides + use of figures\nclarity of oral presentation + flow of the presentation\nall required parts included (not necessarily all in main slides, but it needs to be clear that all required steps were performed)\naccuracy of use of terms (oral exam)\nresponses to questions (oral exam)\n\nProject report\nIn the project report you practice presenting the problem and data analysis results, which means that minimal listing of code and figures is not a good report. There are different levels for how data analysis project could be reported. This report should be more than a summary of results without workflow steps. While describing the steps and decisions made during the workflow, to keep the report readable some of the diagnostic outputs and code can be put in the appendix. If you are uncertain you can ask TAs in TA sessions whether you are on a good level of amount of details.\nThe report should include\nIntroduction describing\nthe motivation\nthe problem\nand the main modeling idea.\nShowing some illustrative figure is recommended.\nDescription of the data and the analysis problem. Provide information where the data was obtained, and if it has been previously used in some online case study and how your analysis differs from the existing analyses.\nDescription of at least two models, for example:\nnon hierarchical and hierarchical,\nlinear and non linear,\nvariable selection with many models.\nInformative or weakly informative priors, and justification of their choices.\nStan, rstanarm or brms code.\nHow to the Stan model was run, that is, what options were used. This is also more clear as combination of textual explanation and the actual code line.\nConvergence diagnostics (\\(\\widehat{R}\\), ESS, divergences) and what was done if the convergence was not good with the first try.\nPosterior predictive checks and what was done to improve the model.\nModel comparison (e.g. with LOO-CV).\nPredictive performance assessment if applicable (e.g. classification accuracy) and evaluation of practical usefulness of the accuracy.\nSensitivity analysis with respect to prior choices (i.e. checking whether the result changes a lot if prior is changed)\nDiscussion of issues and potential improvements.\nConclusion what was learned from the data analysis.\nSelf-reflection of what the group learned while making the project.\nProject presentation\nIn addition to the submitted report, each project must be presented by the authoring group, according to the following guidelines:\nThe presentation should be high level but sufficiently detailed information should be readily available to help answering questions from the audience.\nThe duration of the presentation should be 10 minutes (groups of 1-2 students) or 15 minutes (groups of 3 students).\nAt the end of the presentation there will be an extra 5-10 minutes of questions by anyone in the audience or two members of the course staff who are present. The questions from lecturer/TAs can be considered as an oral exam questions, and if answers to these questions reveal weak knowledge of the methods and workflow steps which should be part of the project, that can reduce the grade.\nGrading will be done by the two members of the course staff using standardized grading instructions.\nSpecific recommendations for the presentations include:\nThe first slide should include project’s title and group members’ names.\nThe chosen statistical model(s), including observation model and priors, must be explained and justified,\nMake sure the font is big enough to be easily readable by the audience. This includes figure captions, legends and axis information,\nThe last slide should be a summary or take-home-messages and include contact information or link to a further information. (The grade will be reduced by one if the last slide has only something like “Thank you” or “Questions?”),\nIn general, the best presentations are often given by teams that have frequently attended TA sessions and gotten feedback, so we strongly recommend attending these sessions.\nMore details on the presentation sessions\nIf you don’t have microphone or video camera (e.g. in your laptop or mobile phone) then we’ll arrange your presentation on campus in period III.\nIf you reserved a presentation slot but need to cancel, do it asap.\nZoom meeting link for all time slots available in the course chat.\nAs we have many presentation in each slot join the meeting in time. Late arrivals will lower the grade. Very late arrivals will fail the presentation and can present in period III.\nPresenting group needs to have video and audio on.\nIt is easiest if just one from the group shares the slides, but it is expected that all group members present some part of the presentation orally.\nPresentation time is 10 min for 1-2 person groups and 15min for 3 person groups\nTime limit is strict. It’s good idea to practice the talk so that you get the timing right. Staff will announce 2min and 1min left and time ended. Going overtime reduces the grade.\nAfter the presentation there will be 5min for questions, answers, and feedback.\nEach student has to come up with at least one question during the session. Students can ask more questions. Questions by students are posted in chat, and they can be posted already during the presentation.\nStaff Will ask further questions (kind of oral exam)\nGrading of the project presentation takes int account\nclarity of slides + use of figures\nclarity of oral presentation + flow of the presentation\nall required parts included (not necessarily all in main slides, but it needs to be clear that all required steps were performed)\naccuracy of use of terms (oral exam)\nresponses to questions (oral exam)\n\nStudents will also self-evaluate their project. After the presentation each student who just presented sends a private message to one of the staff members with a self evaluation grade from themselves and for each group member (if applicable).\nData sets\nAs some data sets have been overused for these particular goals, note that the following ones are forbidden in this work (more can be added to this list so make sure to check it regularly):\nextremly common data sets like titanic, mtcars, iris\nBaseball batting (used by Bob Carpenter’s StanCon case study).\nData sets used in the course demos\nIt’s best to use a dataset for which there is no ready made analysis in internet, but if you choose a dataset used already in some online case study, provide the link to previous studies and report how your analysis differs from those (for example if someone has made non-Bayesian analysis and you do the full Bayesian analysis).\nDepending on the model and the structure of the data, a good data set would have more than 100 observations but less than 1 million. If you know an interesting big data set, you can use a smaller subset of the data to keep the computation times feasible. It would be good that the data has some structure, so that it is sensible to use multilevel/hierarchical models.\nModel requirements\nEvery parameter needs to have an explicit proper prior. Improper flat priors are not allowed.\nA hierarchical model is a model where the prior of certain parameter contain other parameters that are also estimated in the model. For instance, b ~ normal(mu, sigma), mu ~ normal(0, 1), sigma ~ exponential(1).\nDo not impose hard constrains on a parameter unless they are natural to them. uniform(a, b) should not be used unless the boundaries are really logical boundaries and values beyond the boundaries are completely impossible.\nAt least some models should include covariates. Modelling the outcome without predictors is likely too simple for the project.\nbrms can be used, but the Stan code must be included, briefly commented, and all priors need to be checked from the Stan code and adjusted to be weakly informative based on some justified explanation.\nSome examples\nThe following case study examples demonstrate how text, equations, figures, and code, and inference results can be included in one report. These examples don’t necessarily have all the workflow steps required in your report, but different steps are illustrated in different case studies and you can get good ideas for your report just by browsing through them.\nBDA R and Python demos are quite minimal in description of the data and discussion of the results, but show many diagnostics and basic plots.\nSome Stan case studies focus on some specific methods, but there are many case studies that are excellent examples for this course. They don’t include all the steps required in this course, but are good examples of writing. Some of them are longer or use more advanced models than required in this course.\nBayesian workflow for disease transmission modeling in Stan\nModel-based Inference for Causal Effects in Completely Randomized Experments\nTagging Basketball Events with HMM in Stan\nModel building and expansion for golf putting\nA Dyadic Item Response Theory Model\nPredator-Prey Population Dynamics: the Lotka-Volterra model in Stan\nSome StanCon case studies (scroll down) can also provide good project ideas.\n", - "last_modified": "2021-11-15T06:52:23-05:00" + "last_modified": "2021-11-18T07:09:14-05:00" } ], "collections": [] diff --git a/docs/sitemap.xml b/docs/sitemap.xml index f2f8a50..af36cec 100644 --- a/docs/sitemap.xml +++ b/docs/sitemap.xml @@ -6,10 +6,10 @@ https://jhrcook.github.io/bayesian-data-analysis-course/ - 2021-11-15T06:52:12-05:00 + 2021-11-18T07:07:17-05:00 https://jhrcook.github.io/bayesian-data-analysis-course/project-guidelines.html - 2021-11-15T06:52:23-05:00 + 2021-11-18T07:09:13-05:00 diff --git a/docs/slides/slides_ch9.pdf b/docs/slides/slides_ch9.pdf new file mode 100644 index 0000000..4ee744e Binary files /dev/null and b/docs/slides/slides_ch9.pdf differ diff --git a/index.Rmd b/index.Rmd index 8c8ff21..04b4a86 100644 --- a/index.Rmd +++ b/index.Rmd @@ -53,6 +53,7 @@ The recommended way to go through the material is: | **7. Hierarchical models and exchangeability** | [notes](notes/07_hierarchical-models_bda3-5.html) | [exercises](book-exercises/bda-exercises-07.html) | [assignment 7](assignments/jhcook-assignment-07.html) | | **8. Model checking & Cross-validation** | [notes](notes/08_model-checking-and-cv_bda3-6-7.html) | (none) | (none) | | **9. Model comparison and selection** | [notes](notes/09_model-selection_bda3-7.html) | (none) | [assignment 8](assignments/jhcook-assignment-08.html) | +| **10. Decision analysis** | [notes](notes/10_decision-analysis_bda3-9.html) | (none) | [assignment 9](assignments/jhcook-assignment-09.html) | ## Stan models @@ -62,4 +63,4 @@ The recommended way to go through the material is: - Factory machine measurements for Assignments 7 & 8: - [pooled](models/assignment07_factories_pooled.stan) - [separate](models/assignment07_factories_separate.stan) - - [hierarchical](models/assignment07_factories_hierarchical.stan) + - [hierarchical](models/assignment07_factories_hierarchical.stan) (also used in Assignment 9) diff --git a/models/assignment07_factories_hierarchical.rds b/models/assignment07_factories_hierarchical.rds index 3a754f2..67c8997 100644 Binary files a/models/assignment07_factories_hierarchical.rds and b/models/assignment07_factories_hierarchical.rds differ diff --git a/models/assignment07_factories_hierarchical.stan b/models/assignment07_factories_hierarchical.stan index 2a5c15f..d01f9a0 100644 --- a/models/assignment07_factories_hierarchical.stan +++ b/models/assignment07_factories_hierarchical.stan @@ -28,12 +28,19 @@ model { generated quantities { // Compute the predictive distribution for the sixth machine. - real y6pred; + real y6pred; // Leave for compatibility with earlier assignments. + vector[J] ypred; real mu7pred; + real y7pred; vector[J] log_lik[N]; y6pred = normal_rng(mu[6], sigma); + for (j in 1:J) { + ypred[j] = normal_rng(mu[j], sigma); + } + mu7pred = normal_rng(alpha, tau); + y7pred = normal_rng(mu7pred, sigma); for (j in 1:J) { for (n in 1:N) { diff --git a/notes/10_decision-analysis_bda3-9.Rmd b/notes/10_decision-analysis_bda3-9.Rmd new file mode 100644 index 0000000..dab9ef3 --- /dev/null +++ b/notes/10_decision-analysis_bda3-9.Rmd @@ -0,0 +1,67 @@ +--- +title: "10. Decision analysis" +date: "2021-11-15" +output: distill::distill_article +--- + +```{r setup, include=FALSE} +knitr::opts_chunk$set(echo = TRUE, dpi = 300, comment = "#>") +``` + +## Resources + +- reading + - BDA3 chapter 9 + - [reading instructions](../reading-instructions/BDA3_ch09_reading-instructions.pdf) +- lectures: + - ['10.1 Decision analysis'](https://aalto.cloud.panopto.eu/Panopto/Pages/Viewer.aspx?id=82943720-de0f-4195-8639-ab0900ca2085) +- slides: + - [Lecture 10.1](../slides/slides_ch9.pdf) +- [Assignment 9](assignments/assignment-09.pdf) + +## Notes + +### Reading instructions + +- outline of chapter 9 + - 9.1 Context and basic steps (most important part) + - 9.2 Example + - 9.3 Multistage decision analysis (you may skip this example) + - 9.4 Hierarchical decision analysis (you may skip this example) + - 9.5 Personal vs. institutional decision analysis (important) +- the lectures have simpler examples and discuss some challenges in selecting utilities or costs +- ch 7 discusses how model selection con be considered as a decision problem + +### Chapter 9. Decision analysis + +- how can inferences be used in decision making? +- examples in this chapter: + 1. section 9.2: simple example with hierarchical model on how incentives affect survey response rates + - compare expected response rates of various incentive structures to their expected cost + 2. section 9.3: option of performing a diagnostic test before deciding on a treatment for cancer + - example of "value of information" and balancing risks of the screening test against the information it would provide + 3. section 9.4: decision and utility analysis of the risk of radon exposure + - cost of measurement and fixing high exposure + - example of a full integration if inference with decision analysis + +#### 9.1 Bayesian decision theory in difference contexts + +- use Bayesian inference in two ways when balancing costs and benefits of decision options under uncertainty: + 1. a decision depends on the predicted quantities which depend on the parameters of the model and type of data + 2. use Bayesian inference within a decisions analysis to estimate outcomes conditional on information from previous decisions + +##### Bayesian inference and decision trees + +- decision analysis involves optimization over decisions and uncertainties +- **Bayesian decision analysis** is defined as the following steps: + 1. Enumerate the space of all possible decisions $d$ and outcomes $x$. + 2. Determine the probability distribution of $x$ for each decision option $d$. + 3. Define a *utility function* $U(x)$ mapping outcomes onto real numbers (values of interest). + 4. Compute the expected utility $\text{E}(U(x)|d)$ as a function of the decision $d$ and choose the decision with the highest expected utility. +- often, we only do the first two steps and the rest is left to the "decision makers" + +### Lecture notes + +#### 10.1 Decision analysis + +(no new notes) diff --git a/notes/10_decision-analysis_bda3-9.html b/notes/10_decision-analysis_bda3-9.html new file mode 100644 index 0000000..5c31f92 --- /dev/null +++ b/notes/10_decision-analysis_bda3-9.html @@ -0,0 +1,2596 @@ + + + + + + + + + + + + + + + + + + + + 10. Decision analysis + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

10. Decision analysis

+ + + +
+ + +
+

Resources

+ +

Notes

+

Reading instructions

+ +

Chapter 9. Decision analysis

+ +

9.1 Bayesian decision theory in difference contexts

+ +
Bayesian inference and decision trees
+ +

Lecture notes

+

10.1 Decision analysis

+

(no new notes)

+
+ + +
+ +
+
+ + + + + + + +
+ + + + + + + diff --git a/reading-instructions/BDA3_ch09_reading-instructions.pdf b/reading-instructions/BDA3_ch09_reading-instructions.pdf new file mode 100644 index 0000000..993a0e2 Binary files /dev/null and b/reading-instructions/BDA3_ch09_reading-instructions.pdf differ diff --git a/slides/slides_ch9.pdf b/slides/slides_ch9.pdf new file mode 100644 index 0000000..4ee744e Binary files /dev/null and b/slides/slides_ch9.pdf differ