|
1 |
| ---- |
2 |
| -title : Introduction to statistical inference |
3 |
| -subtitle : Statistical inference |
4 |
| -author : Brian Caffo, Jeff Leek, Roger Peng |
5 |
| -job : Johns Hopkins Bloomberg School of Public Health |
6 |
| -logo : bloomberg_shield.png |
7 |
| -framework : io2012 # {io2012, html5slides, shower, dzslides, ...} |
8 |
| -highlighter : highlight.js # {highlight.js, prettify, highlight} |
9 |
| -hitheme : tomorrow # |
10 |
| -url: |
11 |
| - lib: ../../librariesNew |
12 |
| - assets: ../../assets |
13 |
| -widgets : [mathjax] # {mathjax, quiz, bootstrap} |
14 |
| -mode : selfcontained # {standalone, draft} |
15 |
| ---- |
16 |
| -## Statistical inference defined |
17 |
| - |
18 |
| -Statistical inference is the process of drawing formal conclusions from |
19 |
| -data. |
20 |
| - |
21 |
| -In our class, we wil define formal statistical inference as settings where one wants to infer facts about a population using noisy |
22 |
| -statistical data where uncertainty must be accounted for. |
23 |
| - |
24 |
| ---- |
25 |
| - |
26 |
| -## Motivating example: who's going to win the election? |
27 |
| - |
28 |
| -In every major election, pollsters would like to know, ahead of the |
29 |
| -actual election, who's going to win. Here, the target of |
30 |
| -estimation (the estimand) is clear, the percentage of people in |
31 |
| -a particular group (city, state, county, country or other electoral |
32 |
| -grouping) who will vote for each candidate. |
33 |
| - |
34 |
| -We can not poll everyone. Even if we could, some polled |
35 |
| -may change their vote by the time the election occurs. |
36 |
| -How do we collect a reasonable subset of data and quantify the |
37 |
| -uncertainty in the process to produce a good guess at who will win? |
38 |
| - |
39 |
| ---- |
40 |
| - |
41 |
| -## Motivating example: is hormone replacement therapy effective? |
42 |
| - |
43 |
| -A large clinical trial (the Women’s Health Initiative) published results in 2002 that contradicted prior evidence on the efficacy of hormone replacement therapy for post menopausal women and suggested a negative impact of HRT for several key health outcomes. **Based on a statistically based protocol, the study was stopped early due an excess number of negative events.** |
44 |
| - |
45 |
| -Here's there's two inferential problems. |
46 |
| - |
47 |
| -1. Is HRT effective? |
48 |
| -2. How long should we continue the trial in the presence of contrary |
49 |
| -evidence? |
50 |
| - |
51 |
| -See WHI writing group paper JAMA 2002, Vol 288:321 - 333. for the paper and Steinkellner et al. Menopause 2012, Vol 19:616 621 for adiscussion of the long term impacts |
52 |
| - |
53 |
| ---- |
54 |
| - |
55 |
| -## Motivating example: ECMO |
56 |
| - |
57 |
| -In 1985 a group at a major neonatal intensive care center published the results of a trial comparing a standard treatment and a promising new extracorporeal membrane oxygenation treatment (ECMO) for newborn infants with severe respiratory failure. **Ethical considerations lead to a statistical randomization scheme whereby one infant received the control therapy, thereby opening the study to sample-size based criticisms.** |
58 |
| - |
59 |
| -For a review and statistical discussion, see Royall Statistical Science 1991, Vol 6, No. 1, 52-88 |
60 |
| - |
61 |
| ---- |
62 |
| - |
63 |
| -## Summary |
64 |
| - |
65 |
| -- These examples illustrate many of the difficulties of trying |
66 |
| -to use data to create general conclusions about a population. |
67 |
| -- Paramount among our concerns are: |
68 |
| - - Is the sample representative of the population that we'd like to draw inferences about? |
69 |
| - - Are there known and observed, known and unobserved or unknown and unobserved variables that contaminate our conclusions? |
70 |
| - - Is there systematic bias created by missing data or the design or conduct of the study? |
71 |
| - - What randomness exists in the data and how do we use or adjust for it? Here randomness can either be explicit via randomization |
72 |
| -or random sampling, or implicit as the aggregation of many complex uknown processes. |
73 |
| - - Are we trying to estimate an underlying mechanistic model of phenomena under study? |
74 |
| -- Statistical inference requires navigating the set of assumptions and |
75 |
| -tools and subsequently thinking about how to draw conclusions from data. |
76 |
| - |
77 |
| ---- |
78 |
| -## Example goals of inference |
79 |
| - |
80 |
| -1. Estimate and quantify the uncertainty of an estimate of |
81 |
| -a population quantity (the proportion of people who will |
82 |
| - vote for a candidate). |
83 |
| -2. Determine whether a population quantity |
84 |
| - is a benchmark value ("is the treatment effective?"). |
85 |
| -3. Infer a mechanistic relationship when quantities are measured with |
86 |
| - noise ("What is the slope for Hooke's law?") |
87 |
| -4. Determine the impact of a policy? ("If we reduce polution levels, |
88 |
| - will asthma rates decline?") |
89 |
| - |
90 |
| - |
91 |
| ---- |
92 |
| -## Example tools of the trade |
93 |
| - |
94 |
| -1. Randomization: concerned with balancing unobserved variables that may confound inferences of interest |
95 |
| -2. Random sampling: concerned with obtaining data that is representative |
96 |
| -of the population of interest |
97 |
| -3. Sampling models: concerned with creating a model for the sampling |
98 |
| -process, the most common is so called "iid". |
99 |
| -4. Hypothesis testing: concerned with decision making in the presence of uncertainty |
100 |
| -5. Confidence intervals: concerned with quantifying uncertainty in |
101 |
| -estimation |
102 |
| -6. Probability models: a formal connection between the data and a population of interest. Often probability models are assumed or are |
103 |
| -approximated. |
104 |
| -7. Study design: the process of designing an experiment to minimize biases and variability. |
105 |
| -8. Nonparametric bootstrapping: the process of using the data to, |
106 |
| - with minimal probability model assumptions, create inferences. |
107 |
| -9. Permutation, randomization and exchangeability testing: the process |
108 |
| -of using data permutations to perform inferences. |
109 |
| - |
110 |
| ---- |
111 |
| -## Different thinking about probability leads to different styles of inference |
112 |
| - |
113 |
| -We won't spend too much time talking about this, but there are several different |
114 |
| -styles of inference. Two broad categories that get discussed a lot are: |
115 |
| - |
116 |
| -1. Frequency probability: is the long run proportion of |
117 |
| - times an event occurs in independent, identically distributed |
118 |
| - repetitions. |
119 |
| -2. Frequency inference: uses frequency interpretations of probabilities |
120 |
| -to control error rates. Answers questions like "What should I decide |
121 |
| -given my data controlling the long run proportion of mistakes I make at |
122 |
| -a tolerable level." |
123 |
| -3. Bayesian probability: is the probability calculus of beliefs, given that beliefs follow certain rules. |
124 |
| -4. Bayesian inference: the use of Bayesian probability representation |
125 |
| -of beliefs to perform inference. Answers questions like "Given my subjective beliefs and the objective information from the data, what |
126 |
| -should I believe now?" |
127 |
| - |
128 |
| -Data scientists tend to fall within shades of gray of these and various other schools of inference. |
129 |
| - |
130 |
| ---- |
131 |
| -## In this class |
132 |
| - |
133 |
| -* In this class, we will primarily focus on basic sampling models, |
134 |
| -basic probability models and frequency style analyses |
135 |
| -to create standard inferences. |
136 |
| -* Being data scientists, we will also consider some inferential strategies that rely heavily on the observed data, such as permutation testing |
137 |
| -and bootstrapping. |
138 |
| -* As probability modeling will be our starting point, we first build |
139 |
| -up basic probability. |
140 |
| - |
141 |
| ---- |
142 |
| -## Where to learn more on the topics not covered |
143 |
| - |
144 |
| -1. Explicit use of random sampling in inferences: look in references |
145 |
| -on "finite population statistics". Used heavily in polling and |
146 |
| -sample surveys. |
147 |
| -2. Explicit use of randomization in inferences: look in references |
148 |
| -on "causal inference" especially in clinical trials. |
149 |
| -3. Bayesian probability and Bayesian statistics: look for basic itroductory books (there are many). |
150 |
| -4. Missing data: well covered in biostatistics and econometric |
151 |
| -references; look for references to "multiple imputation", a popular tool for |
152 |
| -addressing missing data. |
153 |
| -5. Study design: consider looking in the subject matter area that |
154 |
| - you are interested in; some examples with rich histories in design: |
155 |
| - 1. The epidemiological literature is very focused on using study design to investigate public health. |
156 |
| - 2. The classical development of study design in agriculture broadly covers design and design principles. |
157 |
| - 3. The industrial quality control literature covers design thoroughly. |
158 |
| - |
| 1 | +--- |
| 2 | +title : Introduction to statistical inference |
| 3 | +subtitle : Statistical inference |
| 4 | +author : Brian Caffo, Jeff Leek, Roger Peng |
| 5 | +job : Johns Hopkins Bloomberg School of Public Health |
| 6 | +logo : bloomberg_shield.png |
| 7 | +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} |
| 8 | +highlighter : highlight.js # {highlight.js, prettify, highlight} |
| 9 | +hitheme : tomorrow # |
| 10 | +url: |
| 11 | + lib: ../../librariesNew |
| 12 | + assets: ../../assets |
| 13 | +widgets : [mathjax] # {mathjax, quiz, bootstrap} |
| 14 | +mode : selfcontained # {standalone, draft} |
| 15 | +--- |
| 16 | + |
| 17 | +## Statistical inference defined |
| 18 | + |
| 19 | +Statistical inference is the process of drawing formal conclusions from |
| 20 | +data. |
| 21 | + |
| 22 | +In our class, we wil define formal statistical inference as settings where one wants to infer facts about a population using noisy |
| 23 | +statistical data where uncertainty must be accounted for. |
| 24 | + |
| 25 | +--- |
| 26 | + |
| 27 | +## Motivating example: who's going to win the election? |
| 28 | + |
| 29 | +In every major election, pollsters would like to know, ahead of the |
| 30 | +actual election, who's going to win. Here, the target of |
| 31 | +estimation (the estimand) is clear, the percentage of people in |
| 32 | +a particular group (city, state, county, country or other electoral |
| 33 | +grouping) who will vote for each candidate. |
| 34 | + |
| 35 | +We can not poll everyone. Even if we could, some polled |
| 36 | +may change their vote by the time the election occurs. |
| 37 | +How do we collect a reasonable subset of data and quantify the |
| 38 | +uncertainty in the process to produce a good guess at who will win? |
| 39 | + |
| 40 | +--- |
| 41 | + |
| 42 | +## Motivating example: is hormone replacement therapy effective? |
| 43 | + |
| 44 | +A large clinical trial (the Women’s Health Initiative) published results in 2002 that contradicted prior evidence on the efficacy of hormone replacement therapy for post menopausal women and suggested a negative impact of HRT for several key health outcomes. **Based on a statistically based protocol, the study was stopped early due an excess number of negative events.** |
| 45 | + |
| 46 | +Here's there's two inferential problems. |
| 47 | + |
| 48 | +1. Is HRT effective? |
| 49 | +2. How long should we continue the trial in the presence of contrary |
| 50 | +evidence? |
| 51 | + |
| 52 | +See WHI writing group paper JAMA 2002, Vol 288:321 - 333. for the paper and Steinkellner et al. Menopause 2012, Vol 19:616 621 for adiscussion of the long term impacts |
| 53 | + |
| 54 | +--- |
| 55 | + |
| 56 | +## Motivating example |
| 57 | +### Brain activation |
| 58 | + |
| 59 | + |
| 60 | +http://www.wired.com/2009/09/fmrisalmon/ |
| 61 | + |
| 62 | + |
| 63 | +--- |
| 64 | + |
| 65 | +## Summary |
| 66 | + |
| 67 | +- These examples illustrate many of the difficulties of trying |
| 68 | +to use data to create general conclusions about a population. |
| 69 | +- Paramount among our concerns are: |
| 70 | + - Is the sample representative of the population that we'd like to draw inferences about? |
| 71 | + - Are there known and observed, known and unobserved or unknown and unobserved variables that contaminate our conclusions? |
| 72 | + - Is there systematic bias created by missing data or the design or conduct of the study? |
| 73 | + - What randomness exists in the data and how do we use or adjust for it? Here randomness can either be explicit via randomization |
| 74 | +or random sampling, or implicit as the aggregation of many complex uknown processes. |
| 75 | + - Are we trying to estimate an underlying mechanistic model of phenomena under study? |
| 76 | +- Statistical inference requires navigating the set of assumptions and |
| 77 | +tools and subsequently thinking about how to draw conclusions from data. |
| 78 | + |
| 79 | +--- |
| 80 | +## Example goals of inference |
| 81 | + |
| 82 | +1. Estimate and quantify the uncertainty of an estimate of |
| 83 | +a population quantity (the proportion of people who will |
| 84 | + vote for a candidate). |
| 85 | +2. Determine whether a population quantity |
| 86 | + is a benchmark value ("is the treatment effective?"). |
| 87 | +3. Infer a mechanistic relationship when quantities are measured with |
| 88 | + noise ("What is the slope for Hooke's law?") |
| 89 | +4. Determine the impact of a policy? ("If we reduce polution levels, |
| 90 | + will asthma rates decline?") |
| 91 | +5. Talk about the probability that something occurs. |
| 92 | + |
| 93 | +--- |
| 94 | +## Example tools of the trade |
| 95 | + |
| 96 | +1. Randomization: concerned with balancing unobserved variables that may confound inferences of interest |
| 97 | +2. Random sampling: concerned with obtaining data that is representative |
| 98 | +of the population of interest |
| 99 | +3. Sampling models: concerned with creating a model for the sampling |
| 100 | +process, the most common is so called "iid". |
| 101 | +4. Hypothesis testing: concerned with decision making in the presence of uncertainty |
| 102 | +5. Confidence intervals: concerned with quantifying uncertainty in |
| 103 | +estimation |
| 104 | +6. Probability models: a formal connection between the data and a population of interest. Often probability models are assumed or are |
| 105 | +approximated. |
| 106 | +7. Study design: the process of designing an experiment to minimize biases and variability. |
| 107 | +8. Nonparametric bootstrapping: the process of using the data to, |
| 108 | + with minimal probability model assumptions, create inferences. |
| 109 | +9. Permutation, randomization and exchangeability testing: the process |
| 110 | +of using data permutations to perform inferences. |
| 111 | + |
| 112 | +--- |
| 113 | +## Different thinking about probability leads to different styles of inference |
| 114 | + |
| 115 | +We won't spend too much time talking about this, but there are several different |
| 116 | +styles of inference. Two broad categories that get discussed a lot are: |
| 117 | + |
| 118 | +1. Frequency probability: is the long run proportion of |
| 119 | + times an event occurs in independent, identically distributed |
| 120 | + repetitions. |
| 121 | +2. Frequency inference: uses frequency interpretations of probabilities |
| 122 | +to control error rates. Answers questions like "What should I decide |
| 123 | +given my data controlling the long run proportion of mistakes I make at |
| 124 | +a tolerable level." |
| 125 | +3. Bayesian probability: is the probability calculus of beliefs, given that beliefs follow certain rules. |
| 126 | +4. Bayesian inference: the use of Bayesian probability representation |
| 127 | +of beliefs to perform inference. Answers questions like "Given my subjective beliefs and the objective information from the data, what |
| 128 | +should I believe now?" |
| 129 | + |
| 130 | +Data scientists tend to fall within shades of gray of these and various other schools of inference. |
| 131 | + |
| 132 | +--- |
| 133 | +## In this class |
| 134 | + |
| 135 | +* In this class, we will primarily focus on basic sampling models, |
| 136 | +basic probability models and frequency style analyses |
| 137 | +to create standard inferences. |
| 138 | +* Being data scientists, we will also consider some inferential strategies that rely heavily on the observed data, such as permutation testing |
| 139 | +and bootstrapping. |
| 140 | +* As probability modeling will be our starting point, we first build |
| 141 | +up basic probability. |
| 142 | + |
| 143 | +--- |
| 144 | +## Where to learn more on the topics not covered |
| 145 | + |
| 146 | +1. Explicit use of random sampling in inferences: look in references |
| 147 | +on "finite population statistics". Used heavily in polling and |
| 148 | +sample surveys. |
| 149 | +2. Explicit use of randomization in inferences: look in references |
| 150 | +on "causal inference" especially in clinical trials. |
| 151 | +3. Bayesian probability and Bayesian statistics: look for basic itroductory books (there are many). |
| 152 | +4. Missing data: well covered in biostatistics and econometric |
| 153 | +references; look for references to "multiple imputation", a popular tool for |
| 154 | +addressing missing data. |
| 155 | +5. Study design: consider looking in the subject matter area that |
| 156 | + you are interested in; some examples with rich histories in design: |
| 157 | + 1. The epidemiological literature is very focused on using study design to investigate public health. |
| 158 | + 2. The classical development of study design in agriculture broadly covers design and design principles. |
| 159 | + 3. The industrial quality control literature covers design thoroughly. |
| 160 | + |
0 commit comments