Preface |
|
xxi | |
Preface to first edition |
|
xxiii | |
Notation and abbreviations |
|
xxvii | |
I Model structure, properties and methods |
|
1 | (130) |
|
1 Preliminaries: mixtures and Markov chains |
|
|
3 | (26) |
|
|
3 | (3) |
|
1.2 Independent mixture models |
|
|
6 | (8) |
|
1.2.1 Definition and properties |
|
|
6 | (3) |
|
1.2.2 Parameter estimation |
|
|
9 | (2) |
|
1.2.3 Unbounded likelihood in mixtures |
|
|
11 | (1) |
|
1.2.4 Examples of fitted mixture models |
|
|
12 | (2) |
|
|
14 | (9) |
|
1.3.1 Definitions and example |
|
|
14 | (3) |
|
1.3.2 Stationary distributions |
|
|
17 | (1) |
|
1.3.3 Autocorrelation function |
|
|
18 | (1) |
|
1.3.4 Estimating transition probabilities |
|
|
19 | (1) |
|
1.3.5 Higher-order Markov chains |
|
|
20 | (3) |
|
|
23 | (6) |
|
2 Hidden Markov models: definition and properties |
|
|
29 | (18) |
|
2.1 A simple hidden Markov model |
|
|
29 | (1) |
|
|
30 | (4) |
|
2.2.1 Definition and notation |
|
|
30 | (2) |
|
2.2.2 Marginal distributions |
|
|
32 | (1) |
|
|
33 | (1) |
|
|
34 | (7) |
|
2.3.1 The likelihood of a two-state Bernoulli-HMM |
|
|
35 | (1) |
|
2.3.2 The likelihood in general |
|
|
36 | (3) |
|
2.3.3 HMMs are not Markov processes |
|
|
39 | (1) |
|
2.3.4 The likelihood when data are missing |
|
|
40 | (1) |
|
2.3.5 The likelihood when observations are interval-censored |
|
|
41 | (1) |
|
|
41 | (6) |
|
3 Estimation by direct maximization of the likelihood |
|
|
47 | (18) |
|
|
47 | (1) |
|
3.2 Scaling the likelihood computation |
|
|
48 | (2) |
|
3.3 Maximization of the likelihood subject to constraints |
|
|
50 | (3) |
|
3.3.1 Reparametrization to avoid constraints |
|
|
50 | (2) |
|
3.3.2 Embedding in a continuous-time Markov chain |
|
|
52 | (1) |
|
|
53 | (1) |
|
3.4.1 Multiple maxima in the likelihood |
|
|
53 | (1) |
|
3.4.2 Starting values for the iterations |
|
|
53 | (1) |
|
3.4.3 Unbounded likelihood |
|
|
53 | (1) |
|
|
54 | (2) |
|
3.6 Standard errors and confidence intervals |
|
|
56 | (3) |
|
3.6.1 Standard errors via the Hessian |
|
|
56 | (2) |
|
3.6.2 Bootstrap standard errors and confidence intervals |
|
|
58 | (1) |
|
3.7 Example: the parametric bootstrap applied to the three-state model for the earthquakes data |
|
|
59 | (1) |
|
|
60 | (5) |
|
4 Estimation by the EM algorithm |
|
|
65 | (16) |
|
4.1 Forward and backward probabilities |
|
|
65 | (4) |
|
4.1.1 Forward probabilities |
|
|
66 | (1) |
|
4.1.2 Backward probabilities |
|
|
67 | (1) |
|
4.1.3 Properties of forward and backward probabilities |
|
|
68 | (1) |
|
|
69 | (5) |
|
|
70 | (1) |
|
|
70 | (2) |
|
4.2.3 M step for Poisson- and normal-HMMs |
|
|
72 | (1) |
|
4.2.4 Starting from a specified state |
|
|
73 | (1) |
|
4.2.5 EM for the case in which the Markov chain is stationary |
|
|
73 | (1) |
|
4.3 Examples of EM applied to Poisson-HMMs |
|
|
74 | (3) |
|
|
74 | (2) |
|
4.3.2 Foetal movement counts |
|
|
76 | (1) |
|
|
77 | (1) |
|
|
78 | (3) |
|
5 Forecasting, decoding and state prediction |
|
|
81 | (16) |
|
|
81 | (1) |
|
5.2 Conditional distributions |
|
|
82 | (1) |
|
5.3 Forecast distributions |
|
|
83 | (2) |
|
|
85 | (7) |
|
5.4.1 State probabilities and local decoding |
|
|
86 | (2) |
|
|
88 | (4) |
|
|
92 | (1) |
|
5.6 HMMs for classification |
|
|
93 | (1) |
|
|
94 | (3) |
|
6 Model selection and checking |
|
|
97 | (14) |
|
6.1 Model selection by AIC and BIC |
|
|
97 | (4) |
|
6.2 Model checking with pseudo-residuals |
|
|
101 | (5) |
|
6.2.1 Introducing pseudo-residuals |
|
|
101 | (4) |
|
6.2.2 Ordinary pseudo-residuals |
|
|
105 | (1) |
|
6.2.3 Forecast pseudo-residuals |
|
|
105 | (1) |
|
|
106 | (3) |
|
6.3.1 Ordinary pseudo-residuals for the earthquakes |
|
|
106 | (2) |
|
6.3.2 Dependent ordinary pseudo-residuals |
|
|
108 | (1) |
|
|
109 | (1) |
|
|
109 | (2) |
|
7 Bayesian inference for Poisson-hidden Markov models |
|
|
111 | (12) |
|
7.1 Applying the Gibbs sampler to Poisson-HMMs |
|
|
111 | (3) |
|
7.1.1 Introduction and outline |
|
|
111 | (2) |
|
7.1.2 Generating sample paths of the Markov chain |
|
|
113 | (1) |
|
7.1.3 Decomposing the observed counts into regime contributions |
|
|
114 | (1) |
|
7.1.4 Updating the parameters |
|
|
114 | (1) |
|
7.2 Bayesian estimation of the number of states |
|
|
114 | (2) |
|
7.2.1 Use of the integrated likelihood |
|
|
115 | (1) |
|
7.2.2 Model selection by parallel sampling |
|
|
116 | (1) |
|
|
116 | (3) |
|
|
119 | (1) |
|
|
120 | (3) |
|
|
123 | (8) |
|
|
123 | (1) |
|
8.1.1 Model formulation and estimation |
|
|
123 | (1) |
|
|
124 | (1) |
|
8.2 The package HiddenMarkov |
|
|
124 | (2) |
|
8.2.1 Model formulation and estimation |
|
|
124 | (2) |
|
|
126 | (1) |
|
|
126 | (1) |
|
|
126 | (2) |
|
8.3.1 Model formulation and estimation |
|
|
126 | (2) |
|
|
128 | (1) |
|
8.4 The package R2OpenBUGS |
|
|
128 | (1) |
|
|
129 | (2) |
II Extensions |
|
131 | (66) |
|
9 HMMs with general state-dependent distribution |
|
|
133 | (12) |
|
|
133 | (1) |
|
9.2 General univariate state-dependent distribution |
|
|
133 | (3) |
|
9.2.1 HMMs for unbounded counts |
|
|
133 | (1) |
|
9.2.2 HMMs for binary data |
|
|
134 | (1) |
|
9.2.3 HMMs for bounded counts |
|
|
134 | (1) |
|
9.2.4 HMMs for continuous-valued series |
|
|
135 | (1) |
|
9.2.5 HMMs for proportions |
|
|
135 | (1) |
|
9.2.6 HMMs for circular-valued series |
|
|
136 | (1) |
|
9.3 Multinomial and categorical HMMs |
|
|
136 | (2) |
|
|
136 | (1) |
|
9.3.2 HMMs for categorical data |
|
|
137 | (1) |
|
9.3.3 HMMs for compositional data |
|
|
138 | (1) |
|
9.4 General multivariate state-dependent distribution |
|
|
138 | (4) |
|
9.4.1 Longitudinal conditional independence |
|
|
138 | (2) |
|
9.4.2 Contemporaneous conditional independence |
|
|
140 | (1) |
|
9.4.3 Further remarks on multivariate HMMs |
|
|
141 | (1) |
|
|
142 | (3) |
|
10 Covariates and other extra dependencies |
|
|
145 | (10) |
|
|
145 | (1) |
|
10.2 HMMs with covariates |
|
|
145 | (3) |
|
10.2.1 Covariates in the state-dependent distributions |
|
|
146 | (1) |
|
10.2.2 Covariates in the transition probabilities |
|
|
147 | (1) |
|
10.3 HMMs based on a second-order Malloy chain |
|
|
148 | (2) |
|
10.4 HMMs with other additional dependencies |
|
|
150 | (2) |
|
|
152 | (3) |
|
11 Continuous-valued state processes |
|
|
155 | (10) |
|
|
155 | (1) |
|
11.2 Models with continuous-valued state process |
|
|
156 | (4) |
|
11.2.1 Numerical integration of the likelihood |
|
|
157 | (1) |
|
11.2.2 Evaluation of the approximate likelihood via forward recursion |
|
|
158 | (2) |
|
11.2.3 Parameter estimation and related issues |
|
|
160 | (1) |
|
11.3 Fitting an SSM to the earthquake data |
|
|
160 | (2) |
|
|
162 | (3) |
|
12 Hidden semi-Markov models and their representation as HMMs |
|
|
165 | (22) |
|
|
165 | (1) |
|
12.2 Semi-Markov processes, hidden semi-Markov models and approximating HMMs |
|
|
165 | (2) |
|
12.3 Examples of HSMMs represented as HMMs |
|
|
167 | (6) |
|
12.3.1 A simple two-state Poisson-HSMM |
|
|
167 | (2) |
|
12.3.2 Example of HSMM with three states |
|
|
169 | (2) |
|
12.3.3 A two-state HSMM with general dwell-time distribution in one state |
|
|
171 | (2) |
|
|
173 | (3) |
|
|
176 | (2) |
|
12.6 Some examples of dwell-time distributions |
|
|
178 | (3) |
|
12.6.1 Geometric distribution |
|
|
178 | (1) |
|
12.6.2 Shifted Poisson distribution |
|
|
178 | (1) |
|
12.6.3 Shifted negative binomial distribution |
|
|
179 | (1) |
|
12.6.4 Shifted binomial distribution |
|
|
180 | (1) |
|
12.6.5 A distribution with unstructured start and geometric tail |
|
|
180 | (1) |
|
12.7 Fitting HSMMs via the HMM representation |
|
|
181 | (1) |
|
12.8 Example: earthquakes |
|
|
182 | (2) |
|
|
184 | (1) |
|
|
184 | (3) |
|
13 HMMs for longitudinal data |
|
|
187 | (10) |
|
|
187 | (2) |
|
13.2 Models that assume some parameters to be constant across component series |
|
|
189 | (1) |
|
13.3 Models with random effects |
|
|
190 | (5) |
|
13.3.1 HMMs with continuous-valued random effects |
|
|
191 | (2) |
|
13.3.2 HMMs with discrete-valued random effects |
|
|
193 | (2) |
|
|
195 | (1) |
|
|
196 | (1) |
III Applications |
|
197 | (134) |
|
14 Introduction to applications |
|
|
199 | (2) |
|
|
201 | (6) |
|
|
201 | (1) |
|
|
201 | (3) |
|
15.3 Model checking by pseudo-residuals |
|
|
204 | (2) |
|
|
206 | (1) |
|
16 Daily rainfall occurrence |
|
|
207 | (6) |
|
|
207 | (1) |
|
|
207 | (6) |
|
17 Eruptions of the Old Faithful geyser |
|
|
213 | (14) |
|
|
213 | (1) |
|
|
213 | (1) |
|
17.3 The binary time series of short and long eruptions |
|
|
214 | (6) |
|
17.3.1 Markov chain models |
|
|
214 | (2) |
|
17.3.2 Hidden Markov models |
|
|
216 | (3) |
|
17.3.3 Comparison of models |
|
|
219 | (1) |
|
17.3.4 Forecast distributions |
|
|
219 | (1) |
|
17.4 Univariate normal-HMMs for durations and waiting times |
|
|
220 | (3) |
|
17.5 Bivariate normal-HMM for durations and waiting times |
|
|
223 | (1) |
|
|
224 | (3) |
|
18 HMMs for animal movement |
|
|
227 | (18) |
|
|
227 | (1) |
|
|
228 | (1) |
|
|
228 | (1) |
|
18.2.2 The von Mises distribution |
|
|
228 | (1) |
|
18.3 HMMs for movement data |
|
|
229 | (3) |
|
|
229 | (1) |
|
18.3.2 HMMs as multi-state random walks |
|
|
230 | (2) |
|
18.4 A basic HMM for Drosophila movement |
|
|
232 | (3) |
|
18.5 HMMs and HSMMs for bison movement |
|
|
235 | (3) |
|
18.6 Mixed HMMs for woodpecker movement |
|
|
238 | (4) |
|
|
242 | (3) |
|
19 Wind direction at Koeberg |
|
|
245 | (14) |
|
|
245 | (1) |
|
19.2 Wind direction classified into 16 categories |
|
|
245 | (6) |
|
19.2.1 Three HMMs for hourly averages of wind direction |
|
|
245 | (3) |
|
19.2.2 Model comparisons and other possible models |
|
|
248 | (3) |
|
19.3 Wind direction as a circular variable |
|
|
251 | (6) |
|
19.3.1 Daily at hour 24: von Mises-HMMs |
|
|
251 | (2) |
|
19.3.2 Modelling hourly change of direction |
|
|
253 | (1) |
|
19.3.3 Transition probabilities varying with lagged speed |
|
|
253 | (1) |
|
19.3.4 Concentration parameter varying with lagged speed |
|
|
254 | (3) |
|
|
257 | (2) |
|
20 Models for financial series |
|
|
259 | (16) |
|
20.1 Financial series I: A multivariate normal-HMM for returns on four shares |
|
|
259 | (3) |
|
20.2 Financial series II: Discrete state-space stochastic volatility models |
|
|
262 | (11) |
|
20.2.1 Stochastic volatility models without leverage |
|
|
263 | (2) |
|
20.2.2 Application: FTSE 100 returns |
|
|
265 | (1) |
|
20.2.3 Stochastic volatility models with leverage |
|
|
265 | (3) |
|
20.2.4 Application: TOPIX returns |
|
|
268 | (2) |
|
20.2.5 Non-standard stochastic volatility models |
|
|
270 | (1) |
|
20.2.6 A model with a mixture AR(1) volatility process |
|
|
271 | (1) |
|
20.2.7 Application: S&P 500 returns |
|
|
272 | (1) |
|
|
273 | (2) |
|
21 Births at Edendale Hospital |
|
|
275 | (12) |
|
|
275 | (1) |
|
21.2 Models for the proportion Caesarean |
|
|
275 | (7) |
|
21.3 Models for the total number of deliveries |
|
|
282 | (3) |
|
|
285 | (2) |
|
22 Homicides and suicides in Cape Town, 1986-1991 |
|
|
287 | (10) |
|
|
287 | (1) |
|
22.2 Firearm homicides as a proportion of all homicides, suicides and legal intervention homicides |
|
|
287 | (2) |
|
22.3 The number of firearm homicides |
|
|
289 | (2) |
|
22.4 Firearm homicides as a proportion of all homicides, and firearm suicides as a proportion of all suicides |
|
|
291 | (4) |
|
22.5 Proportion in each of the five categories |
|
|
295 | (2) |
|
23 A model for animal behaviour which incorporates feed- back |
|
|
297 | (20) |
|
|
297 | (1) |
|
|
298 | (2) |
|
23.3 Likelihood evaluation |
|
|
300 | (2) |
|
23.3.1 The likelihood as a multiple sum |
|
|
301 | (1) |
|
23.3.2 Recursive evaluation |
|
|
301 | (1) |
|
23.4 Parameter estimation by maximum likelihood |
|
|
302 | (1) |
|
|
302 | (1) |
|
23.6 Inferring the underlying state |
|
|
303 | (1) |
|
23.7 Models for a heterogeneous group of subjects |
|
|
304 | (2) |
|
23.7.1 Models assuming some parameters to be constant across subjects |
|
|
304 | (1) |
|
|
305 | (1) |
|
23.7.3 Inclusion of covariates |
|
|
306 | (1) |
|
23.8 Other modifications or extensions |
|
|
306 | (1) |
|
23.8.1 Increasing the number of states |
|
|
306 | (1) |
|
23.8.2 Changing the nature of the state-dependent distribution |
|
|
306 | (1) |
|
23.9 Application to caterpillar feeding behaviour |
|
|
307 | (7) |
|
23.9.1 Data description And preliminary analysis |
|
|
307 | (1) |
|
23.9.2 Parameter estimates and model checking |
|
|
307 | (4) |
|
23.9.3 Runlength distributions |
|
|
311 | (2) |
|
23.9.4 Joint models for seven subjects |
|
|
313 | (1) |
|
|
314 | (3) |
|
24 Estimating the survival rates of Soay sheep from mark- recapture-recovery data |
|
|
317 | (14) |
|
|
317 | (1) |
|
24.2 MRR data without use of covariates |
|
|
318 | (3) |
|
24.3 MRR data involving individual-specific time-varying continuous-valued covariates |
|
|
321 | (3) |
|
24.4 Application to Soay sheep data |
|
|
324 | (4) |
|
|
328 | (3) |
A Examples of R code |
|
331 | (10) |
|
|
331 | (7) |
|
A.1.1 Transforming natural parameters to working |
|
|
332 | (1) |
|
A.1.2 Transforming working parameters to natural |
|
|
332 | (1) |
|
A.1.3 Computing minus the log-likelihood from the working parameters |
|
|
332 | (1) |
|
A.1.4 Computing the MLEs, given starting values for the natural parameters |
|
|
333 | (1) |
|
A.1.5 Generating a sample |
|
|
333 | (1) |
|
A.1.6 Global decoding by the Viterbi algorithm |
|
|
334 | (1) |
|
A.1.7 Computing log(forward probabilities) |
|
|
334 | (1) |
|
A.1.8 Computing log(backward probabilities) |
|
|
334 | (1) |
|
A.1.9 Conditional probabilities |
|
|
335 | (1) |
|
|
336 | (1) |
|
A.1.11 State probabilities |
|
|
336 | (1) |
|
|
336 | (1) |
|
|
337 | (1) |
|
A.1.14 Forecast probabilities |
|
|
337 | (1) |
|
A.2 Examples of code using the above functions |
|
|
338 | (3) |
|
A.2.1 Fitting Poisson-HMMs to the earthquakes series |
|
|
338 | (1) |
|
A.2.2 Forecast probabilities |
|
|
339 | (2) |
B Some proofs |
|
341 | (4) |
|
B.1 A factorization needed for the forward probabilities |
|
|
341 | (1) |
|
B.2 Two results needed for the backward probabilities |
|
|
342 | (1) |
|
B.3 Conditional independence of XI and XT+1 |
|
|
343 | (2) |
References |
|
345 | (14) |
Author index |
|
359 | (6) |
Subject index |
|
365 | |