|
Introduction and Background |
|
|
1 | (24) |
|
What is Discriminative Learning? |
|
|
1 | (1) |
|
What is Speech Recognition? |
|
|
2 | (2) |
|
Roles of Discriminative Learning in Speech Recognition |
|
|
4 | (1) |
|
Background: Basic Probality Distributions |
|
|
5 | (12) |
|
Multinomial Distributions |
|
|
6 | (1) |
|
Gaussian and Mixture-of-Gaussian Distributions |
|
|
7 | (1) |
|
Exponential-Family Distribution |
|
|
7 | (10) |
|
Background: Basic Optimization Concepts and Techniques |
|
|
17 | (6) |
|
|
18 | (1) |
|
Necessary and Sufficient Conditions for an Optimum |
|
|
18 | (1) |
|
Lagrange Multiplier Method for Constrained Optimization |
|
|
19 | (1) |
|
|
20 | (1) |
|
Growth Transformation Method: Introduction |
|
|
21 | (2) |
|
|
23 | (2) |
|
Statistical Speech Recognition: A Tutorial |
|
|
25 | (6) |
|
|
25 | (1) |
|
|
26 | (1) |
|
Acoustic Modeling and HMMs |
|
|
27 | (4) |
|
Discriminative Learning: A Unified Objective Function |
|
|
31 | (16) |
|
|
31 | (1) |
|
A Unified Discriminative Training Criterion |
|
|
32 | (1) |
|
|
32 | (1) |
|
|
32 | (1) |
|
|
33 | (2) |
|
Introduction to MMI Criterion |
|
|
33 | (1) |
|
Reformulation of the MMI Criterion into Its Unified Form |
|
|
34 | (1) |
|
|
35 | (4) |
|
Introduction to the MCE Criterion |
|
|
35 | (3) |
|
Reformulation of the MCE Criterion Into its Unified Form |
|
|
38 | (1) |
|
Minimum Phone/Word Error and its Unified Form |
|
|
39 | (2) |
|
Introduction to the MPE/MWE Criterion |
|
|
39 | (1) |
|
Reformulation of the MPE/MWE Criterion Into Its Unified Form |
|
|
40 | (1) |
|
Discussions and Comparisons |
|
|
41 | (6) |
|
Discussion and Elaboration on the Unified Form |
|
|
41 | (2) |
|
Comparisons With Another Unifying Framework |
|
|
43 | (4) |
|
Discriminative Learning Algorithm for Exponential-Family Distributions |
|
|
47 | (12) |
|
Exponential-Family Models for Classification |
|
|
47 | (1) |
|
Construction of Auxiliary Functions |
|
|
48 | (1) |
|
GT Learning for Exponential-Family Distributions |
|
|
49 | (5) |
|
Estimation Formulas for Two Exponential-Family Distributions |
|
|
54 | (5) |
|
|
54 | (1) |
|
Multivariate Gaussian Distribution |
|
|
55 | (4) |
|
Discriminative Learning Algorithm for Hidden Markov Model |
|
|
59 | (16) |
|
Estimation Formulas for Discrete HMM |
|
|
59 | (8) |
|
Constructing Auxiliary Function F (ΛΛ') |
|
|
60 | (1) |
|
Constructing Auxiliary Function V(ΛΛ') |
|
|
60 | (1) |
|
Simplifying Auxiliary Function V(ΛΛ') |
|
|
61 | (4) |
|
GT by Optimizing Auxiliary Function U(ΛΛ') |
|
|
65 | (2) |
|
Estimation Formulas for CDHMM |
|
|
67 | (3) |
|
Relationship with Gradient-Based Methods |
|
|
70 | (1) |
|
Setting Constant D for GT-Based Optimization |
|
|
71 | (4) |
|
Existence Proof of Finite D in GT Updates for CDHMM |
|
|
72 | (3) |
|
Practical Implementation of Discriminative Learning |
|
|
75 | (16) |
|
Computing ΔΓ (i, r, t) in Growth-Transform Formulas |
|
|
75 | (4) |
|
Product Form of C(s) (for MMI) |
|
|
76 | (2) |
|
Summation Form of C(s) (MCE and MPE/MWE) |
|
|
78 | (1) |
|
Computing ΔΓ(i, r, t) Using Lattices |
|
|
79 | (9) |
|
Computing ΔΓ (i, r, t) for MMI Involving Lattices |
|
|
80 | (3) |
|
Computing ΔΓ (i, r, t) for MPE/MWE Involving Lattices |
|
|
83 | (4) |
|
Computing ΔΓ (i, r, t) for MCE Involving Lattices |
|
|
87 | (1) |
|
Arbitrary Exponent Scaling in MCE Implementation |
|
|
88 | (1) |
|
Arbitrary Slope in Defining MCE Cost Function |
|
|
89 | (2) |
|
Selected Experimental Results |
|
|
91 | (6) |
|
Experimental Results on Small ASR Tasks TIDIGITS |
|
|
91 | (1) |
|
Telephony LV-ASR Applications |
|
|
92 | (5) |
|
|
97 | (6) |
|
|
97 | (1) |
|
|
98 | (1) |
|
Remaining Theoretical Issue and Future Direction |
|
|
99 | (4) |
Major Symbols Used in the Book and Their Descriptions |
|
103 | (2) |
Mathematical Notation |
|
105 | (2) |
Bibliography |
|
107 | (4) |
Author Biography |
|
111 | |