Speech Recognition: Lecture 11: Advanced Topics
Speech Recognition: Lecture 11: Advanced Topics
Mehryar Mohri
Courant Institute of Mathematical Sciences
[email protected]
This Lecture
Speech recognition evaluation
N-best strings algorithms
Lattice generation
Discriminative training
*
*
Graphical representation:
• best path of A ◦ T ◦ B . e
100
5 5
5
90
5 4 4
4
3 3
4 3
word accuracy
5 3 2 2
2
4 2
3
80
2 1 1 1
1
5
43 1
2
70
1
60
0 1 2 3 4 5 6
x real-time
1. 1-Best 2. 10-Best 3. 100-Best 4. 1000-Best 5. Lattice
Additional time to pay for N-best very small even for large N.
Mehryar Mohri - Speech Recognition page 21 Courant Institute, NYU
N-Best String Alg. - Properties
Simplicity and efficiency:
is/77.68 7 my/19.2
I’d/143.3 card/20.1
9
3 this/90.3 6
like/41.58
(r, t!! )
(q, t)
(r, t! )
t! t!! t+1
in/25
saint/68.1
in/21.7
around/107
at/33.8
at/31.6
at/28.5
at/32.8
arrive/24.2
at/29.7
arrive/23.2 at/31.1
arrive/27.7 in/24.7
saint/53.6
19 48
arrive/21.7 in/27.2 saint/46.8
53 petersburg/73.6
75
saint/42.5 petersburg/91.5 around/116
arrive/16.5 27 at/28.9
arrive/16.5
22 at/35 saint/49.7
around/105
62 petersburg/85.2
arrive/19.7 in/29 saint/39.8 77
28 around/109
petersburg/84.4
at/22.2 saint/56.1 61
arrive/19.1 around/104
petersburg/90.3
at/23.4 around/92.7
arrive/16.9 71
saint/34.4
and/59.3 at/19.5 petersburg/92.3 96
arriving/41.3 around/109 nine/11.8
saint/39.1 a/16.3
ninety/18
arrive/21.1 79 97
leave/39.2 at/26.4
32 45 saint/45.4 around/87.6 a/9.34
nine/12.9
arrive/16
leave/35.9 at/37.3 saint/57.8
around/104
nine/5.61
flights/79 8 leave/41.9 at/36.8 saint/44.4 around/91.6 98
55 petersburg/97.9 nine/10.7 a/14.1
arrive/13.7
69 around/99.3 nine/17.2 m/16.6
saint/33.2 petersburg/99.9
flights/83.8 leave/34.6 arrive/16.6 at/32.8
a/22.8 102 m/19
95
arrive/20.5 petersburg/104 91 a/13.3 m/13.9
at/28.9 saint/38.7 around/117 and/19.1
nine/13
leave/50.7
81 105 /13.6
and/14.2 nine/24.9
flights/83.4 detroit/99.1 around/97.2 a/17.9 100 m/16.3
leave/31.3 12 arrive/15.6 at/35.1 saint/51.5 petersburg/86.1
9
flights/88.2
leave/37.3 detroit/102 saint/45.5 around/92.2 nine/9.2 92
arrive/16.9 m/13.6
petersburg/92.8 a/26.6
25 at/32 nine/5.66
detroit/103
nine/18.7
3 leave/47.4 82 m/24.1
flights/63.5 11 arrive/13.1 at/33.7 saint/37.8 petersburg/85.1 70 a/18.5 104 /16.2
which/77.7 detroit/106 20 47 around/87.8
and/55.8
leave/57.7
saint/44.2 around/105 a/8.73 m/12.5
flights/68.3 arrive/15.7 at/27.5 petersburg/88.1 78 87 nine/17.2
54 67 94
which/72.9 2 detroit/96.3 15 petersburg/90 around/109 around/48.8
flights/61.8 7 and/57 a/13.6 m/10.9
leave/53.4 arrive/14.4
at/23.6 saint/43.1 nine/6.31 nine/8.8 101
and/55.3 21 m/21.4
0 detroit/88.5
which/81.6 flights/72.4 leave/54.4 arrive/18.2
petersburg/98.8 around/93.2 86 around/102
at/29.8 saint/38 nine/8.76 a/16.4
13 99
which/69.9 4 detroit/106
flights/55.2 leave/70.9 m/7.03
24 saint/46.5 around/109 nine/28 nine/18.8 a/22.3
arrive/22.7 at/20 petersburg/92 103 /20.8
and/53.5
flights/59.9 leave/60.4 detroit/102
m/17.5
6 at/21.2 saint/43.3 petersburg/80.9 around/111 nine/12.3
and/55 arrive/20.2 a/12.1
leave/45.8 detroit/105 around/97.2
flights/53.5
at/17.3 saint/49.7 56 around/97.8 nine/31.2
flights/64 arrive/19.6 petersburg/91.1 93
detroit/99.7
1 leave/61.9 16 85
flights/45.4 arrive/13.9 at/20.1 petersburg/73.3
saint/48.6 around/99.5
detroit/91.9
leave/68.9 14 nine/21.9
that/60.1 arrive/16.6 at/25.7 46 petersburg/84.8 at/65 89
detroit/110
saint/36.8 68
leave/67.6
flights/50.2 and/53.1 around/85.6
nine/21.9
at/27.4 saint/50.1 petersburg/77.1 at/12.2
arrive/14.1 57
flights/43.7 detroit/99.4 at/15.2 nine/28
leave/73.6 17 petersburg/76.4
5 at/21.3 saint/38.7 around/81
and/55 arrive/16 88
31 80
detroit/91.6 around/97.1 nine/12.4
flights/54.3 leave/82.1 at/17.4
arrive/20.5 at/20.1
detroit/109
leave/44.4 saint/42.3 90
33
18 at/23.5
10 arrive/14.4
petersburg/75.6 at/14.1
saint/49.5 nine/9.45
30
leave/51.4 at/26.3 around/113
arrive/13.6 83
saint/43.1
leave/64.6 nine/22.6
arrive/18.6 at/27.5 saint/56.5 around/111
73
petersburg/89.3
saint/45 around/101
arrive/23.1 at/20.1
59 84
petersburg/84.1
arrive/17.1 saint/43 around/99.5
at/23.5
arrive/16.5
at/25.2 saint/49.3
at/21.3 saint/55.4
43
Lattice after Determinization
(MM, 1997)
around/81
27
at/19.1
32
around/83
nine/21.8
36
arriving/45.2 saint/51.9 petersburg/85.9 nine/21.7
12 17 21 around/96.5 a/15.3
in/16.3
25 ninety/18
at/16.1
arrive/17.4 at/23
7 11 petersburg/80 a/15.3
18 22 29 33 m/
5 saint/49.4 at/16.1 26 ninety/34.1
and/49.7 arrives/23.6
at/20.9 m/
which/69.9 flights/53.1 leave/61.2 detroit/105 8 13 saint/43
0 1 2 3 4 nine/21.7
that/60.1 a/9.34
petersburg/85.6 around/97.1 30 34
6 arrive/12.8 saint/43 16 20
at/21.9 a/14.1
10 15 nine/21.7
around/97.1 28 ninety/34.1
35
and/18.7 nine/10.7
and/18.7 31
24 ninety/34.1
and/18.7
around/97.1 nine/29.1
• David Eppstein, Finding the shortest paths, SIAM Journal of Computing, vol.28, no. 2, pp. 652–
673, 1998.
• Andrej Ljolje and Fernando Pereira and Michael Riley, Efficient general lattice generation
and rescoring. In Proceedings of the European Conference on Speech Communication
and Technology (Eurospeech ’99), Budapest, Hungary, 1999.
• Mehryar Mohri and Michael Riley. An Efficient Algorithm for the N-Best-Strings Problem.
In Proceedings of the International Conference on Spoken Language Processing 2002 (ICSLP
’02), Denver, Colorado, September 2002.
• Mehryar Mohri, Fernando C. N. Pereira, and Michael Riley. The Design Principles of a
Weighted Finite-State Transducer Library. Theoretical Computer Science, 231:17-32, January
2000.
• Julian Odell. The Use of Context in Large Vocabulary Speech Recognition. Ph.D. thesis, 1995.
Cambridge University, UK.
• Frank Soong and Eng-Fong Huang, A Tree-Trellis Based Fast Search for Finding the N Best
Sentence Hypotheses in Continuous Speech Recognition. In Proceedings of the International
Conference on Acoustics, Speech, and Signal Processing (ICASSP ’91), Toronto, Canada,
November 1991, pp. 705–708.