HMM Toolkit (HTK)

Presentation by
Daniel Whiteley

AME department
What is HTK?
The Hidden Markov Model Toolkit (HTK) is a
portable toolkit for building and manipulating
hidden Markov models. HTK is primarily used
for speech recognition research although it has
been used for numerous other applications
including research into speech synthesis,
character recognition and DNA sequencing. HTK
is in use at hundreds of sites worldwide.
What is HTK?
HTK consists of a set of library modules and tools
available in C source form. The tools provide
sophisticated facilities for speech analysis, HMM
training, testing and results analysis. The software
supports HMMs using both continuous density
mixture Gaussians and discrete distributions and
can be used to build complex HMM systems.
Basic HTK command format

The commands in HTK follow a basic command
line format:
HCommand [options] files

Options are indicated by a dash followed by the
option letter. Options common to all tools use
capital letters.

File extensions are not necessary in HTK; each tool
reads a file's header to determine its format.
Configuration files

You can also configure HTK modules using config files.
A config file is passed with the -C option, or applied
globally with the command setenv HCONFIG myconfig, where
myconfig is your own config file.

All possible configuration variables can be found in
chapter 18 of the HTK manual. However, for most of
our purposes, we only need to create a config file with
these lines:

# the user-defined file format (not sound)
SOURCEKIND = USER
# keep the file in the same format
TARGETKIND = ANON_D
Using HTK

Parts of HMM modeling
– Data Preparation
– Model Training
– Pattern Recognition
– Model Analysis
Data Preparation

One small problem:

HTK was tailored for speech recognition, so most of
the data preparation tools are for audio.
– Because of this, we need to coerce our data into the HTK
parameterized data file format.

HTK parameter files consist of a sequence of samples
preceded by a header. The samples are simply data
vectors whose components are 2-byte integers or 4-byte
floating-point numbers.

For us, these vectors will be a sequence of joint angles
received from a motion capture session.
HTK file format

The file begins with a 12-byte header containing
the following information:
– nSamples (4-byte int): number of samples
– samplePeriod (4-byte int): sample period, in units
of 100 ns
– sampleSize (2-byte int): number of bytes per vector
– parameterKind (2-byte int): defines the type of data

For our purposes, this last parameter will be either 0x2400,
the user-defined parameter kind, or 0x2800, the
discrete kind.
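To make the header layout concrete, here is a sketch in Python (the function names are our own, not part of HTK) that packs joint-angle vectors into this format. It assumes big-endian byte order, 4-byte float components, and the base HTK code for USER data (9); the 0x2400/0x2800 values quoted above may additionally encode qualifier bits.

```python
import struct

def pack_htk_user(vectors, sample_period=1):
    """Pack float vectors into HTK parameter-file bytes (header + samples).

    sample_period is in units of 100 ns, per the HTK header convention.
    parm_kind uses the base USER code (9) -- an assumption; the slide's
    0x2400 value may include qualifier bits.
    """
    n_samples = len(vectors)
    sample_size = 4 * len(vectors[0])        # 4-byte floats per component
    parm_kind = 9                            # base code for USER data
    # 12-byte header: two 4-byte ints, then two 2-byte ints, big-endian
    header = struct.pack(">iihh", n_samples, sample_period,
                         sample_size, parm_kind)
    body = b"".join(struct.pack(">%df" % len(v), *v) for v in vectors)
    return header + body

def unpack_htk_header(data):
    """Return (nSamples, samplePeriod, sampleSize, parameterKind)."""
    return struct.unpack(">iihh", data[:12])
```

Writing the result to disk gives a file HTK can read as SOURCEKIND = USER, assuming the byte order matches your HTK build's expectations.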
HMM model creation

In order to model the motion capture sequence, we need
to create a prototype of the HMM. In this prototype, the
values of B and π are arbitrary. The same is true for the
transition matrix A, save that any transition probability
you set to zero will remain zero.

Models are written in a markup language similar to
HTML.

Models in HTK also have a non-emitting beginning and
ending state. These states are not defined in the script.
HMM Model Example

~h "prototype"
<BeginHMM>
<VectorSize> 4 <USER>
<NumStates> 5
<State> 2 <NumMixes> 3
<Mixture> 1 0.3
<Mean> 4
0.0 0.0 0.0 0.0
<Variance> 4
1.0 1.0 1.0 1.0
<Mixture> 2 0.4
...
<State> 3
...
<TransP> 5
0.0 0.4 0.3 0.3 0.0
0.0 0.2 0.5 0.3 0.0
0.0 0.2 0.2 0.4 0.2
0.0 0.1 0.2 0.3 0.4
0.0 0.0 0.0 0.0 0.0
<EndHMM>

– ~h gives the name of the file.
– <VectorSize> is the sample size; <NumStates> is the number of states.
– <NumMixes> is the number of Gaussian distributions for a state.
– <Mixture> gives each distribution's ID and weight.
– <Mean> is the mean observation vector; <Variance> is the diagonal of the covariance matrix.
– <TransP> is the transition matrix A. All the transition probabilities for the ending state (the last row) are always zero.
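Writing prototype files like this by hand is tedious. The sketch below (our own helper, not an HTK tool) generates a minimal single-mixture prototype in this layout, with zero means, unit variances, and a left-to-right transition matrix:

```python
def make_prototype(name, num_states, vec_size):
    """Generate a minimal HTK-style prototype definition as a string:
    one Gaussian per emitting state, zero means, unit variances, and a
    left-to-right transition matrix. A sketch only, not an HTK tool."""
    lines = ['~h "%s"' % name,
             "<BeginHMM>",
             "<VectorSize> %d <USER>" % vec_size,
             "<NumStates> %d" % num_states]
    for s in range(2, num_states):           # states 1 and N are non-emitting
        lines += ["<State> %d <NumMixes> 1" % s,
                  "<Mixture> 1 1.0",
                  "<Mean> %d" % vec_size,
                  " ".join(["0.0"] * vec_size),
                  "<Variance> %d" % vec_size,
                  " ".join(["1.0"] * vec_size)]
    lines.append("<TransP> %d" % num_states)
    for i in range(1, num_states + 1):
        row = [0.0] * num_states
        if i == 1:
            row[1] = 1.0                     # entry state goes straight to state 2
        elif i < num_states:
            row[i - 1] = 0.5                 # self-loop
            row[i] = 0.5                     # advance to the next state
        # the last row stays all zeros: no transitions out of the exit state
        lines.append(" ".join("%.1f" % p for p in row))
    lines.append("<EndHMM>")
    return "\n".join(lines)
```

The 0.5/0.5 split and unit variances are arbitrary starting values; as noted above, HInit/HRest will re-estimate everything except the zeroed transitions.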
Vector Quantization

In order to reduce computation, we can make the
HMM discrete.

In order to use a discrete HMM, we must first
quantize the data into a set of standard vectors.

Warning: quantizing the data inherently
introduces error.

Before quantizing the data, we must first have a
standard set of vectors, or a VQ "codebook".
This is made with HQuant.
HQuant

HQuant takes the training data and uses a K-means
algorithm to evenly partition the data; the centroids
of these partitions become our quantization vectors (QVs).

A sample command:

HQuant -C config -n 1 64 -S train.scp vqcook

– -C config: use the configuration variables found in config.
– -n 1 64: the number of QVs (64) for a certain data stream (stream 1).
– -S train.scp: a script listing all of your training files.
– vqcook: the file our codebook will be written to.

To reduce quantization time, a codebook using a binary-tree
search algorithm can be made using the -t option.
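For intuition, here is a toy version of the K-means step in pure Python. This is an illustration only, not HQuant's implementation; all names are our own.

```python
import random

def dist2(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans_codebook(data, k, iters=20, seed=0):
    """Toy K-means: partition the data and return the k partition
    centroids as the codebook. Illustrative only, not HQuant."""
    rng = random.Random(seed)
    centroids = rng.sample(data, k)          # pick k starting points
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in data:                       # assign each vector to its
            nearest = min(range(k),          # nearest centroid
                          key=lambda i: dist2(v, centroids[i]))
            clusters[nearest].append(v)
        for i, members in enumerate(clusters):
            if members:                      # keep old centroid if cluster empties
                centroids[i] = [sum(c) / len(members) for c in zip(*members)]
    return centroids

def quantize(v, codebook):
    """Replace a vector by the index of its nearest codebook entry."""
    return min(range(len(codebook)), key=lambda i: dist2(v, codebook[i]))
```

The linear nearest-neighbor search in quantize is what HQuant's -t option speeds up by organizing the codebook as a binary tree.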
Converting to Discrete

The conversion of data files is done using the HCopy
command. In order to quantize our data, we do this:

HCopy -C quantize rawdata qvdata

where rawdata is our original data, qvdata is our
quantized data, and quantize is a config file with
these lines:

# we start with our original data
SOURCEKIND = USER
# convert it into discrete data
TARGETKIND = DISCRETE
# throw away the continuous data
SAVEASVQ = T
# use our previously made codebook to quantize the data
VQTABLE = vqcook
Discrete HMM
Discrete HMMs are very similar to their continuous
counterparts, save for a few changes:

~o <Discrete> <StreamInfo> 1 1
~h "dhmm"
<BeginHMM>
<NumStates> 5
<State> 2 <NumMixes> 10
<DProb> 5461*10
....
<EndHMM>

– <NumMixes> here gives the number of discrete symbols.
– The *10 is a duplicate function: it repeats the value 5461 ten times.
– Discrete probabilities are stored in logarithmic form, where:
P(v) = exp(-d(v)/2371.8)
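This mapping is easy to check with a couple of lines of Python (the helper names are our own):

```python
import math

def dprob_to_prob(d):
    """HTK stores discrete probabilities as scaled negative logs:
    P(v) = exp(-d(v) / 2371.8)."""
    return math.exp(-d / 2371.8)

def prob_to_dprob(p):
    """Inverse mapping, rounded to the integer form stored in <DProb>."""
    return int(round(-2371.8 * math.log(p)))
```

Note that 2371.8 × ln 10 ≈ 5461, so the <DProb> 5461*10 line above is simply a uniform distribution: probability 1/10 for each of the ten symbols.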
Model Training (token HMM)

The initialization of our prototype can be done
using HInit:

HInit [options] hmm data1 data2 data3 ...

where hmm is the HMM being trained.

HInit is used mainly for left-right HMMs. A more
ergodic HMM can instead be initialized with a
flat start, setting all means and variances to their
global counterparts using HCompV:

HCompV -m -S trainlist hmm
Retraining

The model is then retrained using the Baum-Welch
algorithm found in HRest:

HRest -w 1.0 -v 0.0001 -S trainlist hmm

The -w and -v options set floors for the mixture
probabilities and variances respectively. The float
used with -w represents a multiplier of 10^-5.

This can be iterated as many times as needed to
achieve the desired results.
Dictionary Creation

In order to create a recognition program or script,
we must first create a dictionary.

A dictionary in HTK gives each word and its
pronunciation. For our purposes, it will just
consist of the token HMMs that we trained:

RUNNING run
WALKING walk
JUMPING [SKIPPING] jump

Each line gives the word, an optional displayed output
in brackets (if not specified, the word itself is
displayed), and the tokens used to form the word.
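A dictionary in this layout can be read with a few lines of Python (a sketch for this word / optional [output] / tokens format only, not HTK's full dictionary syntax):

```python
def parse_dictionary(text):
    """Parse lines of the form 'WORD [OUTPUT] token ...' into
    {word: (output, [tokens])}. If no [OUTPUT] is given, the word
    itself is used as the displayed output."""
    entries = {}
    for line in text.splitlines():
        parts = line.split()
        if not parts:
            continue
        word, rest = parts[0], parts[1:]
        output = word
        if rest and rest[0].startswith("["):
            output = rest[0].strip("[]")     # bracketed displayed output
            rest = rest[1:]
        entries[word] = (output, rest)
    return entries
```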
Label Files

Label files contain a transcription of what is
going on in the data sequence. Each line gives the
start of the frame in samples, the end of the frame
in samples, and the token found in that time frame:

000000 100000 walk
100001 200000 run
200001 300000 jump
Master Label Files (MLFs)
During training and recognition, we may have many
test files and their accompanying label files. The
label files can be condensed into one file called a
master label file, or MLF:

#!MLF!#
"*/a.lab"
000000 100000 walk
100001 200000 run
200001 300000 jump
.
"*/b.lab"
run
.
"*/jump*.lab"
jump
.

– Each entry has the same format as an original label file
and is terminated by a period.
– If the entire file is one token, it can be labeled with
just the token.
– The wildcard operator can be used to label multiple
files at once.
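As a sketch, an MLF in this layout can be parsed with a short Python helper (our own code, with no claim to cover the full MLF syntax):

```python
def parse_mlf(text):
    """Parse a master label file into {pattern: [(start, end, token), ...]}.
    Entries with a bare token (no times) are stored as (None, None, token)."""
    entries, current = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line or line == "#!MLF!#":
            continue
        if line.startswith('"'):             # a new labelled-file pattern
            current = entries.setdefault(line.strip('"'), [])
        elif line == ".":                    # '.' terminates an entry
            current = None
        else:
            parts = line.split()
            if len(parts) == 3:              # start, end, token
                current.append((int(parts[0]), int(parts[1]), parts[2]))
            else:                            # whole file is one token
                current.append((None, None, parts[0]))
    return entries
```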
Pattern Recognition

The recognition of a motion sequence is done by
using HVite.

To receive a transcription of the recognition data
in MLF format, we use:

HVite -a -i results -o SWT -H hmmlist \
      -I transcripts.mlf -S testfiles

– -a: create the word network from the given transcriptions.
– -i results: write the output transcription file, in MLF
format, to results.
– -o SWT: throw away unnecessary data in the label files.
– -H hmmlist: a text file containing the list of HMMs used.
– -I transcripts.mlf: the MLF file that has the test files'
transcriptions.
– -S testfiles: the motion capture data to be recognized.
Model Analysis

The analysis of the recognition results is done by
HResults:

HResults -I transcripts.mlf -H hmmlist results

– -I transcripts.mlf: the MLF containing the reference labels.
– -H hmmlist: the list of HMMs used.
– results: the MLF containing the result labels.

Note: the reference labels and the result labels
must have different file extensions.