Weather System
Introduction
This document describes the data preparation, training, and testing of the Weather System. The system uses the CMU Sphinx toolkit for speech recognition. It is intended to be speaker independent, so a large amount of data from a number of speakers has to be prepared. The Weather System is a large-vocabulary speech system and will deal with a huge amount of data from multiple speakers.
Preparing Data
Define Dataset
The first step is to determine the size of the system's vocabulary; based on this, the dataset for training and testing of the system is prepared. In the case of our system, the dataset consists of the names of 36 cities of Punjab, spoken in isolated form. We concentrate our attention on 19 cities because their weather data is easily available on the internet. A speaker-independent system requires a huge amount of data, so a large number of speakers is needed to record the city names. All the city names are written on 30 different lists in random order: the first 15 lists contain 19 districts and the next 15 lists contain the remaining 16 districts. A single speaker therefore creates a total of 525 wave files (15 × 19 + 15 × 16), of which 285 have been processed so far. At least 30 minutes of recording per speaker is required for this purpose.
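The list-building procedure above can be sketched in Python. The city names below are placeholders (not the real district names), and make_lists is a hypothetical helper used only to illustrate the counts:

```python
import random

# Placeholder city-name groups (hypothetical names, not the real districts)
group_a = [f"CITY_A{i}" for i in range(19)]  # districts on the first 15 lists
group_b = [f"CITY_B{i}" for i in range(16)]  # districts on the next 15 lists

def make_lists(seed=0):
    """Build 30 recording lists: 15 random orderings of each group."""
    rng = random.Random(seed)
    lists = []
    for group in (group_a, group_b):
        for _ in range(15):
            order = group[:]       # copy, so the master list stays sorted
            rng.shuffle(order)     # each list presents the names in random order
            lists.append(order)
    return lists

lists = make_lists()
# 15 lists of 19 names + 15 lists of 16 names = 525 prompts per speaker
total = sum(len(l) for l in lists)
```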
Recorded Data
Recording is carried out simultaneously on a mobile phone and on a microphone, so a laptop, a mobile phone, and a microphone are needed. The recording software PRAAT has been used, and recording is done at 8 kHz. A completely noiseless environment is required for recording, because noise disrupts the recorded data and makes it useless. Speech files should be stored with the .wav extension.
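The 8 kHz requirement can be checked programmatically. The sketch below uses Python's standard wave module; demo.wav and sample_rate are illustrative names only, not part of the system:

```python
import wave

def sample_rate(path):
    """Read the sampling rate of a .wav file using the stdlib wave module."""
    with wave.open(path, "rb") as w:
        return w.getframerate()

# Demo: write one second of 8 kHz mono silence, then verify its rate.
with wave.open("demo.wav", "wb") as w:
    w.setnchannels(1)     # mono
    w.setsampwidth(2)     # 16-bit samples
    w.setframerate(8000)  # 8 kHz, as used for recording
    w.writeframes(b"\x00\x00" * 8000)

rate = sample_rate("demo.wav")
```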
Sp1_m_att1.wav
Sp1_m_bah1.wav
. . .
Here Sp1 means speaker number 1, m means that the speaker's gender is male, att means District Attock, and the trailing 1 means list 1.
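This naming scheme can be parsed mechanically. A minimal sketch, assuming the pattern Sp&lt;speaker&gt;_&lt;gender&gt;_&lt;city&gt;&lt;list&gt;.wav; parse_name is a hypothetical helper, and f for female is an assumption not stated above:

```python
import re

# Hypothetical parser for the naming scheme Sp<N>_<gender>_<city><list>.wav
NAME_RE = re.compile(r"Sp(\d+)_([mf])_([a-z]+)(\d+)\.wav")

def parse_name(filename):
    """Split a recording file name into its speaker/gender/city/list parts."""
    m = NAME_RE.fullmatch(filename)
    if m is None:
        raise ValueError(f"unexpected file name: {filename}")
    return {
        "speaker": int(m.group(1)),  # speaker number, e.g. 1
        "gender": m.group(2),        # m = male (f = female is an assumption)
        "city": m.group(3),          # abbreviated district name, e.g. att
        "list": int(m.group(4)),     # number of the list the word came from
    }

info = parse_name("Sp1_m_att1.wav")
```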
Preparing Files
Now we need to prepare certain files for Sphinx. Sphinx is open-source speech recognition software. A speech recognition system requires two types of models, i.e. an acoustic model and a language model. The acoustic model is created from audio files and their text transcriptions; the language model is created from training text. 10% of the data is placed in the testing set and the remaining 90% in the training set. The following files are needed for Sphinx.
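The 90/10 split can be sketched as follows; split_dataset is a hypothetical helper and the file names are placeholders:

```python
import random

def split_dataset(files, test_fraction=0.10, seed=0):
    """Shuffle the file names and split them ~90% training / ~10% testing."""
    rng = random.Random(seed)
    files = list(files)
    rng.shuffle(files)                                # avoid speaker/list order bias
    n_test = max(1, round(len(files) * test_fraction))
    return files[n_test:], files[:n_test]             # (train, test)

# Placeholder file names, following the naming scheme described earlier
all_files = [f"Sp1_m_att{i}.wav" for i in range(1, 101)]
train, test = split_dataset(all_files)
```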
Audio files
Audio files are created by segmenting the recorded data.
Dictionary File
The dictionary file is named an4.dic. It maps each word defined in the dataset to its phone sequence. The word and its phones are separated by a tab, and the phones are separated from each other by spaces:

ATTAK	A TT A K
BAHAAVALPUR	B A H AA V A L P U R
B_HAKAR	B_H A K A R
T_SHAKVAAL	T_SH A K V AA L
. . .
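Reading such tab-separated entries can be sketched as below; load_dictionary is a hypothetical helper, and the two entries are taken from the examples above:

```python
def load_dictionary(lines):
    """Parse an4.dic-style lines: WORD<TAB>phone phone phone ..."""
    entries = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        word, phones = line.split("\t", 1)  # word and phones separated by a tab
        entries[word] = phones.split()      # phones separated by spaces
    return entries

dic = load_dictionary([
    "ATTAK\tA TT A K",
    "B_HAKAR\tB_H A K A R",
])
```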
Filler File
The filler file is named an4.filler. This file contains the silence markers that are incorporated in our speech. The silence at the start of any utterance is <s>, the silence at the end of any utterance is </s>, and the silence in the middle of any utterance is <sil>:

<s>	SIL
</s>	SIL
<sil>	SIL

Each of these three filler words is separated from its SIL mapping by a tab.
Phone File
The phone file is named an4.phone. It lists all the phonemes of all the words defined in the dataset, one per line, in such a manner that there is no repetition:

A
TT
K
B
H
AA
V
L
P
U
R
B_H
T_SH
F
. . .
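The no-repetition listing can be generated directly from the dictionary. A minimal sketch, assuming the dictionary has already been parsed into word-to-phones form; extract_phones is a hypothetical helper:

```python
def extract_phones(dictionary):
    """Collect every phone used in the dictionary, without repetition,
    in first-seen order -- the contents of an4.phone."""
    seen = []
    for phones in dictionary.values():
        for p in phones:
            if p not in seen:   # keep each phone only once
                seen.append(p)
    return seen

phones = extract_phones({
    "ATTAK": ["A", "TT", "A", "K"],
    "B_HAKAR": ["B_H", "A", "K", "A", "R"],
})
```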
Corpus
Corpus is the file made from the an4_train.transcription file by removing the wave file name present at the end of each line, as shown below:

<s> ATTAK </s>
<s> ATTAK </s>
<s> ATTAK </s>
. . .

This can be done in the PSPad software:
1. Open the an4_train.transcription file in PSPad.
2. Press Ctrl+H, then select the regular expression option.
3. In the Find field, write \(S.*\)
4. Click OK.
5. Rename the file to Corpus.txt.
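As an alternative to the PSPad steps, the same stripping can be sketched in Python. The trailing file IDs in parentheses follow the usual Sphinx transcription convention; the exact IDs shown here are assumptions:

```python
import re

def transcription_to_corpus(lines):
    """Drop the trailing '(file_id)' from each transcription line,
    keeping only the '<s> WORD </s>' part."""
    out = []
    for line in lines:
        # Remove a final parenthesized file ID plus any surrounding whitespace
        out.append(re.sub(r"\s*\(.*\)\s*$", "", line.rstrip("\n")))
    return out

corpus = transcription_to_corpus([
    "<s> ATTAK </s> (sp1_m_att1)",
    "<s> BAHAAVALPUR </s> (sp1_m_bah1)",
])
```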
Language Model
This file is named an4.lm. The language model is created from Corpus.txt in the following way:
1. Download the CMU-Cambridge Statistical Language Modeling toolkit from the internet.
2. Run the following commands in your terminal:
   a. ./text2wfreq < Corpus.txt > a.wfreq
   b. ./wfreq2vocab < a.wfreq > a.vocab
   c. ./text2idngram -n 3 -vocab a.vocab < Corpus.txt > a.idngram
   d. ./idngram2lm -n 3 -vocab_type 2 -witten_bell -oov_fraction 0.5 -idngram a.idngram -vocab a.vocab -context training.ccs -arpa LanguageModel.arpa
3. A file named LanguageModel.arpa will be created.
4. Copy LanguageModel.arpa to the lm3g2dmp folder and run the following command:
5. lm3g2dmp LanguageModel.arpa .\
6. A file named LanguageModel.arpa.DMP will be created.