0% found this document useful (0 votes)

59 views5 pages

Nadhia Iffah Saraswati - 7211040019

1. The document describes tasks to analyze speech signals using Mell-Cepstrum processing and compare results to DFT processing. 2. It provides code in Matlab to implement Mell-Cepstrum processing on a speech signal including framing, windowing, Mell filter bank analysis, DCT, and liftering. 3. Plots of the speech waveform, filterbank energies, and Mell cepstrum are generated and compared to results from DFT processing on a section of the speech.

Uploaded by

Nadhia Iffah Saraswati

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

59 views5 pages

Nadhia Iffah Saraswati - 7211040019

Uploaded by

Nadhia Iffah Saraswati

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Nadhia Iffah Saraswati | 7211040019

Tugas:
1. Dapatkan suatu model pembentukan Mell-Cepstrum untuk pengolahan sinyal wicara
2. Buat suatu program Matlab untuk pengolahan Mell-Cepstrum
3. Bandingkan hasilnya dengan proses pengolahan mengunakan DFT
Jawaban:
1. Sinyal kontinyu Pre emphasize Frame blocking Frame Windowing Fast Fourrier
Transform (FFT) Spectrums Mell Frequency Warping Mell Spectrums Discrete Cosine
Transform Mell Cepstrums Cepstral Liftering Library
2.
fs=16000;
plot(f,x);
[s,fs]=wavread('a.wav');
xlabel('frequency(Hz)')
figure(1)
ylabel('magnitude(dB)')
subplot(211)
plot(s)
%Mell Filter Bank
xlabel('Sample')
K = nfft/2+1;
ylabel('Magnitude')
M = 23;
hz2mel = @(hz)(1127*log(1+hz/700));
%frame
mel2hz = @(mel)(700*exp(mel/1127)-700);
frame_1=0.02*fs;
xframe=s(9*frame_1:10*frame_1);
[ H1, fs ] = trifbank( M, K, [0 fs], fs, hz2mel,
subplot(212)
mel2hz );
plot(xframe)
figure(4)
xlabel('Sample')
plot( fs, H1 );
ylabel('Magnitude')
xlabel( 'Frequency (Hz)' );
ylabel( 'Weight' );
%windowing = hamming
FBE = H1 * MAG(1:K,:); % FBE( FBE<1.0 ) = 1.0;
win=hamming(length(frame_1));
% apply mel floor
y_1frame_window=xframe.*win;
figure(5)
figure(2)
plot(abs(FBE))
freqz(y_1frame_window)
dctm = @( N, M )( sqrt(2.0/M) * cos(
%proses FT
repmat([0:N-1].',1,M) ...
Y=fft(y_1frame_window);
.* repmat(pi*([1:M]-0.5)/M,N,1) ) );
hz8000=8000*length(Y)/fs;
N=16;
f=(1:hz8000)*fs/length(Y);
DCT_ = dctm(N,M);
x=20*log10(abs(Y(1:length(f)))+eps);
CC= DCT_ * log(FBE);
nfft = 2^8;
ceplifter = @( N, L )( 1+0.5*L*sin(pi*[0:N-1]/L)
MAG = abs(fft(y_1frame_window,nfft,1));
);
figure(3)
L = 16; %liftering parameter

Nadhia Iffah Saraswati | 7211040019

lifter = ceplifter( N, L );
CCfinal = diag( lifter ) * CC;
figure(6)
plot(CCfinal)

0.9

0.8

0.7

0.6

Weight

1
waveform
0.5

0.5

Magnitude

0.4

0
0.3

-0.5
0.2

-1
0

500

1000

1500

2000

2500

3000

0.1

Sample
0

0.2

500

1000

1500

2000

2500

3000

3500

4000

Frequency (Hz)

waveform

Magnitude

0.1

-0.1

9
-0.2
0

100

120

140

160

180

Sample

7
20

Magnitude (dB)

6
0

4
-20

3
-40
0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

Normalized Frequency ( rad/sample)

1
2000

0
0

-2000

-4000

-6000
0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

Normalized Frequency ( rad/sample)

20
Spectrum

magnitude(dB)

Phase (degrees)

-10

-20

-30

clear all; close all; clc;

Tw = 20;
Ts = 10;
alpha = 0.97;
M = 20;
C = 12;
L = 22;
LF = 200;
HF = 3000;
wav_file = 'a.wav';
[ speech, fs, nbits ] = wavread( wav_file );

-40
0

1000

2000

3000

4000

frequency(Hz)

5000

6000

7000

8000

[ MFCCs, FBEs, frames ] = ...

mfcc( speech, fs, Tw, Ts, alpha,
@hamming, [LF HF], M, C+1, L );

Nadhia Iffah Saraswati | 7211040019

% Generate data needed for plotting

[ Nw, NF ] = size( frames );
% frame
length and number of frames
time_frames = [0:NF-1]*Ts*0.001+0.5*Nw/fs; %
time vector (s) for frames
time = [ 0:length(speech)-1 ]/fs;
logFBEs =
20*log10( FBEs );
logFBEs_floor = max(logFBEs(:))-50;
logFBEs( logFBEs<logFBEs_floor ) = logFBEs_floor;

subplot( 313 );
imagesc( time_frames, [1:C], MFCCs(2:end,:) );
axis( 'xy' );
xlim( [ min(time_frames) max(time_frames) ] );
xlabel( 'Time (s)' );
ylabel( 'Cepstrum index' );
title( 'Mel frequency cepstrum' );

% Generate plots
figure('Position', [30 30 800 600],
'PaperPositionMode', 'auto', ...
'color', 'w', 'PaperOrientation', 'landscape',
'Visible', 'on' );
subplot( 311 );
plot( time, speech, 'k' );
xlim( [ min(time_frames) max(time_frames) ] );
xlabel( 'Time (s)' );
ylabel( 'Amplitude' );
title( 'Speech waveform');

% Print figure to pdf and png files

print('-dpdf', sprintf('%s.pdf', mfilename));
print('-dpng', sprintf('%s.png', mfilename));

% Set color map to grayscale

colormap( 1-colormap('gray') );

Speech waveform
1

Amplitude

0.5
0
-0.5
-1
0.05

0.1

0.15

0.2

0.25

0.3

0.2

0.25

0.3

0.2

0.25

0.3

Time (s)
Log (mel) filterbank energies
20

Channel index

15
10
5

0.05

0.1

0.15
Time (s)
Mel frequency cepstrum

12
10

Cepstrum index

subplot( 312 );
imagesc( time_frames, [1:M], logFBEs );
axis( 'xy' );
xlim( [ min(time_frames) max(time_frames) ] );
xlabel( 'Time (s)' );
ylabel( 'Channel index' );
title( 'Log (mel) filterbank energies');

8
6
4
2
0.05

0.1

0.15
Time (s)

Nadhia Iffah Saraswati | 7211040019

%get a section vowel

%plot waveform
figure(1)
subplot(211)
plot(x)
legend('Waveform');
ylabel('Magnitude')
xlabel('Sample')

q=(ms1:ms20)/fs;
plot(q,abs(C(ms1:ms20)));
legend('Cepstrum');
xlabel('Quefrency(s)')
ylabel('Amplitude')
[c,fx]=max(abs(C(ms1:ms20)));
fprintf('Fx=%g Hz\n',fs/(ms2+fx-1));
1
Waveform
0.5

Magnitude

3.
clear all;
fs=16000;
x=wavread('a.wav');

-0.5

-1
0

%plot waveform 1 frame

frame_i=0.02*fs;
xframe=x(5*frame_i:6*frame_i);
subplot(212)
plot(xframe)

500

1000

1500

2000

2500

3000

Sample
0.2

0.1

-0.1

-0.2
0

%Cepstrum is DFT log spectrum

figure(4)
ms1=fs/1000;
ms20=fs/50;
C=fft(log(abs(Y)+eps));

100

150

200

250

300

350

Magnitude (dB)

-20

-40

-60
0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.8

0.9

Normalized Frequency ( rad/sample)

x 10
2

Phase (degrees)

%windowing
win=hamming(length(frame_i));
y_1frame_window=xframe.*win;
figure(2)
freqz(y_1frame_window)
%do Fourier
Transform
figure(3)
Y=fft(y_1frame_window);
hz8000=8000*length(Y)/fs;
f=(0:hz8000)*fs/length(Y);
plot(f,20*log10(abs(Y(1:length(f)))+eps));
legend('Spectrum');
xlabel('Frequency(Hz)')
ylabel('Magnitude(dB)')

-2

-4
0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

Normalized Frequency ( rad/sample)

Nadhia Iffah Saraswati | 7211040019

20
Spectrum
10

Magnitude(dB)

-10

-20

-30

-40

-50
0

1000

2000

3000

4000

5000

6000

7000

8000

Frequency(Hz)

150
Cepstrum

Amplitude

100

0
0

0.002

0.004

0.006

0.008

0.01

0.012

Quefrency(s)

0.014

0.016

0.018

0.02

Speech Processing Lab Manual
No ratings yet
Speech Processing Lab Manual
23 pages
MFCC
100% (2)
MFCC
6 pages
Signals and Systems Lab - Assignment2
No ratings yet
Signals and Systems Lab - Assignment2
12 pages
Voice Recognition PDF
No ratings yet
Voice Recognition PDF
37 pages
Homework 1
No ratings yet
Homework 1
3 pages
An Automatic Speaker Recognition System
100% (1)
An Automatic Speaker Recognition System
11 pages
Listing Code Voice Recognition
No ratings yet
Listing Code Voice Recognition
11 pages
Ee338: Digital Signal Processing Computing Assignment # 2
No ratings yet
Ee338: Digital Signal Processing Computing Assignment # 2
42 pages
Fundamental Frequency Estimation - Frequency Domain
No ratings yet
Fundamental Frequency Estimation - Frequency Domain
5 pages
DSP Da-02 23bec0056 Yash Mehta
No ratings yet
DSP Da-02 23bec0056 Yash Mehta
14 pages
MFCC Code
No ratings yet
MFCC Code
8 pages
Sns Lab 7 19-Ee-0
No ratings yet
Sns Lab 7 19-Ee-0
12 pages
Scribd
No ratings yet
Scribd
9 pages
For End For End: "Sp01.wav"
No ratings yet
For End For End: "Sp01.wav"
2 pages
Scribd
No ratings yet
Scribd
9 pages
Signals Proj 170630
No ratings yet
Signals Proj 170630
8 pages
Team5 Final
No ratings yet
Team5 Final
24 pages
PSC
No ratings yet
PSC
4 pages
This Is Used For Recording Audio Signal Via Mi-Crophone: %NAME: KOY Brosoeu %Group:I4-EA %ID: E20130325
No ratings yet
This Is Used For Recording Audio Signal Via Mi-Crophone: %NAME: KOY Brosoeu %Group:I4-EA %ID: E20130325
5 pages
Assignment 1 Rajveer Saini: Question 2 Code
No ratings yet
Assignment 1 Rajveer Saini: Question 2 Code
3 pages
DSP Da-01
No ratings yet
DSP Da-01
14 pages
5707 Assign1
No ratings yet
5707 Assign1
9 pages
PCS Programs
No ratings yet
PCS Programs
9 pages
DSP Lab 2
No ratings yet
DSP Lab 2
6 pages
Acoustic Feature Analysis For ASR: Instructor: Preethi Jyothi
No ratings yet
Acoustic Feature Analysis For ASR: Instructor: Preethi Jyothi
34 pages
FROMTXTTIMESERIESTOWAVEFILESANDSPECTROGRAMEXTRACTION SEISMIC JupyterNotebook
No ratings yet
FROMTXTTIMESERIESTOWAVEFILESANDSPECTROGRAMEXTRACTION SEISMIC JupyterNotebook
29 pages
Analysisof Speech Signal 29 TH October 2018
No ratings yet
Analysisof Speech Signal 29 TH October 2018
16 pages
Discrete Representation of Signal
No ratings yet
Discrete Representation of Signal
34 pages
Blok Diagram Pitch Correction
No ratings yet
Blok Diagram Pitch Correction
37 pages
% A Program To Draw A Cepstrum of Speech Segment From The Speech Utterance.%
No ratings yet
% A Program To Draw A Cepstrum of Speech Segment From The Speech Utterance.%
3 pages
Speech Feature Extraction
No ratings yet
Speech Feature Extraction
9 pages
Ab Star Action
No ratings yet
Ab Star Action
7 pages
DSP Lab4
No ratings yet
DSP Lab4
6 pages
Exp1 Merged
No ratings yet
Exp1 Merged
11 pages
MSC Data Science - 02 PDF
No ratings yet
MSC Data Science - 02 PDF
37 pages
Implementing Speaker Recognition: Chase Zhou Physics 406 - 11 May 2015
No ratings yet
Implementing Speaker Recognition: Chase Zhou Physics 406 - 11 May 2015
10 pages
Scribd
No ratings yet
Scribd
10 pages
Developing A MATLAB Code For Fundamental Frequency and Pitch Estimation From Audio Signal
No ratings yet
Developing A MATLAB Code For Fundamental Frequency and Pitch Estimation From Audio Signal
16 pages
Ita Posgrad EA 268 Lab-1
No ratings yet
Ita Posgrad EA 268 Lab-1
4 pages
7.0 Speech Signals and Front-End Processing: References: 1. 3.3, 3.4 of Becchetti
No ratings yet
7.0 Speech Signals and Front-End Processing: References: 1. 3.3, 3.4 of Becchetti
50 pages
Sns pbl2
No ratings yet
Sns pbl2
24 pages
ASP Lab Report
No ratings yet
ASP Lab Report
8 pages
Mel Frequency Cepstral Coefficient (MFCC) - Guidebook - Informatica e Ingegneria Online
No ratings yet
Mel Frequency Cepstral Coefficient (MFCC) - Guidebook - Informatica e Ingegneria Online
12 pages
Aryan Raj ASP Aat
No ratings yet
Aryan Raj ASP Aat
9 pages
DSP File
No ratings yet
DSP File
3 pages
ECE471 Lab#3 Due: 3/27/2015 Voice Recording and FFT (20points)
No ratings yet
ECE471 Lab#3 Due: 3/27/2015 Voice Recording and FFT (20points)
1 page
Experiment No. 3: The Fourier Transform - An Audio Signal Is Comprised of Several Single-Frequency Sound
No ratings yet
Experiment No. 3: The Fourier Transform - An Audio Signal Is Comprised of Several Single-Frequency Sound
7 pages
13MFCC Tutorial
No ratings yet
13MFCC Tutorial
6 pages
Audio and Speech Processing - Prof - Muralikrishna H
No ratings yet
Audio and Speech Processing - Prof - Muralikrishna H
28 pages
SNP201 Mini Project
No ratings yet
SNP201 Mini Project
7 pages
MFCC PDF
No ratings yet
MFCC PDF
14 pages
DSP Lab 5
No ratings yet
DSP Lab 5
7 pages
DSP Project 2
No ratings yet
DSP Project 2
10 pages
DSP Lab Mini Project
No ratings yet
DSP Lab Mini Project
7 pages
Biometrics Lecture Speech
No ratings yet
Biometrics Lecture Speech
38 pages
DSP Lab 01-1
No ratings yet
DSP Lab 01-1
4 pages
Eng 6 Audio Signals: Bevan Baas, Andre Knoesen
No ratings yet
Eng 6 Audio Signals: Bevan Baas, Andre Knoesen
30 pages
A First Course in Wavelets with Fourier Analysis
From Everand
A First Course in Wavelets with Fourier Analysis
Albert Boggess
3.5/5 (2)
Statistics for Spatio-Temporal Data
From Everand
Statistics for Spatio-Temporal Data
Noel Cressie
No ratings yet
Some Case Studies on Signal, Audio and Image Processing Using Matlab
From Everand
Some Case Studies on Signal, Audio and Image Processing Using Matlab
Dr. Hedaya Mahmood Alasooly
No ratings yet