Squad Voice Measurement Description: Manual
Squad Voice Measurement Description: Manual
Measurement
Description
Manual
August 2009
SwissQual® License AG
Allmendweg 8
CH-4528 Zuchwil
Switzerland
Internet: http://www.swissqual.com
Office: +41 32 686 65 65
Fax: +41 32 686 65 66
No part of this publication may be copied, distributed, transmitted, transcribed, stored in a retrieval system,
or translated into any human or computer language without the prior written permission of SwissQual AG.
SwissQual has made every effort to ensure that eventual instructions contained in the document are
adequate and free of errors and omissions. SwissQual will, if necessary, explain issues which may not be
covered by the documents. SwissQual’s liability for any errors in the documents is limited to the correction of
errors and the aforementioned advisory services.
When you refer to a SwissQual technology or product, you must acknowledge the respective text or logo
trademark somewhere in your text.
SwissQual®, Seven.Five®, SQuad®, QualiPoc®, NetQual®, VQuad® as well as the following logos are
registered trademarks of SwissQual AG.
SwissQual acknowledges the following trademarks for company names and products:
Adobe®, Adobe Acrobat®, and Adobe Postscript® are trademarks of Adobe Systems Incorporated.
Intel®, Intel Itanium®, Intel Pentium®, and Intel Xeon™ are trademarks or registered trademarks of Intel
Corporation.
Microsoft®, Microsoft Windows®, Microsoft Windows NT®, and Windows Vista® are either registered
trademarks or trademarks of Microsoft Corporation in the United States and/or other countries U.S.
Contents
1 About this Guide................................................................................................................ 1
Introduction .......................................................................................................................... 1
2 SQuad Listening Quality................................................................................................... 2
Introduction .......................................................................................................................... 2
Speech Quality Definition .................................................................................................... 2
SQuad Method..................................................................................................................... 2
MOS Rating ......................................................................................................................... 3
Speech and Noise Level – Received Signal ....................................................................... 4
Channel Gain....................................................................................................................... 4
Clipping................................................................................................................................ 5
DC-Offset............................................................................................................................. 5
Frequency-Shift ................................................................................................................... 5
Delay Spread (Voice Jitter)..................................................................................................6
Speech Threshold................................................................................................................ 6
Degradations ....................................................................................................................... 7
AGC Problems ................................................................................................................ 7
Speech Enhancer / Noise Suppressors .......................................................................... 8
Impulsive Noise ............................................................................................................... 8
Background Noise ........................................................................................................... 8
Interruptions .................................................................................................................... 9
VAD resp. Silence Suppression Problems.................................................................... 10
Variable Delay (Voice Jitter).......................................................................................... 10
Delays Deviation ...........................................................................................................10
Frequency Shifts ........................................................................................................... 12
Quality Code..................................................................................................................12
Option: P.862 'PESQ' ........................................................................................................13
3 SQuad Noise Suppression .............................................................................................15
Introduction ........................................................................................................................ 15
Listening Quality ................................................................................................................ 15
NS-Speech Power Classes ...............................................................................................17
SNRI, Signal-to-Noise Ratio Improvement........................................................................ 18
NPLR, Noise Power Level Reduction................................................................................18
SPLR, Signal Power Level Reduction ...............................................................................19
Overall NS Quality .............................................................................................................20
Quality Index...................................................................................................................... 21
Convergence Time ............................................................................................................ 22
Noise Reduction/Suppression Test ...................................................................................24
ii | Contents
SQuad Voice Measurement Description Manual
© 2000 - 2009 SwissQual AG
Examples ........................................................................................................................... 25
Evaluation of the transmitted signal...................................................................................25
Evaluation of the transmitted signal...................................................................................26
4 DTMF Tests ...................................................................................................................... 27
Introduction ........................................................................................................................ 27
DTMF-Test Overview ........................................................................................................ 27
Criterions ........................................................................................................................... 28
Results............................................................................................................................... 28
5 SQuad Advanced Echo Check (Passive Test).............................................................. 31
Introduction ........................................................................................................................ 31
Echo Measurement ...........................................................................................................31
Measurement Results........................................................................................................31
6 SQuad Advanced Echo Check (Active Test) ................................................................ 35
Introduction ........................................................................................................................ 35
Echo Measurement ...........................................................................................................35
Measurement Results – Echo Evaluation ......................................................................... 35
Measurement Results – Listening Quality.........................................................................37
7 Round Trip........................................................................................................................ 38
Introduction ........................................................................................................................ 38
The Round Trip Method..................................................................................................... 38
Results............................................................................................................................... 38
References ........................................................................................................................ 38
A Appendix .......................................................................................................................... 39
Abbreviations ..................................................................................................................... 39
Figures
Figure 2-1 Block Diagram of the SQuad........................................................................................... 3
Figure 2-2 Main outcomes of SQuad-LQ ......................................................................................... 3
Figure 2-3 Typical MOS-LQ values for Different Codecs ................................................................. 4
Figure 2-4 Frequency Shift ............................................................................................................... 6
Figure 2-5 Histogram of Noised Speech Sample ............................................................................. 7
Figure 2-6 Level Chart with AGC...................................................................................................... 8
Figure 2-7 Similarity Chart with Impulsive Noise.............................................................................. 8
Figure 2-8 Background Noise........................................................................................................... 9
Figure 2-9 Level Chart with Handover.............................................................................................. 9
Figure 2-10 Time Clipping .............................................................................................................. 10
Figure 2-11 Variable Delay, Voice Jitter ......................................................................................... 10
Figure 2-12. Example for Variable Delay, which shows that Block B is delayed for –244 samples
to the left (arrives earlier when compared with the same reference block). Block B arrives later by
244 samples. .................................................................................................................................. 11
Figure 2-13 Frequency Shift ........................................................................................................... 12
Contents | iii
SQuad Voice Measurement Description Manual
© 2000 - 2009 SwissQual AG
iv | Contents
SQuad Voice Measurement Description Manual
© 2000 - 2009 SwissQual AG
SQuad Method
SQuad consists of three main parts. First, a pre-processing unit adjusts reference and coded
sample. Then, an auditory model is used to reduce both samples to their perceptually relevant
features. Finally, an assessment unit evaluates the perceptual difference between reference and
coded sample and outputs the result as a MOS value.
A speech sample is transmitted over a line with generally unknown combination of speech coders.
This speech sample is available in digital form. The sampling frequency is 8 kHz and the digital
quantization is 16 bits. As an initial step, the source speech signal is read into the vector x(i) and
the coded speech signal into the vector y(i). These speech signals are synchronized with respect
to both time and level. The DC offset must be removed from every sample. In addition, the signals
are normalized to a common RMS (Root Mean Square) level, to ensure that the constant
amplification factor is not taken into account.
The signals are split into processing units of 32 ms duration, also called Frames. The unit overlap
is 50%. During the first processing step, the frame is multiplied by a hamming window. The
source signal x(t) in the time domain is now transformed to the frequency domain using a discrete
Fourier transform, followed by computation of the squared magnitude FFT spectrum. Both signals
are filtered using a filter equivalent to the receiving curve of the corresponding telephone handset.
A rough approximation of the time masking is already achieved through the frame overlapping
during the signal pre-processing. The comparison method of SQuad is based on the following
principle; Signal parts with high energy are more important for the perceived speech quality. A
similarity coefficient for reference and impaired signal is computed for 4 different energy
thresholds. Only the parts of the signal exceeding the respective threshold are considered. This
can be viewed as a multi-resolution analysis with respect to signal energy. The “overall
similarity” is then computed using the coefficients from all thresholds. A polynomial is used to
transform the comparison result to the ITU MOS scale. The length of the speech sample varies
between 4 and 30 seconds.
Degraded
signal Network
Estimation estimation
alignment Quality
QLQ
- echo
- call setup quality
Frequency Psychoacoustic
equalization s Other
modelling measured
Referenc data
e
MOS Rating
Speech Quality is defined as a measure of a listener’s satisfaction and is generally expressed as
a Mean Opinion Score (MOS). SQuad delivers MOS rating as one number, ranging 1 to 4.5, fully
in accordance to the Listening Scale defined in ITU’s P.800 recommendation. This is not exactly
the same scope as MOS which is defined with 1-5. This is allowed since based on subjective
tests used for the validation of Squad-LQ the values above 4.5 have almost never appeared.
As described in ITU’s P.800 recommendation Annex B.4.5, various five-point category-judgment
scales may be used for different purposes. The Listening Only quality scale is the most
frequently used for ITU-T applications:
The following picture gives an overview about the obtained results in the main section of NQDI:
Channel Gain
This is a value in dBr, which shows the power level of the received signal relatively to the
reference (input) signal. Because, SQuad-LQ is applied to the electrical interfaces of the
connection, the terminal depending Send Loudness Rating (SLR) and the Receive Loudness
Rating (RLR) as well are modelled in SQuad-LQ itself. In Principle, SQuad-LQ is connected to the
so-called 0dbr-point of the networks input. At this 0dBr point a nominal level of -26dBov
(corresponds to -20dBm at a four-wire 600 Ohms interface) will be inserted.
The Channel Gain reflects only gains or attenuation caused by network (exception: attenuating
PSTN subscriber loops). It is close to the so-called JLR (Junction Loudness Rating) but does not
apply any spectral weighting.
In a transparent ISDN connection the Channel Gain should be around 0 dB. In principle also in a
Mobile-to-ISDN or Mobile-to-Mobile connection this value should be around 0dB too. Caused by
individual signal amplifications of cellular network providers this value might differ. Mainly they
amplify the signals, so a gain in the positive range can be observed. If a overall gain of 6dB is
exceeded, amplitude clipping may occur. This will lead – like in a real call – to quality impacts and
result in a lower SQuad-LQ score.
On the other way around, an attenuating PSTN subscriber loop may lead to negative Channel
Gains because it is part of the evaluated transmission chain. Like a PSTN phone, which is more
sensitive, also SQuad-LQ gain internally such attenuated signals to a nominal level of -26dBov
(corresponds to 79dB sound pressure level at the subscriber’s ear).
To inform the user of SQuad-LQ, within NQDI Channel Gains outside of the expected range are
highlighted. The expected range is here +6…-9dB and in an extended range down to -15dB.
The Channel Gain is available as a single overall value in dBr (total Gain) but also as a range of
values in the time domain (every 16ms) like a an attenuation profile. Based on this attenuation
profile values a chart can be created providing information on:
AGC (Adaptive Gain Control) Elements that are not working correctly
Level Jumps (for example after a handover)
Level Interruptions (for example interruptions in the audio path or during handovers)
Clipping
Temporal Speech Clipping (also called front-end clipping) is the loss of speech frames. It may
occur when voice activity detection is used, when Digital Circuit Multiplication Equipment
(DCME) is used or during uncontrolled slips. Time clipping is presented as clipped frames in a
function of time.
Clipping is an annoying phenomenon that cuts off a bit of speech in the instant it takes for the
transmitter to detect presence of speech. It is almost impossible to eliminate clipping in a
traditional circuit-switched voice conversation. Using circuit switching, the transmitter is not turned
on until sound is detected, and by then, a piece of the speech has been clipped off. SQuad
detects this clipping and generates the results as a distribution of time. The resolution of the
clipping measurement is 8 milliseconds. First, the mean energy per 8 milliseconds is calculated.
The energy values are then saved for each frame (both reference and coded). After the whole
speech sample has been processed, the post processing of time clipping data is done. There are
some simple rules during this post-processing:
Time clipping can only occur during transitions pause-speech.
Minimum pause length must be reached. In our case, it is 64 milliseconds.
The difference Energy (ref) – Energy (cod) must be at least 10 dB.
Clipped frames are succeeding frames.
The clipping measurement values are indicated as an average % value per sample (number of
active speech frames / number of clipped speech frames) and as a time domain distribution. Time
Clipping in SQuad-LQ is calculated each 8 ms, but only an average value of two succeeding
frames is reported in output file.
DC-Offset
This number shows the DC-Offset of the coded signal in percentage. This is an important piece of
information if the measured speech quality is lower than expected. Various interface problems
(impedance, coding technique, HW) can produce DC-offset discrepancies.
DC Offset is calculated as
100 * average_audio_voltage / Max_audio_voltage
Max_audio_voltage for 16 bit digital resolution is equal 2^15 (32768).
For example: average_audio_voltage=300 results in DC_Offset=100*300/32768=0.91%
Frequency-Shift
A low bit rate encoder can move the formants (spectral peaks) of the speech. This degradation
can be described as frequency shift of one or more components of the source signal. This
drift is measured as a percentage of moved frequency components in the speech active phases.
The result is a number of pos- and neg -shifted frames in %, reflected in a compressed
frequency (bark). Figure 2-2 shows a typical situation for one processing buffer of voice signal
(32ms).
Speech Threshold
This is a value in dBov, which shows a level of the speech in a coded signal. The measurement
is based on building of r.m.s. histograms for both coded and reference signals. dBov means
decibel relative to a digital over-load point. The range for this value is –90 to 0 dBov. For signals
containing background noise, this value is between –55 to –40 dBov.
A histogram evaluates an individual frequency for a set of data bins. The result is a number of
occurrences of a value in a data set. A histogram table presents the energy-grade boundaries and
the number of scores between the lowest bound and the current bound.
25
Noise position
20
Speech level
15
Count
Bound position
10
0
-53.0
-51.5
-49.9
-48.3
-46.7
-45.1
-43.5
-41.9
-40.4
-38.8
-37.2
-35.6
-34.0
-32.4
-30.8
-29.3
-27.7
-26.1
-24.5
-22.9
-21.3
-19.7
-18.2
-16.6
-15.0
RMS of the coded Signal (dB)
Degradations
The below list present some possible degradation reasons for the Listening Quality Value using
a clean reference sample:
AGC (Adaptive Gain Control) Elements
Speech Enhancer / Noise Suppressors
Impulsive Noise
Background Noise
Interruptions
VAD (Voice Activity Detectors)
Variable Delay or Jitter in Packet Networks
AGC Problems
Indications: LQ less than expected, Level Chart indicates an abnormal level trend.
Example of an AGC of a mobile handset that attenuates too strong toward the end of a sample:
Impulsive Noise
Indications: LQ less than expected. Similarity Chart shows a lot of quite big degradation peaks.
Background Noise
Indications: LQ less than expected. Signal Envelope Chart shows some additional energy
during the speech pause.
Example:
Interruptions
Indications: LQ less than expected. Similarity Chart shows blue bars and Signal Envelope
indicates a ‘peak’ drop.
Example of an Interruption due to a Handover (interruption is indicated in blue):
Delays Deviation
“DelaysDeviation” is placed in the section “!SQuad_LQ_AVG” (in Squad result file) and is
defined as an absolute value of the standard deviation of block delays (D), divided by an average
of block delays [in samples]. The duration of one sample at 8000 Hz, sampling frequency is 125
µs. “DelaysDeviation” shows the smoothness of an array of delays. Small “DelaysDeviation”
value means there is a uniform delay distribution, where a large value indicates a big delay-jitter
like in IP networks. For only one single delay, this value is equal zero.
⎡ stdev( D) ⎤
DelaysDeviation = fabs ⎢ ⎥
⎣ average( D) ⎦
Example: Coded file has fixed offset of 1024 samples to the reference file. Six blocks with
different variable delays are found with Squad-LQ:
Table 2-1 Example for Variable Delay where five blocks are elayed at different offsets regarding Reference
Speech Sample
N ⋅ ∑ D 2 − (∑ D )
2
1
Stdev(D)=
N
=241.53 average(D)=
N
∑ D =1115.3
DelaysDeviation=241.53/1115.3=0.217
Delay Spread is also another important parameter, which describes the maximum delay
amplitude calculated over all single group delays. Based on the example above, we can calculate
new Delay Values (D”), which are scaled D values by subtracting a fix delay from all other
values.
Figure 2-12. Example for Variable Delay, which shows that Block B is delayed for –244 samples to the left
(arrives earlier when compared with the same reference block). Block B arrives later by 244 samples.
Delay Spread is calculated as a distance between the minimum and the maximum block delay. In
our example, minimum value is –512 samples and maximum is +512 samples. So the distance
between max and min equals 1024 samples. This is then converted to time in ms.
DelaySpread = Delay _ smp ⋅ smp _ duration
1
smp _ duration =
Fs
Fs = 8000 Hz
In the calculation for our example, we get the value for DelaySpread=1024/8000=128 ms.
Frequency Shifts
The distribution of the frequency shifts is shown in the histogram below, with the number of
frames in which a shift at a certain frequency occurred. The diagram covers the whole range of
frequencies in steps of 31.25 Hz.
Quality Code
The thresholds for each degradation descriptor is, as follows:
“MOS-drops”
Quality distribution is unsteady such as during handovers or interruptions.
“Received signal level out of recommended range”
The level difference to the reference level exceeds +9dB or falls below -12dB.
“Signal interruptions”
Temporal clipping for more then 8 ms.
“High DC-Offset”
Malfunction of terminal or interface card. DC-Offset > 0.2%.
“Variable delay”
Indicates possible packet-switched transmission.
“Variable delay during speech”
Same as “Variable delay” but occurring during speech active intervals.
“Background noise”
High level of circuit noise. Higher then –50 dBov.
“Impulse noise”
Relay/switching problems detected. More then 1 pulse / second.
“Low bitrate coding / coding artefacts”
Low bit rate coding scheme has been used (e.g. Less then 8 kbit/s) or residual errors from
decoding are introduced (e.g. by frame loss concealment).
“Not Specified”
signalizes that the speech quality is degraded but no outstanding reason for that degradation
could be classified.
“OK”
shows that the speech quality is nearly non-degraded
Furthermore, special problems in the audio-path will be reported:
“Silence/Audio Level Too Low”
There is no signal activity in the audio path or the signal level is below -45dBov. SQuad-LQ will
not calculated since it will lead to misleading results.
“Corrupted Signal/Wrong Reference”
Here the received audio signal is heavily corrupted (e.g. only partly transmitted or the audio
stream was lost completely). Such a behaviour can observed e.g. during a call drops. Normally,
SQuad will score those signals with close to 1.0. For statistical reasons, NQDI allows the
exclusion of such results from the reporting.
This indicator will also signalize if a wrong reference signal was used for SQuad.
5.0
Scale limit = 4.5
4.5
4.0
3.5
P.862.1
3.0
2.5
2.0
1.5
1.0
-0.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
P.862
Note: The P.862 results are a bit lower in tendency compared to SQuad-
LQ especially in the range from 3.0 … 4.0. It is mainly caused by a high
sensitivity of P.862 regarding clipping and time-variant filtering.
It has taken also into account that P.862 does not rate any linear distortions such as frequency
responses. Those linear distortions will be compensated completely by P.862 itself before the
quality prediction starts.
The P.862 option has to be enabled by the test-type 'Speech-P.862' and requires a special
software key.
Listening Quality
Speech Quality is measured according to ITU’s P.800 where the coded file and the clean
reference are inputs for SQuad LQ algorithm. The algorithm is elaborated in Section 2.
The Listening Quality evaluation is running twice. The LQ of the noised input signal is estimated
and the LQ of the de-noise output signal as well. From both results the change of the speech
quality is derived.
3: Much Better
2: Better
1: Slightly Better
0: About the Same
−1: Slightly Worse
−2: Worse
−3: Much Worse
The CCR methods are particularly useful for assessing the performance of telecommunications
systems when the input has been corrupted by background noise. An advantage of the CCR
method over the other scales is the possibility to assess speech processing that either degrades
or improves the quality of the speech.
Figure 3-2 The Five Energy Windows for 16 bit Digital System (90.3 dB dynamics)
The definition of Windows is given in Table 3. The aim of this measurement is to detect the
influence of noise reduction circuits on speech parts of the signal.
Five SPLR values are calculated: SPLRh , SPLRm , SPLRl , SPLRn and SPLR p . SPLRn is
equal to NPLR value. Good noise reduction would generate SPLRh closed to zero and
SPLR p below –10 dB. The trend curve down through these five values shows the quality and
ability of noise reduction circuit to reduce only noisy frames and to keep unchanged the speech
active frames. In other words, the first coefficient (a) of the trend curve y=ax+b must be negative
(see example in Figure 16). The SPLR measure in SquadNS algorithm is equal to this coefficient
(a) of the trend curve.
Figure 3-4 SPLR Calculation out of Five Values Calculated in Five Different Energy Windows.
The bottom picture shows good noise reduction, whereas on the right is shown poor noise
reduction.
SPLR is then mapped to a new range 1 – 4.5 (like MOS scale). This mapping from SPLR to
SPLRm is shown in Figure 16. SPLRm > 2.5 should be achieved for good noise reduction.
Overall NS Quality
Quality Index
The following table shows some measurement examples for different network conditions including
noise reduction effects:
Convergence Time
For the measurement of the Convergence Time in a noisy signal, the algorithm examines the first
two seconds of the given signal. For the calculations it uses the filtered difference between the
coded and the reference signal (red color, see Figure 3-6).
Figure 3-10 Example of Convergence Time Evaluation
First it checks whether the signal belongs to the noise or pause group and then it compares data
with the set threshold. The threshold is calculated as NPLR + 25 (default, use PERCENT to
change) percent of the difference between the maximum value of the filtered signal in these first 2
seconds and noise level afterwards (NPLR). If the filtered data is lower than the threshold the first
condition for the convergence is fulfilled (see Figure 3-7).
Filtered difference
0
Convergence
1 11 21 31 41 51 61 71 81 91 101 111 121 131 141 151 161 171 181 191
-5
-10
-15
-20 Threshold
-25
Figure 3-11 Filtered Difference Envelope is Compared with the Threshold Value
The second condition is that the signal has a falling tendency. To verify that we check 5 (default,
use CT_NR_POINTS to change) equally spaced points over the tested convergence time. In case
of the falling signal the difference in values between every two consecutive points has to be less
than zero. In Fig. 21, we see that the difference between signal values in third and forth points is
bigger that 0, which signifies raising tendency of the signal. Here we perform additional check to
clarify what is actually going on.
30
20
10
0
106
113
120
127
134
141
148
155
162
169
176
183
190
15
22
29
36
43
50
57
64
71
78
85
92
99
1
8
-10
Threshold
-20
-30
-40
Figure 3-12 Five Point Analysis of the Difference Envelope during Decision on Noise Reduction State
This test is based on the average level of the signal before and after the first convergence
criterion is met. If the average level of the signal after falling below the threshold is less than that
threshold, and the average level of the signal before that point, is higher than the same threshold,
we say that the signal has converged. If not, the algorithm continues searching for convergence
until the end of 2 seconds buffer.
30
20
10
Mean signal level
0 before threshold
101
111
121
131
141
151
161
171
181
191
11
21
31
41
51
61
71
81
91
1
-10
Threshold
-20
Mean signal level
-30 after threshold
-40
Examples
The Signal Envelope shows that the noise is really reduced and the speech part is more or less
the same as for the reference signal.
4 DTMF Tests
Introduction
In telecommunications today, the most used signalling system is DTMF signalling. DTMF stands
for Dual Tone Multi-Frequency. As the name suggests, the DTMF signal consists of two
superimposed sinusoidal waveforms with frequencies chosen from a set of eight standardized
frequencies.
When a DTMF signal is sent over a network it can be degraded, especially when it is encoded.
For an operator of a network, it is of interest to know if the receiver of the DTMF signals can
convert the DTMF signal back into a digit or a symbol. The objective is to measure the percentage
of detected and undetected DTMF digits.
In the first part of SwissQual's algorithm for DTMF test, the algorithm scans through a given signal
and detects the locations of DTMF signals. Once a DTMF signal is found, the algorithm calculates
the characteristics and decides if the signal is valid. If the tone is invalid, the DTMF-Test
describes which condition that was not accomplished. The algorithm collects all characteristics
and saves them in a file.
The DTMF signal used for tests, which consist of two frequencies. According to the CCITT
Recommendation Q.23 [5] and Q.24, there are two frequency groups, each with four frequencies:
The figure below shows how the frequencies are allocated to the various digits and symbols of a
push-button set. Every digit and symbol consists of a frequency from the low and the high group.
Figure 4-1 Allocation of Frequencies to the Various Digits and Symbols of a Push-button Set
DTMF-Test Overview
One or more DTMF signals are sent over a network. The coded signal will be used for the DTMF-
Test. This signal is available in digital form; the data format is PCM (without compression). The
sampling frequency is 8 kHz or 16 kHz. The digital quantization of the signal can be 8 bit
(unsigned or signed) or 16 bit (big or little endian). Inside, the algorithm works with 16 bit
resolution. The figure below illustrates the basic algorithm of DTMF-Test. DTMF-Test saves its
result in a comma delimitated text file.
Criterions
The objective of the SwissQual model for DTMF testing is to measure the percentage of
undetected DTMF digits processed through the network. The DTMF signals are generated at the
frequencies specified in the ITU-T Rec. Q.23.
The algorithm follows the ETSI guidelines defined in "TS 101 235-1" "Technical Specification of
Dual Tone Multi-Frequency (DTMF)".
The received DTMF signal shall be detected as valid when:
Only two of the signalling frequencies are present, one from the high group and one from the low
group, fulfilling the conditions as described above
Each of these signalling frequencies are within +/-(1,5 % +2 Hz) of the nominal value
The level of each of these two signalling frequencies is within the range -27 dBV to -5 dBV
The difference in level of these two signalling frequencies is not more than 6 dB.
Results
Table 4-1 DTMF Result Code
Code Description
Tone Length Length of a DTMF Tone
Pause Length Pause Length between two DTMF tones
Measured Level Average Level of a DTMF Tone
Code Description
Level Deviation Level Deviation of the two frequencies of a DTMF Tone
Freq. Low Low Frequency value in Hertz
Freq. High High Frequency value in Hertz
DevFreqLow [Hz] Deviation of the low frequency from the standard in Hertz
DevFreqHigh [Hz] Deviation of the high frequency from the standard in Hertz
DevFreqLow [%] Deviation of the low frequency from the standard in percent
DevFreqHigh [%] Deviation of the high frequency from the standard in percent
Twist [dB] Level difference between the high and the low frequency
Signal Valid Signal valid code:
Cause: Valid (0)
If the received tone matches all conditions, then the signal is valid
and the field 'SignalValid' is set to '0'
Cause: TooShort (-1)
If the received tone is too short (<40ms) then the signal is invalid
and the field 'SignalValid' is set to the code '-1'
Cause: NoDigit (-2)
If the received tone does not contain the two frequencies as
specified, the signal is invalid and the field 'SignalValid' is set to '-2'
Cause: LowLevel (-3)
If the Noise Level is more than 10% of the Signal Level of a tone,
then the signal is invalid and the field 'SignalValid' is set to '-3'
Cause: FreqDeviation (-4)
If one of the two frequencies is out of the specified range (+/- 1.5%),
then the signal is invalid and the field 'SignalValid' is set to '-4'
Cause: LevelDiff (-5)
If the level difference of the two frequencies for one tone is more
than 10dB, then the signal is invalid and the field 'SignalValid' is set
to '-5'
Cause: Unknown (-6)
If there is no tone at all, then the signal is invalid as well and the field
'SignalValid' is set to '-6'
Cause: TooLong (-7)
If the received tone is too long (>90ms), then the signal is invalid
and the field 'SignalValid' is set to the code '-7
Code Description
'SignalMatch' is set to the code 'B'
Cause: MissingTone (C)
If a reference tone has no match but there are two or more irregular
tones, the field 'SignalMatch' is set to the code 'C'
Cause: MultipleMissingTones (D)
If there are two or more reference tones with no match but three or
more irregular tones, the field 'SignalMatch' is set to the code 'D'
Cause: MultipleMissingTones (E)
If there are more reference tones with no match than irregular tones,
the field 'SignalMatch' is set to the code 'E'
Cause: MultipleTone (F)
If a reference tone has two or more matching tones in the coded file,
the field 'SignalMatch' is set to the code 'F'
Cause: AdditionalTone (G)
If there is no reference for a tone in the coded file, the field
'SignalMatch' is set to the code 'G'
Cause: MissingTone (H)
If for a reference tone there is no tone in the coded file, the field
'SignalMatch' is set to the code 'H' Cause: Disparity (I)
If the number and order for a string of tones in reference and coded
file cannot be matched, the field 'SignalMatch' is set to the code 'I'
Echo Measurement
This Advanced Echo Check Passive Test (AEC passive) does not simulate anything on B-
Side. The A-Side starts a call and after B-Side has answered the call; the collecting of the down-
link (B->A) audio stream is started. When the recording of the stream has finished, the search of
echo signal is started by comparing the registered signal with the reference signal.
On the B-Side, we can use any (self-answering) voice terminal or a SwissQual Diversity
measurement probe. The AEC algorithm is able to detect echo in presence of background noise
and double talk.
The algorithm is running in two steps:
• Observing a wide range of echo delay for possible echoes (scan procedure)
• Analysing accepted echo regions in detail for calculating the echo loss and the other results
Measurement Results
The AEC algorithm generates the following results:
Signal type
Echo Delay in milliseconds
Echo Loss during Single Talk acc. ITU-T G.122
Echo Loss for the complete signal (incl. Double Talk)
Echo Objection Rate acc. ITU-T G.131 in %
1
Please remark that in case of pure single talk situations a powerful echo region might classified
as Double Talk and a Double Talk Ratio of some percentage will shown if the adaptive Double
Talk Threshold is exceeded. This might be observed especially for time varying echo paths.
Furthermore, a huge mount of noise in the receiving part may be classified as double talk too.
Echo Measurement
The SQUAD AEC active measurement is using the same echo detection approach as the passive
measurement described above. Compared to the passive measurement, where the far-end side is
silent in the active mode the far-end side will create an echo actively. The SQUAD AEC active
measurement includes an inband synchronization between both sides. In a first communication
the incoming signal will be recorded at the far-end side. Based on that signal an echo is
generated by applying selectable echo path responses on that. If required, the generated echo
can be interlaced with double talk. During receiving the signal second time, the pre-processed
echo will be played back to the sending side for evaluation.
This measurement is especially designed to detect and rate echo cancellers or suppressors in the
network. The generated echo will challenge these echo cancellers and possible integrated level-
switching devices will be forced by an inserted double talk signal.
The echo-detection is more confident if the remaining echo has linear components. Especially
during double talk, only linear dependent echoes can be detected. In connections including low
bit-rate codecs and/or non-linear processors the residual or low echoes might be non-detectable
by the measurement. For more confidence chose higher echo levels to increase their
differentiation from doubletalk and other non-linear components.
2
Please note that the channel gain will also influence the measured echo loss. Basically, the
channel attenuation in both directions has to be added to the defined echo loss at far-end side.
The measuring signal is attenuated due to the transmission from A to B, is there attenuated again
(during the defined loss value) and will be attenuated again due to the transmission from B to A
again. The echo loss reflects the level of the received echo compared to the original measuring
signal.
So a simple Listening Quality measurement can be done in parallel. The interesting point is here:
How the Listening Quality is affected by double talk/echo in the other direction. By comparison of
both Listening Quality values the double talk capability can be evaluated. If a network is fully
duplex both Listening Quality values should be the same even a double talk signal is chosen.
The right value gives the Listening Quality for the first transmission where no echo or double talk
is played back. The right value gives the LQ during the echo / double talk is sent at the same
time. In addition the channel gain and the clipping of the received signal is also given.
Please note that a strong side-tone at that B-side may affect the SQuad-LQ measurement,
because it interleaves with the received and evaluated signal.
7 Round Trip
Introduction
The Round Trip Time is the time a signal needs to travel from the near end side to the far end
side and back. The Round Trip Time is mostly close to the delay of the latest possible echo. The
time speech needed to travel from one talker to the other (One Way Signal Delay) is an
important indicator of the conversational quality of a call. A travel time that is too high leads to the
annoying effect that the talkers interrupt each other unintentionally.
Results
The measurable Round Trip Time is limited from 4ms in minimum to 3000ms in maximum; the
maximal delay jitter between the three repetitions within one measurement has to be below
±500ms. The results of the measurement are presented in the following table. In addition, the
lowest of the one way and round trip time of the measurements in milliseconds is shown as final
results.
References
ETSI TS 101 329-2 V1.1.1 (2000-07), Part 2: Definition of Quality of Service (QoS) Classes
A Appendix
Abbreviations
Abbreviation Description
ACR Absolute Category Rating
CELP Code Excited Linear Prediction
DCR Degradation Category Rating
DMOS Degradation Mean Opinion Score
MOS Mean Opinion Score
dBov dB relative to the overload point of a digital system
ADPCM Adaptive Differential Pulse Code Modulation
BFI Bad Frame Indication
CCITT Comité Consultatif International Télégraphique et Téléphonique (The
International Telegraph and Telephone Consultative Committee)
CDMA Code-Division Multiple Access
CRC Cyclic Redundancy Check (3 bit)
DAC Digital to Analogue Converter
DMR Digital Mobile Radio
DTMF Dual Tone Multi-Frequency (signalling)
DTX Discontinuous Transmission (mechanism)
EPROM Erasable Programmable Read Only Memory
ETR ETSI Technical Report
ETS European Telecommunication Standard
ETSI European Telecommunications Standards Institute
FER Frame Erasure Ratio
FR Full Rate
GMSK Gaussian Minimum Shift Keying (modulation)
GSM Global System for Mobile communications
GSM MS GSM Mobile Station
HANDO Handover
HDLC High level Data Link Control
HR Half Rate
IEC International Electro-technical Commission
ISDN Integrated Services Digital Network
ISO International Organization for Standardization
Appendix A | 39
SQuad Voice Measurement Description Manual
© 2000 - 2009 SwissQual AG
Abbreviation Description
ITU International Telecommunication Union
LAN Local Area Network
MSC Mobile-services Switching Center, Mobile Switching Center
OSI Open System Interconnection
PABX Private Automatic Branch eXchange
PDN Public Data Networks
PSPDN Packet Switched Public Data Network
PSTN Public Switched Telephone Network
QOS Quality Of Service
RXLEV Received signal level
RXQUAL Received Signal Quality
S/W Software
SIM Subscriber Identity Module
SS7 Signaling System No. 7
TDMA Time Division Multiple Access
TE Terminal Equipment
VAD Voice Activity Detection
40 | Appendix A