System Guidlines PDF
System Guidlines PDF
You will be listening to the dialogue that will likely contain multiple speakers. Your job is
to identify and mark when each speaker is speaking and segment the corresponding
audio.
Some of the audio will contain background noise, background music, and ringtones; this
must be marked too.
**Do not focus on the above two points, work on these points only if
confident of unintelligible and PII parts. (Please refer to the prefill text for
gaining confidence.)
7. 20 speaker rule
This rule indicates that if a task contains more than 20 speakers, it cannot be
continued any further. So, the task needs to be stopped when the 21st speaker is
introduced.
**Note that 30 seconds, 100 ms, and 500 ms rule does not apply to annotations.
FAQ for LT system
1. How much gap should be given at the beginning/end of the
segment?
The segment beginning/end must contain a 100MS gap. Please be sure that the
beginning and end of each segment contains a 100MS gap at most. Do not exceed
100MS. It can lay somewhere in between 85-100MS, but should not exceed 100MS. Be
more precise on this rule in every created segment and note to apply the 100MS rule
accordingly in each created segment. Do not use this for annotations.
Here, the speaker has started speaking from 00:00.090 and the segment has begun
from 0:00.000 which lies in between 85-100MS. Please use this rule at the segmentation
end as well.
Here, the speaker stops speaking at 00:19.202 thus the segment ends at 00:19.299,
which is approximately a 100MS gap.
1 music annotation can be used for different music sounds at the same time.
If the speaker is laughing and speaking at the same time then we should include it in
both segments and annotation.
If the speaker laughs between the conversation, we should annotate it as laughter but
we have to check if the laughter is more than 500ms so that we could split the segment
using 500ms gap rule.
Here, the speaker speaks from 12:58.004 to 13:28.580 which is greater than
30-seconds. Create a segment upto 30-second mark, i.e. 13:28.004 and create a new
segment from 13:28.005. Thus, at every 30-second mark a new segment must be
created if a single segment intends to run for longer than 30 seconds.
10. How much gap should be given between two segments after
using 30 second split rule?
We should give 1ms gap between the two segments. For example: If one segment stops
at 0:30:579 then another segment should start at 0:30:580.
**Please be cautious that multiple segments of the same speaker must contain at
least a gap of 1MS so that it does not overlap the previous segment. If a segment
ends at 00:03.579 and PII begins, then the start time of PII must be 00:03.580 and
so on.
11. Should we give 100ms gap when using the rule of 30 second
split?
If the speaker is speaking for exactly 2 minutes, we should create 4 segments every 30
second.
**Here is the beginning of the segment. We should give 100ms gap as per the rule.
**In this screenshot we split the segments and here we should not give 100ms gap. If we
give 100ms gap from one of the segments then we would overlap each other. So, we
just give 1ms gap between two segments because the speaker is speaking continuously
and we don’t want to miss a word the speaker is speaking.
**Here the speaker has stopped the speech. Now we can give 100ms gap at the end of the
segment.
Topics covered:
1. 100ms rule
2. Creating new speaker
3. 30 second rule
4. 500ms rule
5. Labelling the speakers
6. 1ms rule