TCS Bangla Guidelines

Uploaded by

Nirban saha

I. Glossary

Term | Definition
Speech | Clear human voice
Discard | Get rid of
Intercept | Cut off / seize
Transcribe | To make a written copy of speech
Modal words | Ha Ha; Wow; Oh; Aha
Default cut | A piece of audio intercepted by default (by the system)
Current cut | The audio that you cut
Overlapping speech | Two or more people speaking at the same time
Homophone | Two or more words having the same pronunciation but different meanings or spelling
Dialect | A language used by the people of a specific area/district
Accelerated | Increased in speed

II. Annotation Guidelines

1. Audio classes
There are 2 options for audio classes: 【speech】 and 【discard】. Here are the definitions:

1. speech:
   1. Choose speech when you can select a speech part that is in the language you transcribe and that speech part is clear.
   2. You need to transcribe text from the audio only when you choose speech.
2. discard:
   1. The entire audio is not in the language you transcribe.
   2. The entire audio is unclear or inaudible speech.
   3. The entire audio is songs or non-human sound, which includes melodies, animal sounds and nature sounds.
   4. The entire audio contains only modal words.
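
The audio class rules above can be sketched as a small decision function. This is only an illustration: the `Clip` fields and the `audio_class` name are hypothetical and stand for judgments the annotator makes by ear; they are not part of any labeling tool.

```python
from dataclasses import dataclass


@dataclass
class Clip:
    """Annotator's judgments about one audio clip (hypothetical field names)."""
    in_target_language: bool      # some part is in the language being transcribed
    has_clear_speech: bool        # at least one clear, audible speech part exists
    only_song_or_non_human: bool  # entire audio is songs, melodies, animal or nature sounds
    only_modal_words: bool        # entire audio is only modal words (Ha Ha, Wow, Oh, Aha)


def audio_class(clip: Clip) -> str:
    """Return "speech" or "discard" following the four discard rules."""
    if not clip.in_target_language:
        return "discard"
    if not clip.has_clear_speech:
        return "discard"
    if clip.only_song_or_non_human:
        return "discard"
    if clip.only_modal_words:
        return "discard"
    # A clear speech part in the target language exists, so it must be transcribed.
    return "speech"
```

For example, `audio_class(Clip(True, True, False, False))` gives `"speech"`, while a clip that contains only "Wow" and "Oh" is discarded.
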
2. Cut speech

3. Text transcription

III. Added explanation

1. A double space between words is acceptable.
Note:
1. Please double-check that the text aligns with the audio before moving on to the next section.
2. Transcribe what you hear, including ungrammatical speech.
3. Transcriptions must be 100% accurate to the cut speech part.
4. All symbols and numbers in the audio must be transcribed as the corresponding words in your language.

Evaluation: Intercept and Written Error Guide

Scenario 1: The beginning and the end of the audio contain unclear speech, and a word in the middle is misspelt in the text transcription.
Definition of unclear speech: words spoken too slowly to be put together, baby babbling, low volume, or fuzzy audio, as long as it is caused by human speech.
Answer: This situation has an intercept error and a written error. Both are accepted.

Scenario 2: The beginning and the end of the audio contain noise, and a word in the middle is misspelt in the text transcription.
Definition of noise: background music, fireworks, rainfall; that is, sound not related to human speech.
Answer: Written error. We ignore the noise at the beginning and the end.

Scenario 3: The beginning of the audio was not cut accurately, so a word is only half heard. The rest of the audio has no written error.
Explanation: The intercept error caused the written error (because the audio was not cut properly, the word was misunderstood).
Answer: Intercept error only.

Scenario 4: The beginning and the end of the audio are complete silence that was not cut off, and a word in the middle is misspelt or missing in the text transcription.
Explanation: This situation has 1 type of error.
Answer: Written error. (Silence can be ignored.)

Scenario 5: The beginning and the end of the audio are complete silence that was not cut off, and the audio in the middle has no written error.
Answer: This case can be passed.
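
The five scenarios above can be summarised as a small lookup, sketched here in Python. The `Edge` enum and the `evaluate_cut` function are hypothetical names used only for illustration. Combinations that the scenarios do not mention (for example, noise at the edges with no written error) follow from the same two rules by extrapolation and are an assumption, not a documented answer.

```python
from enum import Enum


class Edge(Enum):
    """What the beginning/end of the current cut contains (hypothetical labels)."""
    CLEAN = "clean"
    SILENCE = "silence"                # can be ignored (Scenarios 4 and 5)
    NOISE = "noise"                    # non-human sound, ignored (Scenario 2)
    UNCLEAR_SPEECH = "unclear speech"  # human speech that is unclear (Scenario 1)
    CLIPPED_WORD = "clipped word"      # cut falls inside a word (Scenario 3)


def evaluate_cut(edge: Edge, written_error_elsewhere: bool) -> set[str]:
    """Return the set of evaluation conclusions for one case.

    written_error_elsewhere: a misspelt or missing word that is NOT
    caused by the bad cut itself (Scenario 3 counts only the intercept error).
    """
    errors = set()
    if edge in (Edge.UNCLEAR_SPEECH, Edge.CLIPPED_WORD):
        errors.add("intercept error")
    if written_error_elsewhere:
        errors.add("written error")
    return errors or {"pass"}


# The five scenarios from the guide, in order.
print(evaluate_cut(Edge.UNCLEAR_SPEECH, True))
print(evaluate_cut(Edge.NOISE, True))
print(evaluate_cut(Edge.CLIPPED_WORD, False))
print(evaluate_cut(Edge.SILENCE, True))
print(evaluate_cut(Edge.SILENCE, False))
```
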

[Labeling] Queue SOP - Evaluation (TH-CS)

1. Production index
1. Productivity Requirement: 150-200 cases/h
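
As a quick worked check of what this requirement means per case, the hourly target can be converted to a time budget. The function name `seconds_per_case` is hypothetical and the snippet is plain arithmetic, not part of the SOP.

```python
SECONDS_PER_HOUR = 3600


def seconds_per_case(cases_per_hour: float) -> float:
    """Average time available for one case at a given hourly rate."""
    return SECONDS_PER_HOUR / cases_per_hour


# 150 cases/h leaves 24 seconds per case; 200 cases/h leaves 18 seconds.
print(seconds_per_case(150))  # 24.0
print(seconds_per_case(200))  # 18.0
```
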

2. Components of moderation interface

1. Audio part
2. Labeling index
3. Evaluation
4. Default cut: duration of the original audio, given by the machine (grey area).
5. Current cut: the speech part made by the first verifier.
6. Audio classes:
   1. Default option made by the first verifier
   2. Speech - clear human speech
   3. Discard - audio does not meet ASR speech requirements and needs to be discarded.
7. Text box: transcribe audio of clear human speech into text; the default text is the one transcribed by the first verifier.

8. Evaluation conclusion:
   1. Pass: the [audio classes] option is correct, and the transcription is perfectly aligned with the speech part (current cut)
   2. Written error: written error in the transcription
   3. Intercept error: the speech cut is wrong
   4. Classification error: the [audio classes] option is wrong
   5. Blank error: the transcript text area is blank
   6. Punctuation error: punctuation errors

3. Interface instruction

4. Flow Chart of moderation instruction

5. Operation steps and instructions

1. Step 1: Listen to the intercepted audio.
2. Step 2: Check the default written text.
3. Step 3: Select the corresponding evaluation conclusion.
4. Step 4: Submit, or Submit and Leave.

Dialects:

Words that have the same meaning as in the Bengali Dialect, where the only difference between the two groups is the pronunciation.
The Bengali Dialect has no accent; the other dialects have an accent.

These will not be discarded.

Words that have the same meaning as in the Bengali Dialect but are completely different because of location and cultural differences.
These words are pronounced and spelt completely differently from the Bengali Dialect.

If the terms are completely different, the audio must be discarded.

IMPORTANT GUIDELINE UPDATES

Date | Update | Status
14-July | Dialects with the same word but a slight pronunciation difference will not be discarded. If the complete term is different from the standard, then discard. | Effective Today (14-07-2021)
15-July | All English words will be written in Bangla script, even abbreviations, brand names, proper nouns, etc. Only Bangla script will be used. | Effective Today (15-07-2021)
17-July | Proper nouns will remain written in BN, as we have aligned, and will be treated as BN as well. This means we do not need to cut them at the start or end of a sentence. | Effective Today (17-07-2021)
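
The 15-July update (only Bangla script) together with the rule that numbers and symbols must be written out as words can be spot-checked automatically. The sketch below is a minimal illustration, not part of the labeling tool: `script_violations` is a hypothetical helper, and the set of allowed punctuation is an assumption, since the guideline does not list permitted punctuation marks. It relies only on the fact that the Unicode Bengali block spans U+0980 to U+09FF.

```python
# Unicode Bengali block (letters, vowel signs, and Bengali digits).
BENGALI_START, BENGALI_END = 0x0980, 0x09FF

# Bengali digits sit inside the block but must be written out as words.
BENGALI_DIGITS = set("০১২৩৪৫৬৭৮৯")

# Assumption: punctuation tolerated in a transcript, plus the zero-width
# joiner/non-joiner that Bangla text sometimes needs for conjuncts.
ALLOWED_EXTRA = set("।,?!-\u200c\u200d")


def script_violations(text: str) -> list[str]:
    """Return the characters that break the Bangla-script-only rule."""
    violations = []
    for ch in text:
        if ch.isspace() or ch in ALLOWED_EXTRA:
            continue
        in_bengali_block = BENGALI_START <= ord(ch) <= BENGALI_END
        if not in_bengali_block or ch in BENGALI_DIGITS:
            violations.append(ch)
    return violations


print(script_violations("আমি ভাত খাই"))   # []
print(script_violations("আমি TV দেখি"))   # ['T', 'V']
print(script_violations("৫ টা"))          # ['৫']
```

An empty result only means the script rule is respected; it says nothing about whether the transcription matches the audio.
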
