Computers and Linguistics

PRACTICE

1. Speech Synthesis
Analyze the pronunciation of the given words and provide rules for a TTS system.
a. call, cab, cake, cone, cob, cinder, city, cell, cent, cello
b. zoo, boo, moon, spoon, food, room, good, stood, book
c. tough, rough, plough, enough, cough, bough
d. mould, could, would, should
e. bone, home, rode, stove, dove, love, done, move
Here's a breakdown of the pronunciation rules and irregular words:
Pronunciation Rules:
a. Letter <c>: Pronounce <c> as /k/ except when followed by <e>, <i>, or <y>, in
which case pronounce it as /s/.
b. Vowels:
 Pronounce <oo> as /uː/ (zoo, moon, food) except in "good," "stood," and
"book," where it is the short /ʊ/.
 Pronounce <ough> as /ʌf/ (tough, rough, enough); "cough" ends in /ɒf/, while
"plough" and "bough" end in /aʊ/.
 Pronounce <ould> as /ʊd/ with a silent <l> (could, would, should) except in
"mould," where it is pronounced /məʊld/.
 Pronounce <o> + consonant + silent <e> as /oʊ/ (bone, home, rode, stove)
except in "dove" and "love" (/ʌv/), "done" (/ʌn/), and "move" (/uːv/).
Irregular Words
 a. "cello" (pronounced /ˈtʃɛloʊ/, not */ˈsɛloʊ/)
 b. "good," "stood," "book"
 c. "cough," "plough," "bough"
 d. "mould"
 e. "dove," "love," "done," "move"
TTS Handling:
1. Lexicon: Maintain a lexicon or dictionary that stores the correct
pronunciation for irregular words. When the system encounters an
irregular word, it can refer to the lexicon to retrieve the correct
pronunciation.
2. Phonetic Transcription: Represent the correct pronunciation of irregular
words using a phonetic transcription system like the International
Phonetic Alphabet (IPA).
3. Machine Learning: Train a machine learning model on a large dataset of
words and their correct pronunciations.
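The lexicon-plus-rules strategy above can be sketched in code. This is a minimal, illustrative grapheme-to-phoneme sketch, not a real TTS front end: the lexicon holds one irregular word and the rule component implements only the <c> rule from exercise (a), passing other letters through unchanged.

```python
# Minimal sketch of TTS pronunciation handling: consult an exception
# lexicon first; otherwise fall back to letter-to-sound rules.
# Only the <c> rule from exercise (a) is implemented here.

# Exception lexicon mapping irregular words to IPA pronunciations.
LEXICON = {
    "cello": "ˈtʃɛloʊ",
}

def c_rule(word):
    """Apply the <c> rule: /s/ before <e>, <i>, <y>; /k/ otherwise."""
    phones = []
    for i, ch in enumerate(word):
        if ch == "c":
            nxt = word[i + 1] if i + 1 < len(word) else ""
            phones.append("s" if nxt in "eiy" else "k")
        else:
            phones.append(ch)  # other letters pass through unchanged
    return "".join(phones)

def pronounce(word):
    """Lexicon lookup first, then fall back to the rules."""
    word = word.lower()
    return LEXICON.get(word) or c_rule(word)

print(pronounce("city"))   # <c> before <i> -> "sity"
print(pronounce("cake"))   # <c> before <a> -> "kake"
print(pronounce("cello"))  # irregular word, served from the lexicon
```

A production system would use full phonetic transcriptions and many more rules, but the lookup order (lexicon before rules) is the key design point.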
2. Automatic Speech Recognition
Choose one of your favorite ASR systems on your mobile phone, such as Siri
(iOS) or Google Assistant (Android).
Then try speaking to your ASR, for example:
1. Start with basic commands. For example:
“What’s the weather today?”
“What date is it today?”
2. Choose words or phrases that you think might be problematic. For
example:
Words with multiple meanings (e.g., "lead" vs. "led").
Words with difficult sounds (e.g., "squirrel," "rural").
3. Try speaking in different accents or using regional pronunciations. For
instance:
Use a British accent versus an American accent.
Try regional dialects, accents, or slang.

Some types of words or phrases may cause confusion, influenced by factors such as:


1. Homophones
2. Technical Terms
3. Names
4. Contextual Phrases
5. Sound-Alike Words
6. Accent Variations
3. Corpus Linguistics
Analyzing a corpus of English literary texts, including the rationale for the chosen
order.
1. Part-of-Speech Tagging
Part-of-speech tagging involves labeling each word in a text with its
corresponding part of speech, such as noun, verb, adjective, or adverb.
Example:
In the sentence "The quick brown fox jumps over the lazy dog," tagging would
label "The" (determiner), "quick" (adjective), "brown" (adjective), "fox" (noun),
"jumps" (verb), "over" (preposition), "the" (determiner), "lazy" (adjective), and
"dog" (noun).
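The example above can be reproduced with a toy dictionary-based tagger. This is deliberately simplified: a real system would use a trained statistical tagger (such as NLTK's), but the input/output shape is the same.

```python
# Toy dictionary-based POS tagger for the example sentence. Unknown
# words receive the placeholder tag "X"; a real tagger would predict
# tags from context instead of looking them up.
TAGS = {
    "the": "DET", "quick": "ADJ", "brown": "ADJ", "fox": "NOUN",
    "jumps": "VERB", "over": "PREP", "lazy": "ADJ", "dog": "NOUN",
}

def pos_tag(sentence):
    """Return (token, tag) pairs for a whitespace-tokenized sentence."""
    return [(tok, TAGS.get(tok.lower(), "X"))
            for tok in sentence.rstrip(".").split()]

tagged = pos_tag("The quick brown fox jumps over the lazy dog")
print(tagged)  # [('The', 'DET'), ('quick', 'ADJ'), ...]
```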
2. Identifying Subjects, Direct Objects, and Indirect Objects
This step analyzes the grammatical structure of sentences to identify key syntactic
components: subjects (who or what the sentence is about), direct objects (who or
what receives the action), and indirect objects (to whom or for whom the
action is performed).
Example:
In the sentence "The teacher gave the students homework," "The teacher" is the
subject, "homework" is the direct object, and "the students" is the indirect object.
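A rule-of-thumb extractor for ditransitive sentences of the shape "NP VERB NP NP" can recover these roles from tagged tokens: the first noun phrase is the subject, the one right after the verb is the indirect object, and the last is the direct object. This heuristic is only a sketch; real parsers assign dependency labels (e.g. nsubj, iobj, dobj) instead.

```python
# Heuristic role extractor over POS-tagged tokens for sentences of the
# shape "NP VERB NP NP", e.g. "The teacher gave the students homework".
def extract_roles(tagged):
    """tagged: list of (word, tag) pairs; returns a role dictionary."""
    phrases, current = [], []
    verb = None
    for word, tag in tagged:
        if tag in ("DET", "ADJ"):
            current.append(word)           # modifiers join the phrase
        elif tag == "NOUN":
            current.append(word)
            phrases.append(" ".join(current))  # the noun closes the phrase
            current = []
        elif tag == "VERB":
            verb = word
    return {
        "subject": phrases[0],
        "verb": verb,
        "indirect_object": phrases[1],
        "direct_object": phrases[2],
    }

tagged = [("The", "DET"), ("teacher", "NOUN"), ("gave", "VERB"),
          ("the", "DET"), ("students", "NOUN"), ("homework", "NOUN")]
print(extract_roles(tagged))
```

Running this on the example sentence reproduces the roles given above: "The teacher" (subject), "the students" (indirect object), "homework" (direct object).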
3. Building Syntactic Trees
A syntactic tree is a visual representation of the grammatical structure of a
sentence. It shows how words group together into phrases and how those phrases
relate to one another.
Example:
For the sentence "Tree structures are very easy," the syntactic tree would show
"Tree structures" as a noun phrase and "are very easy" as a verb phrase, with
"very" as a degree modifier and "easy" as an adjective.
[S [NP Tree structures] [VP [V are] [AdjP [Deg very] [Adj easy]]]]
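One simple way to represent such a tree in code is as nested (label, children) tuples, with plain strings as the leaf words. The sketch below builds the tree for the example sentence and walks it to recover the words in order.

```python
# A syntactic tree for "Tree structures are very easy" as nested
# (label, *children) tuples; leaves are plain word strings.
tree = ("S",
        ("NP", "Tree", "structures"),
        ("VP", ("V", "are"),
               ("AdjP", ("Deg", "very"), ("Adj", "easy"))))

def leaves(node):
    """Collect the words at the leaves, left to right."""
    if isinstance(node, str):
        return [node]
    words = []
    for child in node[1:]:  # node[0] is the phrase label, skip it
        words.extend(leaves(child))
    return words

print(" ".join(leaves(tree)))  # reconstructs the original sentence
```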
4. Producing Word Roots
Producing word roots involves reducing words to their base or root form, a
process known as lemmatization. Lemmatization typically considers the context
and part of speech of a word to derive the correct root.
Example:
The words "running," "ran," and "runs" would all be reduced to the root "run."
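A minimal lemmatizer follows the same pattern as the TTS lexicon: irregular forms come from an exception table, regular forms from suffix-stripping rules. The rules below are deliberately toy-sized and tuned only to the "run" example; real lemmatizers (e.g. WordNet-based ones) also use part-of-speech information.

```python
# Toy lemmatizer: exception table first, then crude suffix stripping.
IRREGULAR = {"ran": "run"}

def lemmatize(word):
    word = word.lower()
    if word in IRREGULAR:
        return IRREGULAR[word]          # irregular past tense, etc.
    if word.endswith("ning") and len(word) > 5:
        return word[:-4]                # "running" -> "run" (drop doubled n + ing)
    if word.endswith("ing"):
        return word[:-3]
    if word.endswith("s") and not word.endswith("ss"):
        return word[:-1]                # "runs" -> "run"
    return word

print([lemmatize(w) for w in ["running", "ran", "runs"]])
# all three reduce to "run"
```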

Summary of the Order


1. Start with Part-of-Speech Tagging: This provides the essential context for each
word.
2. Identify Subjects and Objects: Using the tagged information allows for more
accurate identification of grammatical roles.
3. Build Syntactic Trees: The previously identified components facilitate a clearer
structure of the sentence.
4. Produce Word Roots: Finalizing the analysis with roots allows for broader
linguistic insights across the corpus.
By following this order, each step builds logically upon the last, ensuring a
comprehensive understanding of the corpus and its grammatical structures.
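The ordering above can be made concrete as a pipeline in which each stage consumes the previous stage's output. The stages here are small stand-ins for the fuller implementations discussed in steps 1-4 (tree building is omitted for brevity); the point is the data flow, not the linguistics.

```python
# Sketch of the pipeline order: tag -> identify roles -> produce roots.
def tag(tokens):
    # Stage 1: attach a (toy) part-of-speech label to each token.
    lex = {"the": "DET", "teacher": "NOUN", "gave": "VERB",
           "students": "NOUN", "homework": "NOUN"}
    return [(t, lex.get(t.lower(), "X")) for t in tokens]

def roles(tagged):
    # Stage 2: use the tags to pick out the nouns and the verb.
    return {"nouns": [w for w, t in tagged if t == "NOUN"],
            "verb": next(w for w, t in tagged if t == "VERB")}

def root(word):
    # Stage 4: crude suffix stripping standing in for lemmatization.
    return word[:-1] if word.endswith("s") else word

tokens = "The teacher gave the students homework".split()
analysis = roles(tag(tokens))
analysis["nouns"] = [root(n) for n in analysis["nouns"]]
print(analysis)
# {'nouns': ['teacher', 'student', 'homework'], 'verb': 'gave'}
```

Because stage 2 reads the tags produced by stage 1, and stage 4 operates on the words stage 2 selected, reordering the stages would break the pipeline, which is exactly the rationale for the chosen order.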
Conclusion:
The integration of language and computers has significantly advanced fields such
as linguistics, natural language processing, and computational linguistics. This
practice enables the analysis, understanding, and generation of human language
by machines, impacting everything from automated translation to speech
recognition.
Strengths
Efficiency: Automated systems can process and analyze large volumes of text
much faster than humans, making it easier to extract information and identify
patterns.
Consistency: Computers apply the same rules and algorithms consistently,
reducing the variability that might occur in human analysis.
Accessibility: Language technologies facilitate communication for diverse
populations, including those with disabilities, through tools like speech-to-text
and text-to-speech systems.
Innovative Applications: Advances in machine learning and AI lead to
innovative applications, such as chatbots, sentiment analysis, and real-time
translation.
Weaknesses
Context Sensitivity: Computers often struggle with understanding context,
idioms, and nuances of language that can lead to misinterpretations.
Data Dependence: The performance of language models heavily relies on the
quality and diversity of training data, which can introduce biases or limitations.
Complexity of Human Language: Language is inherently complex and variable,
making it challenging for algorithms to capture all linguistic subtleties and
regional variations.
Resource Intensive: Developing and maintaining advanced language processing
systems can be resource-intensive, requiring significant computational power and
expertise.
Overall, while the practice of integrating language and computers brings
numerous benefits and innovations, it also presents challenges that require
ongoing research and refinement to improve accuracy and inclusivity.
