Lec 1
Lec 1
Prerequisite
for
Natural language processing
● Python
● Basic Concept of Machine
Learning and Deep Learning
Natural language processing
Natural language processing
Natural language processing (NLP) is a subfield of linguistics, computer
science, information engineering, and artificial intelligence concerned
with the interactions between computers and human (natural)
languages, in particular how to program computers to process and
analyze large amounts of natural language data.
5
Why Study NLP?
• A hallmark of human intelligence.
• Text is the largest repository of human knowledge and is
growing quickly.
• emails, news articles, web pages, scientific articles,
insurance claims, customer complaint letters, transcripts
of phone calls, technical documents, government
documents, patent portfolios, court decisions,
contracts….
6
Goals of the Course
• Learn basic principles and approaches for natural language
processing
• Learn techniques and tools that can be used to develop
practical, robust systems that can (somewhat) understand text
• Gain insight into (some) of the open research problems in natural
language
• At the end you should:
• Agree that language is subtle & interesting
• Feel some ownership over some of the techniques
• Understand (some) research papers in the field
7
Types of NLP
Applications
Used by
How NLP works?
NLP Working
NLP Working
The Ambiguity of Language: Why NLP
Is Difficult
• Manual rule creation and hand-tuning
• Time consuming to build.
• Do not scale up well.
• Produce a knowledge acquisition bottleneck.
• Perform poorly.
• Majority of Statistical models in disambiguation
• Robust.
• Behave gracefully
• In the presence of errors and new data.
• Automatic learning
• Reduce the human effort in producing NLP system.
• What about deep learning models?
15
Natural Language Understanding
Ambiguity:
Lexical Ambiguity : It was cool.
Syntactic Ambiguity : ill men and women get to hospital.
Semantic Ambiguity : The Bike hit the pole while it was running.
Pragmatic Ambiguity : The Army is coming.
Phonology – This science helps to deal with patterns present in the sound and speeches related
to the sound as a physical entity.
– One example of pragmatics in language would be if one person asked, "What do you want to
eat?" and another responded, "Ice cream is good this time of year."
Pragmatics (Contextual Analysis) – This science studies the different uses of language.
Morphology – This science deals with the structure of the words and the systematic relations
between them.
Semantics – This science deals with the literal meaning of the words, phrases as well as
sentences.
NLP Applications
• speech processing: get flight information or book a hotel over the phone
• information extraction: discover names of people and events they participate in,
from a document
• machine translation: translate a document from one human language into
another
• question answering: find answers to natural language questions in a text
collection or database
• summarization: generate a short biography of Noam Chomsky from one or
more news articles
17
Course logistics and details
• Textbook:
• Speech and Language Processing An Introduction to
Natural Language Processing, Computational Linguistics,
and Speech Recognition by Daniel Jurafsky and James H.
Martin
• Foundations of Statistical Natural Language
Processing by Christopher D. Manning, Hinrich Schütze
18