
Natural Language Processing: Syllabus

Alan W. Black & David R. Mortensen


Carnegie Mellon University

Fall 2018

Instructors: Prof. Alan W. Black ([email protected]) and David R. Mortensen
([email protected])
Teaching assistants: Fatima Al-Raisi ([email protected]), Manisha Chaurasia
([email protected]), Pooja Chitkara ([email protected]), Sarveshwaran
Dhansekar ([email protected])
Lecture time: Tuesdays & Thursdays, 3:00–4:20
Location: WEH 4623
Web page: http://demo.clab.cs.cmu.edu/NLP/
Faculty office hours: By appointment (Black);
By appointment at https://davidmortensen.youcanbook.me (Mortensen)
TA Office hours: TBA

1 Summary
This course is about a variety of ways to represent human languages (like English and Chinese) as
computational systems, and how to exploit those representations to write programs that do useful
things with text and speech data, like translation, summarization, extracting information, question
answering, natural interfaces to databases, and conversational agents.
This field is called Natural Language Processing or Computational Linguistics, and it is
extremely multidisciplinary. This course will therefore include some ideas central to Machine
Learning (discrete classification, probability models) and to Linguistics (morphology, syntax,
semantics).
We’ll cover computational treatments of words, sounds, sentences, meanings, and conversations.
We’ll see how probabilities and real-world text data can help. We’ll see how different levels interact
in state-of-the-art approaches to applications like translation and information extraction.
From a software engineering perspective, there will be an emphasis on rapid prototyping, a useful
skill in many other areas of Computer Science. In particular, we will introduce some high-level
formalisms (e.g., regular expressions) and tools (e.g., Python) that can greatly simplify prototype
implementation.
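
To make that concrete, here is a minimal, hypothetical sketch (not course material) of the
kind of regular-expression prototyping meant above; the pattern and the sample text are
invented for illustration:

    import re

    TEXT = "Alan Black and David Mortensen teach NLP at Carnegie Mellon."

    # A crude pattern for runs of capitalized words -- a five-minute
    # prototype of a named-entity spotter, not a serious system.
    NAMES = re.compile(r"[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*")

    for match in NAMES.finditer(TEXT):
        print(match.group())  # Alan Black / David Mortensen / Carnegie Mellon

A few lines like these are often enough to test an idea on real text before committing to a
full implementation.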

2 Target
The course is designed for SCS undergraduate students, and also for students in graduate
programs who have a peripheral interest in natural language, or linguistics students who know
how to program. Prerequisite: Fundamental Data Structures and Algorithms (15-211) or
equivalent; strong programming capabilities.

3 Evaluation
Students will be evaluated in four ways:

Exams (40%) one in-class midterm in March (20%) and one cumulative final exam (20%), date
TBD.

Project (30%) a semester-long 4-person team project (see below).

Homework assignments (20%) 7 pencil-and-paper or small programming problems given roughly
weekly.

Quizzes (10%) 10 Canvas quizzes given at the beginning of many lectures (students should
bring a device to class so they can access Canvas).

The lowest 2 homework grades and the lowest 3 quiz grades will be dropped.

Late Policy No work will be accepted late. Instead, the dropped homework and quiz grades
described above provide slack that is administratively simpler than deducting points for
lateness or for missing a lecture.

Academic Honesty Exams and pop quizzes are to be completed individually. Verbal collab-
oration on homework assignments is acceptable, but (a) you must not share any code or other
written material, (b) everything you turn in must be your own work, and (c) you must note the
names of anyone you collaborated with on each problem (the only exceptions are the instructor
and TA), and the nature of the collaboration (e.g., “X helped me,” “I helped X,” “X and I worked
it out together.”). If you find material in published literature (e.g., on the Web) that is helpful in
solving a problem, you must cite it and explain the answer in your own words. The project is to
be completed by a team; you are not permitted to discuss any aspect of your project with anyone
other than your team members, the instructor, and the TA. You are encouraged to use existing
NLP components in your project; you must acknowledge these appropriately in the documentation.
Suspected violations of these rules will be handled in accordance with the CMU guidelines on
collaboration and cheating (http://www.cmu.edu/policies/documents/Cheating.html).

4 Project
A major component will be a 4-person team project. The project involves two parts (a toy
illustration follows the list):

• a questioning program (ask) whose input is a web page P and whose output is a set of
questions about the content in P that a human could answer if she read P, and

• an answering program (answer) whose input is a web page P and a question Q about P and
whose output is an intelligent answer A.
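
For concreteness, here is a deliberately naive, hypothetical sketch of the two programs; the
string heuristics and the sample page are invented for illustration, and the actual interface
your team must implement is whatever the course specifies, not this:

    import re

    def sentences(text):
        # Crude sentence splitter: break after ., !, or ? followed by whitespace.
        return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

    def ask(text, n=3):
        # Turn simple "X is/was Y." sentences into yes/no questions by inversion.
        questions = []
        for s in sentences(text):
            m = re.match(r"(\w+(?:\s\w+)*?)\s(is|are|was|were)\s(.+)\.", s)
            if m:
                subject, verb, rest = m.groups()
                questions.append(f"{verb.capitalize()} {subject} {rest}?")
            if len(questions) == n:
                break
        return questions

    def answer(text, question):
        # Return the page sentence sharing the most words with the question.
        q_words = set(re.findall(r"\w+", question.lower()))
        return max(sentences(text),
                   key=lambda s: len(q_words & set(re.findall(r"\w+", s.lower()))))

    page = "Pittsburgh is a city in Pennsylvania. CMU was founded in 1900."
    print(ask(page))    # ['Is Pittsburgh a city in Pennsylvania?', 'Was CMU founded in 1900?']
    print(answer(page, "When was CMU founded?"))

Real systems will need far more linguistic machinery than these heuristics, which is exactly
the point of the course.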

Projects will be pitted against each other in a competition at the end of the course. Here’s how
the competition works:

1. Questions will be generated manually by students in the course (this happens early in the
course as an exercise to start thinking about how to build an ask program). These will be
rated by student judges in a blind setup, for reasonableness and difficulty.

2. Questions will be generated by each team’s ask program. These will be rated by student
judges, for reasonableness and fluency, in a blind setup.

3. Human-generated and reasonable automatically-generated questions will be provided as input
to the answer programs, producing answers. These answers will be rated for correctness and
fluency by student judges, in a blind setup.

The project will be graded primarily on the documentation your team submits describing how
the programs work, and on a brief video presentation at the end of the semester.

5 Textbook
The textbook for the course will be the second edition of Speech and Language Processing: An
Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition,
by Daniel Jurafsky and James H. Martin. The course will cover roughly Parts I, III, and IV,
and portions of Part V.

6 Lectures
The lecture plan is subject to change. Readings from Jurafsky and Martin are given in brackets.

• Course overview; what does it mean to “know language”? [1]

(Part 1 is “shallow” NLP.)

• Words, morphology, and lexicons‡ [3.1, 3.9]

• Discussion of the project

• Information extraction, question answering, and NLP in information retrieval† [22.0, 23.0,
22.1–2, 23.1–2]

• Probability and language models [4.0–2]

• Language model evaluation and smoothing [4.3–8]

• Noisy channel models, edit distance, and spelling correction [3.10–11, 5.9]

• Classification∗

• Word categories and parts of speech [5.0–3]

• Hidden Markov models and part-of-speech tagging [6.0–4]

• Chomsky hierarchy and natural language‡ [16]

(There will be an in-class midterm in the week before spring break. Part 2 is “deep” NLP.)

• Syntactic representations of natural language‡ [12.0–3]

• Parsing algorithms [13]

• Treebanks and parsing evaluation [12.4, 14.7]

• Probabilistic context-free grammars and statistical parsing [14.0–4]

• Word embeddings and dense word vectors

• Beyond context-free parsing‡

• Lexical semantics‡ [19.0–3]

• Semantic disambiguation problems: word-sense and coreference [20.0–2, 21.3, 21.7]

• Semantic role labeling [20.9]

• Compositional semantics‡ [17.2–3, 18.0–3]

• Clustering and Expectation Maximization∗

• Machine translation† [25.0–1, 25.9]

(The class closes by synthesizing and looking forward.)

• Current NLP research at CMU

• Wrap-up and discussion


∗ These lectures are essentially stand-alone lectures on important topics in machine learning, a
subfield of CS that is central to current NLP.
† These lectures focus on applications that companies you’ve heard of are currently working on.
‡ These lectures explore ideas from linguistics, but with a computational spin.
