0% found this document useful (0 votes)
67 views13 pages

01 Introduction

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
67 views13 pages

01 Introduction

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 13

CSE440: Natural Language

Processing II
Dr. Farig Sadeque
Assistant Professor
Department of Computer Science and Engineering
BRAC University
Lecture 1: Introduction
Class Expectation

What can we expect?

- Some linguistics knowledge


- A whole lot of algorithms. This is an algorithms course
- A bunch of programming

What this course is not:

- This is not a linguistics course. We will learn whatever linguistics we require


during the course
- This is not a machine learning/neural network course. There will be a refresher
within the first couple of weeks, but for deeper understanding, please take the
respective courses.
Class Expectations
- Several semesters of programming
- Primary language: Python
- Linguistics experience helpful, not required
- Mathematical experience helpful, not required
- As long as you can understand some maths notation, you will do well
Course Structure
- Attendance: 5%
- Assignments/project: 20%
- Quiz (best 3 out of 4): 20%
- Midterm: 25%
- Final 30%
Course Contents
- Books, lectures, assignments etc.
- https://drive.google.com/drive/folders/1KXTLF4oq_4otsFU-s1BowvlmCchuzgq2?u
sp=sharing
Consultation
- You can visit me during consultation hours
- You can also book a timeslot– online or offline
For in-office consultation For online consultation
Course Plan
Linguistics essentials Sequence tagging
● Sentence segmentation ● Sequence tagging basics
● Tokenization ● Markov Models
● Lemmatization/Stemming ● Deep Learning Architectures: Recurrent Neural Network
● Parts-of-Speech tagging ● Transfer Learning with Pretrained Language Models
● Named Entity Recognition Parsing
● Parsing ● Parsing Basics
● Coreference Resolution ● Constituency Grammar
Machine Learning Essentials Review ● Constituency Parsing
● Probability Review ● Dependency Parsing
● Naive Bayes, Logistic regression Translation
● Splits, metrics, statistical significance Coreference
● Essential ML maths refresher Text Generation: Encoder-Decoder Algorithm
Text Categorization and Representations Question Answering
● Representation basics
● Word embeddings
● Contextual embeddings
● Text Categorization Algorithms
Why is NLP Hard?
Ambiguity
- Phonetics: I scream? Ice cream?
- Morphology: Union-ized? Un-ionized?
- Syntax: Squad helps dog bite victim.
- Squad helps a dog to bite a victim?
- Squad helps a dog-bite victim?
- Semantics: Ball: an orb, or a dance?
- “High-end” nonsense: Colorful green ideas sleep furiously.
- Discourse: see that photo again
Variability
He bought it
- He purchased it
- He acquired it
- It was bought by him (and all other synonyms with passive voice)
- It was sold to him
- ……….
Language Change
Language Change
- English beats up other languages in dark alleys, then rifles through their
pockets for loose grammar and spare vocabulary
- Example: We eat beef, but we raise cows.
- Fun video: https://www.youtube.com/watch?v=Jl3K63Rbygw

You might also like