04 - Word Representations

The document discusses different approaches to representing word meanings in computers, including knowledge-based representations using concepts like hypernyms, and corpus-based representations using techniques like one-hot encoding, co-occurrence matrices, and learning low-dimensional dense word vectors directly from text. Corpus-based representations allow modeling word similarity and capturing syntactic and semantic relationships.

Applied Deep Learning

Word Representations
March 17th, 2020  http://adl.miulab.tw
2 Meaning Representations
◉ Definition of “Meaning”
o the idea that is represented by a word, phrase, etc.
o the idea that a person wants to express by using words, signs, etc.
o the idea that is expressed in a work of writing, art, etc.
3 Meaning Representations in Computers

Knowledge-Based Representation Corpus-Based Representation


4 Meaning Representations in Computers

Knowledge-Based Representation Corpus-Based Representation


5 Knowledge-Based Representation
◉ Hypernym (“is-a”) relationships from WordNet

Issues:
▪ newly-invented words
▪ subjective
▪ annotation effort
▪ difficult to compute word similarity
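Hypernym chains can be read off WordNet programmatically. A minimal sketch using NLTK's WordNet interface (an illustration, not part of the slides; assumes the wordnet corpus has been downloaded via nltk.download):

    from nltk.corpus import wordnet as wn  # requires: nltk.download("wordnet")

    # First listed sense of "car" and its is-a (hypernym) structure.
    car = wn.synsets("car")[0]         # Synset('car.n.01')
    print(car.hypernyms())             # direct hypernyms, e.g. [Synset('motor_vehicle.n.01')]
    for path in car.hypernym_paths():  # each path runs from the root entity.n.01 down to car.n.01
        print(" -> ".join(s.name() for s in path))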
6 Meaning Representations in Computers

Knowledge-Based Representation Corpus-Based Representation


7 Corpus-Based Representation
◉ Atomic symbols: one-hot representation

car         [0 0 0 0 0 0 1 0 0 … 0]
motorcycle  [0 0 1 0 0 0 0 0 0 … 0]

Issues: difficult to compute word similarity (e.g. comparing “car” and “motorcycle”):
[0 0 0 0 0 0 1 0 0 … 0] AND [0 0 1 0 0 0 0 0 0 … 0] = 0
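A minimal numpy sketch of this failure (the indices 6 and 2 are just the hypothetical vocabulary positions drawn in the slide):

    import numpy as np

    def one_hot(index, vocab_size):
        """Return a vector of zeros with a single 1 at the given index."""
        v = np.zeros(vocab_size)
        v[index] = 1.0
        return v

    car = one_hot(6, vocab_size=10)         # hypothetical index of "car"
    motorcycle = one_hot(2, vocab_size=10)  # hypothetical index of "motorcycle"
    print(car @ motorcycle)                 # 0.0 — distinct one-hot vectors are always orthogonal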

Idea: words with similar meanings often have similar neighbors


8 Corpus-Based Representation
◉ Neighbor-based representation
o Co-occurrence matrix constructed via neighbors
o Neighbor definition: full document vs. windows

Full document: a word-document co-occurrence matrix gives general topics
→ “Latent Semantic Analysis”

Windows: a context window around each word
→ captures syntactic (e.g. POS) and semantic information
9 Window-Based Co-occurrence Matrix
◉ Example
o Window length = 1
o Left or right context
o Corpus: I love AI.
          I love deep learning.
          I enjoy learning.

Counts     I   love  enjoy  AI  deep  learning
I          0    2     1     0    0      0
love       2    0     0     1    1      0
enjoy      1    0     0     0    0      1
AI         0    1     0     0    0      0
deep       0    1     0     0    0      1
learning   0    0     1     0    1      0

Similar words now share neighbors, so their similarity is > 0 (e.g. “AI” and “deep” both co-occur with “love”).
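A minimal sketch that reconstructs this matrix (vocabulary order fixed to match the table; lower-cased for simplicity):

    import numpy as np

    sentences = [s.lower().split() for s in
                 ["I love AI", "I love deep learning", "I enjoy learning"]]
    vocab = ["i", "love", "enjoy", "ai", "deep", "learning"]  # same order as the table
    index = {w: i for i, w in enumerate(vocab)}

    window = 1
    X = np.zeros((len(vocab), len(vocab)), dtype=int)
    for sent in sentences:
        for i, w in enumerate(sent):
            # count every word within `window` positions to the left or right
            for j in range(max(0, i - window), min(len(sent), i + window + 1)):
                if j != i:
                    X[index[w], index[sent[j]]] += 1
    print(X)  # reproduces the counts in the table above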

Issues:
▪ matrix size increases with vocabulary
▪ high dimensional
▪ sparsity → poor robustness

Idea: low-dimensional word vectors
10 Low-Dimensional Dense Word Vector
◉ Method 1: dimension reduction on the matrix
◉ Singular Value Decomposition (SVD) of co-occurrence matrix X

X ≈ U_k Σ_k V_kᵀ  (approximate X by keeping only the top k singular values)
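A sketch of the reduction, reusing the co-occurrence matrix X built in the earlier example (scaling U_k by the singular values is one common choice for the word vectors):

    # continues from the co-occurrence example above
    U, S, Vt = np.linalg.svd(X.astype(float), full_matrices=False)
    k = 2                             # target dimensionality
    word_vectors = U[:, :k] * S[:k]   # one dense k-dimensional row per word
    print(word_vectors.shape)         # (6, 2)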
11 Low-Dimensional Dense Word Vector
◉ Method 1: dimension reduction on the matrix
◉ Singular Value Decomposition (SVD) of co-occurrence matrix X

Issues:
▪ computationally expensive: O(mn²) for an n×m matrix when n < m
▪ difficult to add new words

Idea: directly learn low-dimensional word vectors

[Figure: word vector spaces capturing semantic relations and syntactic relations (Rohde et al., 2005)]


Rohde et al., “An Improved Model of Semantic Similarity Based on Lexical Co-Occurrence,” 2005.
12 Low-Dimensional Dense Word Vector
◉ Method 2: directly learn low-dimensional word vectors
○ Learning representations by back-propagation. (Rumelhart et al., 1986)
○ A neural probabilistic language model (Bengio et al., 2003)
○ NLP (almost) from Scratch (Collobert & Weston, 2008)
○ Recent and most popular models: word2vec (Mikolov et al., 2013) and GloVe (Pennington et al., 2014)
• Also known as “word embeddings”
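A minimal sketch of direct learning with gensim's word2vec implementation (an illustration, not part of the slides; assumes gensim ≥ 4.0, where the dimensionality argument is named vector_size, and reuses the toy corpus from slide 9, which is far too small for useful embeddings):

    from gensim.models import Word2Vec

    sentences = [["i", "love", "ai"],
                 ["i", "love", "deep", "learning"],
                 ["i", "enjoy", "learning"]]
    # sg=1 selects the skip-gram variant; sg=0 would select CBOW
    model = Word2Vec(sentences, vector_size=50, window=1, min_count=1, sg=1)
    print(model.wv["ai"].shape)               # (50,) dense word embedding
    print(model.wv.similarity("ai", "deep"))  # cosine similarity between embeddings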
13 Summary
◉ Knowledge-based representation
◉ Corpus-based representation
✓ Atomic symbol
✓ Neighbors
o High-dimensional sparse word vector
o Low-dimensional dense word vector
▪ Method 1 – dimension reduction
▪ Method 2 – direct learning
