Ca20 Part02 NLP
§ Concepts
• Basics from linguistics, statistics, and machine learning
§ Methods
• How to develop and evaluate data-driven algorithms
• Standard techniques used in machine learning
• Types of analyses used in computational linguistics
§ Disclaimer
• The basics selected here are far from complete and are only revisited at a high level.
  For a more comprehensive overview, see, e.g., the slides of my bachelor's course "Introduction to Text Mining" (Wachsmuth 2019).
§ Computational linguistics
• Intersection of computer science and linguistics
• Technologies for natural language processing
• Models to explain linguistic phenomena, based on knowledge and statistics
§ Observations
• All applications need to "understand" language → linguistics needed
• None of these applications works perfectly → empirical methods needed
Linguistic text units

Example: "The man sighed. It's raining cats and dogs, he felt." at increasing levels of granularity:
• Phonemes: ð ə m ə n s a ɪ d ɪ t s r e ɪ n ɪ ŋ k æ t s æ n d d ɑ g z h i f ɛ l t
• Morphemes: The man sigh ed It s rain ing cat s and dog s he felt
• Tokens: The | man | sighed | . | It | 's | raining | cats | and | dogs | , | he | felt | .
• POS tags: DT NN VBD . PRP VBZ VBG NNS CC NNS , PRP VBD .
• Phrases: [The man]NP [sighed]VP . [It]NP ['s raining]VP [cats and dogs]NP , [he]NP [felt]VP .
• Clauses: [The man sighed.] [It's raining cats and dogs,] [he felt.]
• Sentences: [The man sighed.] [It's raining cats and dogs, he felt.]
• Paragraphs: [The man sighed. It's raining cats and dogs, he felt.]
§ Lemma
• The dictionary form of a word.
Example: "cat" for "cats", "run" for "ran"
§ Wordform
• The fully inflected surface form of a lemma as it appears in a text.
Example: "cats" for "cats", "ran" for "ran"
§ Stem
• The part of a word(form) that never changes.
Example: "cat" for "cats", "ran" for "ran"
§ Token
• The smallest text unit in NLP: A wordform, number, symbol, or similar.
Example: "cats", "ran", and "." in "cats ran." (whitespace is usually not considered a token)
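To make these units concrete, here is a minimal sketch using NLTK; the library and its data downloads are an assumption, not part of the slides:

```python
# Minimal sketch of token, stem, and lemma (NLTK assumed installed).
# One-time setup (resource names vary by NLTK version):
#   nltk.download("punkt"); nltk.download("wordnet")
import nltk
from nltk.stem import PorterStemmer, WordNetLemmatizer

tokens = nltk.word_tokenize("cats ran.")        # ['cats', 'ran', '.']

stemmer = PorterStemmer()
print(stemmer.stem("cats"))                     # 'cat' (stem)
print(stemmer.stem("ran"))                      # 'ran' (the stem never changes)

lemmatizer = WordNetLemmatizer()
print(lemmatizer.lemmatize("cats", pos="n"))    # 'cat' (lemma of the noun)
print(lemmatizer.lemmatize("ran", pos="v"))     # 'run' (lemma of the verb)
```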
§ Phrase
• A contiguous sequence of related words, functioning as a single meaning unit.
• Phrases often contain nested phrases.
• Types. Noun phrase (NP), verb phrase (VP), prepositional phrase (PP).
Sometimes also adjectival phrase (AP) and adverbial phrase (AdvP).
§ Clause
• The smallest grammatical unit that can express a complete proposition.
• Types. Main clause and subordinate clause.
§ Sentence
• A grammatically independent linguistic unit consisting of one or more words.
Main semantic concepts
§ Lexical semantics
• The meaning of words and multi-word expressions.
Different senses of a word, the roles of predicate arguments, ...
§ Compositional semantics
• The meaning of the composition of words in phrases, sentences, and similar.
Relations, scopes of operators, and much more.
§ Entity
• An object from the real world.
• Named entities. Persons, locations, organizations, products, ...
For example, "Jun.-Prof. Dr. Henning Wachsmuth", "Paderborn", "Paderborn University"
§ Relations
• Semantic. Relations between entities, e.g., organization founded in period.
• Temporal. Relations describing courses of events, e.g., as in news reports.
§ Coreference
• Two or more expressions in a text that refer to the same thing.
• Types. Pronouns in anaphora and cataphora, coreferring noun phrases, ...
Example: "Apple is based in Cupertino. The company is actually called Apple Inc., and they make hardware."
§ Speech acts
• Linguistic utterances with a performative function.
  (more details in the lecture on basics of argumentation)
§ Communicative goals
• Specific functions of passages within a discourse.
• Specific effects intended to be achieved by an utterance.
§ Ambiguity is pervasive
• Phonetic. "wreck a nice beach" (vs. "recognize speech")
• Word sense. "I went to the bank."
• Part of speech. "I made her duck."
• Attachment. "I saw a kid with a telescope."
• Coordination. "If you love money problems show up."
• Scope of quantifiers. "I didn't buy a car."
• Speech act. "Have you emptied the dishwasher?"
§ Other challenges
• World knowledge. "Trump must rethink capital punishment"
§ Possible interpretations
• "I never said she stole my money."
  Stressing different words yields different readings, e.g., stressing "I": someone else said it, but I didn't.
§ Evaluation criteria
• Effectiveness. The extent to which the output of an algorithm is correct.
• Efficiency. The consumption of time (or space) of an algorithm on an input.
• Robustness. The extent to which an algorithm remains effective (or efficient)
across different inputs, often in terms of textual domains.
§ Evaluation measures
• Quantify the quality of an algorithm on a specific task and text corpus.
• Algorithms can be ranked with respect to an evaluation measure.
• Different measures are useful depending on the task.
§ Text corpus
• A collection of real-world texts with known properties,
compiled to study a language problem.
• The texts are often annotated with meta-information.
• Corpora are usually split into datasets for developing (training) and/or
evaluating (testing) an algorithm.
§ Types of annotations
• Ground truth. Manual annotations, often created by experts.
• Automatic. NLP algorithms add annotations to texts.
  (more details in the part on acquisition)
Evaluation of effectiveness in classification tasks
§ Instances in classification tasks
• Positives. The output instances (annotations) an algorithm has created.
• Negatives. All other possible instances.
§ Accuracy
• Used if positives and negatives are similarly important.
• Created instances that are correct are true positives (TP), created but incorrect ones are
  false positives (FP); correct instances that were not created are false negatives (FN),
  and all remaining ones are true negatives (TN).

  Accuracy = (TP + TN) / (TP + TN + FP + FN)
§ Precision, recall, and F1-score
• Used if positives are in the focus.
  Precision (P) = TP / (TP + FP)     Recall (R) = TP / (TP + FN)     F1-score = 2 · P · R / (P + R)
• In multi-class tasks, micro- and macro-averaged values can be computed.
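As a quick sketch, all four measures can be computed directly from the four counts in plain Python; the counts below are hypothetical:

```python
# Sketch: effectiveness measures from the TP, TN, FP, FN counts.
def evaluate(tp, tn, fp, fn):
    accuracy  = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall    = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return accuracy, precision, recall, f1

# Hypothetical counts for illustration:
print(evaluate(tp=40, tn=30, fp=10, fn=20))  # (0.7, 0.8, 0.667, 0.727) approx.
```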
§ Balancing of datasets
• A balanced distribution of target classes in the training set is often preferable.
• Undersampling. Removal of instances from majority classes.
• Oversampling. Addition of instances from minority classes.
• In machine learning, an alternative is to weight classes inversely to their size.
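A minimal sketch of both sampling strategies on labeled instances; the data and helper function are hypothetical (dedicated libraries exist for this):

```python
# Sketch: balance (instance, label) pairs by random under-/oversampling.
import random
random.seed(0)  # for reproducibility

def balance(instances, mode="under"):
    by_label = {}
    for x, y in instances:
        by_label.setdefault(y, []).append((x, y))
    sizes = [len(group) for group in by_label.values()]
    target = min(sizes) if mode == "under" else max(sizes)
    balanced = []
    for group in by_label.values():
        if len(group) >= target:    # undersampling: remove instances
            balanced += random.sample(group, target)
        else:                       # oversampling: duplicate instances
            balanced += group + random.choices(group, k=target - len(group))
    return balanced

data = [("t1", "pos"), ("t2", "pos"), ("t3", "pos"), ("t4", "neg")]
print(balance(data, "under"))   # one "pos" and one "neg" instance
```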
§ Training set
• Known instances used to develop or statistically learn an algorithm.
• The training set may be analyzed manually and automatically.
§ Variable
• An entity that can take on different numeric or non-numeric values.
• Independent. A variable X that is expected to affect another variable.
• Dependent. A variable Y that is expected to be affected by others.
• Other. Confounders, mediators, moderators, ...
§ Scales of variables
• Nominal. Values that represent discrete, separate categories.
• Ordinal. Values that can be ordered/ranked by what is better.
• Interval. Values whose difference can be measured.
• Ratio. Interval values that have an absolute zero.
Descriptive statistics
§ Descriptive statistics
• Measures for summarizing and comprehending distributions of values.
• Used to describe phenomena.
§ Measures of dispersion
• Range. The distance between minimum and maximum in a sample.
• Variance. The mean squared difference between each value and the mean.
• Standard deviation. The square root of the variance.
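All three measures in a few lines, using Python's statistics module; the sample is hypothetical (pvariance/pstdev match the definitions above, which divide by the sample size):

```python
# Sketch: measures of dispersion for a sample of values.
import statistics

values = [2, 4, 4, 4, 5, 5, 7, 9]            # hypothetical sample
print(max(values) - min(values))              # range: 7
print(statistics.pvariance(values))           # variance: 4.0
print(statistics.pstdev(values))              # standard deviation: 2.0
```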
§ Example pipeline
• Extraction of the founding dates of companies
§ Alternatives
• Joint model. Realizes multiple analysis steps at the same time.
• Neural network. Often works on the raw input text.
§ Types of approaches
• Supervised. Training instances with known output used in development.
• Unsupervised. No output labels/values used in development.
... and some others
§ Types of techniques
• Rule-based. Analysis based on manually encoded expert knowledge.
Knowledge includes rules, lexicons, grammars, ...
§ Example
• (0?[1-9]|[12][0-9]|3[01])\.(0?[1-9]|1[0-2])\.(19|20)[0-9][0-9]
  matches German dates, such as 8.5.1945 or 30.04.2020.
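A sketch of applying this rule with Python's re module; the surrounding text is a hypothetical example:

```python
# Sketch: rule-based matching of German dates with the expression above.
import re

DATE = re.compile(r"\b(0?[1-9]|[12][0-9]|3[01])\.(0?[1-9]|1[0-2])\.(19|20)[0-9][0-9]\b")

text = "Kriegsende am 8.5.1945, Vorlesung am 30.04.2020."
print([m.group(0) for m in DATE.finditer(text)])  # ['8.5.1945', '30.04.2020']
```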
(Figure: parse chart over a four-word input — the spans (1,1) to (4,4) carry the tags N N V N, and the span (1,3) is combined into an NP.)
§ Two-way relationship
• The output information of NLP serves as the input to machine learning.
• Many NLP algorithms rely on machine learning to produce output information.
(Figure: data mining turns input data into output information; machine learning provides the middle steps, from the representation of instances to the generalization over patterns.)
§ Feature value
• The value of a feature of a given input, usually real-valued and normalized.
Example: The feature representing "is" would have the value 0.5 for the sentence "is is a word".
§ Feature type
• A set of features that conceptually belong together.
Example: The relative frequency of each known word in a text (this is often called "bag-of-words").
§ Feature vector
• A vector x(i) = (x1(i), ..., xm(i)) where each xj(i) is the value of one feature xj.
Example: For two feature types with k and l features respectively, x(i) would contain m = k+l values.
• Feature filtering. Keep only features whose counts lie within some defined thresholds,
  e.g., applied to a frequency-ordered vocabulary: "the", "a", ..., "engineeeering". Ng (2018)
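A pure-Python sketch tying these concepts together: building a count-thresholded vocabulary and computing bag-of-words feature vectors (the corpus and thresholds are hypothetical):

```python
# Sketch: bag-of-words feature vectors with count-based feature filtering.
from collections import Counter

corpus = ["is is a word", "a word is a word"]   # hypothetical training texts
MIN_COUNT, MAX_COUNT = 1, 10                    # hypothetical thresholds

# Keep only features whose corpus counts lie within the thresholds.
counts = Counter(w for text in corpus for w in text.split())
vocabulary = sorted(w for w, c in counts.items() if MIN_COUNT <= c <= MAX_COUNT)

def bag_of_words(text):
    """One feature vector x(i): relative frequency of each known word."""
    tokens = text.split()
    freq = Counter(tokens)
    return [freq[w] / len(tokens) for w in vocabulary]

print(vocabulary)                    # ['a', 'is', 'word']
print(bag_of_words("is is a word"))  # [0.25, 0.5, 0.25] -- 'is' gets 0.5
```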
§ Learning process
• Each candidate model y assigns one weight wj to each feature xj.
• y is evaluated on the training data against a cost function L.
• Based on the result, the weights are adapted to obtain the next model.
• The adaptation relies on an optimization procedure.
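A toy sketch of this loop for a linear model with a squared-error cost L, adapted by plain gradient descent (data, learning rate, and iteration count are hypothetical):

```python
# Sketch: weights w_j adapted iteratively against a squared-error cost.
xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # feature vectors x(i)
ys = [1.0, 2.0, 3.0]                        # known training outputs

w = [0.0, 0.0]          # one weight per feature
learning_rate = 0.1     # a hyperparameter (not optimized in training)

for step in range(200):
    # Evaluate the current model on the training data ...
    grad = [0.0, 0.0]
    for x, y in zip(xs, ys):
        pred = sum(wj * xj for wj, xj in zip(w, x))
        for j in range(len(w)):
            grad[j] += 2 * (pred - y) * x[j] / len(xs)
    # ... and adapt the weights to obtain the next model.
    w = [wj - learning_rate * gj for wj, gj in zip(w, grad)]

print(w)  # converges toward [1.0, 2.0]
```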
§ Hyperparameters
• Many learning algorithms have parameters that are not optimized in training.
• They need to be optimized against a validation set.
Generalization
§ Fitting
• A decision boundary y is learned on the training instances that decides the class of unknown instances.
  (Figure: training instances of two classes, squares and circles, in a feature space X1 × X2,
  separated by a learned decision boundary; an unknown instance is classified by the side it
  falls on. Question: what is an optimal fitting?)
§ Regression
• Assign an instance to the most likely value of a continuous target variable.
  (Figure: a regression model fit to training instances, mapping X1 to a continuous target C.)
§ Ensemble methods
• Meta-algorithms that combine multiple classifiers/regressors.
§ Clustering
• The grouping of a set of instances into a possibly but not
necessarily predefined number of classes.
• The meaning of a class is usually unknown in advance.
• Silhouette analysis. Find the k that maximizes distances between clusters (and balances their size).
  (Figure: cost as a function of the number of clusters, from 1 to |X|, with the best k marked.)
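A sketch of the idea with scikit-learn (assumed available; the instances are hypothetical toy points):

```python
# Sketch: choose the number of clusters k by silhouette analysis.
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

X = [[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]]  # toy instances

best_k, best_score = None, -1.0
for k in range(2, 5):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    score = silhouette_score(X, labels)  # higher = better-separated clusters
    if score > best_score:
        best_k, best_score = k, score

print(best_k)  # 2 for this toy data
```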
§ Euclidean distance

  euclidean(x(1), x(2)) = ( Σi=1..m |xi(1) − xi(2)|² )^½

§ Manhattan distance (aka city block distance)

  manhattan(x(1), x(2)) = Σi=1..m |xi(1) − xi(2)|

(Figures: both distances illustrated between two points in a space X1 × X2.)
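Both distances directly from the formulas, in plain Python:

```python
# Sketch: Euclidean and Manhattan distance between two feature vectors.
import math

def euclidean(x1, x2):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x1, x2)))

def manhattan(x1, x2):
    return sum(abs(a - b) for a, b in zip(x1, x2))

print(euclidean([0.0, 0.0], [3.0, 4.0]))  # 5.0
print(manhattan([0.0, 0.0], [3.0, 4.0]))  # 7.0
```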
§ Semi-supervised learning
• Derive patterns from little training data, then find similar patterns in
unannotated data to get more training data.
§ Reinforcement learning
• Learn, adapt, or optimize a behavior in order to maximize some benefit,
based on feedback provided by the environment.
§ Recommender systems
• Predict missing values of entities based on values of similar entities.
§ Process steps
• Corpus acquisition. Acquire a corpus (and datasets) suitable to study the task.
• Text analysis. Preprocess all instances with existing NLP algorithms, in order to obtain
  information that can be used in features.
• Feature engineering. Identify helpful features on the training set; compute feature vectors
  for each instance on all datasets.
• Machine learning. Train the algorithm on the training set and evaluate it on the validation
  set to optimize hyperparameters. Finally, evaluate on the test set.
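These steps can be sketched end-to-end with scikit-learn (assumed available; the corpus, labels, and model choice are hypothetical placeholders, not the slides' example):

```python
# Sketch: corpus acquisition -> text analysis/features -> learning -> evaluation.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline

texts  = ["great movie", "awful plot", "loved it", "boring and bad"] * 10
labels = ["pos", "neg", "pos", "neg"] * 10

# Split the corpus into datasets for training and testing.
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, random_state=0)

# Feature engineering (bag-of-words) and learning combined in a pipeline.
model = make_pipeline(CountVectorizer(), LogisticRegression())
model.fit(X_train, y_train)

print(model.score(X_test, y_test))  # accuracy on the held-out test set
```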
§ Domain dependency
• Many algorithms work better in the domain of the training texts than in others.
  (Figure: a classifier trained on domain A and, after domain transfer, applied to domain B.)
§ Efficiency challenges
• Large amounts of data may need to be processed, possibly repeatedly.
• Complex, space-intensive models may be learned.
• Often, several time-intensive text analyses are needed.
§ Robustness challenges
• Datasets for training may be biased.
• Many text characteristics are domain-specific.
• Learned algorithms often capture too much variance (i.e., they overfit).
• Linguistic knowledge from phonetics to pragmatics.
• Empirical methods for development and evaluation.
• Rule-based and statistical (machine-learned) algorithms.
§ Goals of NLP
• Technology that can process natural language.
• Empirical explanations of linguistic phenomena.
• Solutions to problems from the real world.
§ Wachsmuth (2019). Henning Wachsmuth. Introduction to Text Mining. Lecture slides. Winter term, 2019.
https://cs.upb.de/css/teaching/courses/text-mining-w19/