Group 7 - Corpus Analysis


CORPUS ANALYSIS
By:
Alissa Rahmasari 22216251110
Shabrina Yumna Azhra 22216251113
Syifaurrahman Fadhilah 22216251111

Group 7 - Data Analysis


CONTENT OF PRESENTATION
01 The Definition
02 Types of Corpus in ELT
03 How to Use Corpus Tools
04 Steps of Doing Corpus Analysis
05 Application of Corpus Analysis
06 Question Session
07 Reference

What is corpus?
A corpus is a large, structured set of spoken or written texts used as a
basis for linguistic study: analyzing language patterns, studying language
variation, and investigating linguistic phenomena.

Text Encoding Initiative (TEI):


A corpus needs to be compiled “according to some conscious set of
design criteria.”
A corpus can contain either complete texts or parts of texts.
A corpus is a type of text that is regarded “as composite, that is,
consisting of several components which are in some important sense
independent of each other.”

Gilquin and Gries (2009)

Different kinds of corpora fall on a scale from most to least prototypical.
A prototypical corpus is:

Machine-readable

Representative

Balanced

The transition from pre-electronic to electronic corpora
Pre-electronic corpora
The Quirk Corpus was created at the Survey of English Usage (SEU) between 1955 and 1985.
It was originally a “print” corpus. It contains 1 million words in 200 samples of spoken
and written English.
(Greenbaum and Svartvik 1990)

Electronic corpora
The computerization of corpora began during the same time period (1960s) at the
University of Edinburgh. The first computerized corpus of English was created there by
John Sinclair: a 135,000-word corpus of conversational British English.
(Tognini Bonelli 2001)

Corpus Analysis
(Vaughan & O'Keeffe, 2015)
Corpus analysis is an empirical research strategy that is
widely used within language research, using authentic
(real, actually attested) language material.

Qualitative corpus analysis is a methodology for pursuing in-depth
investigations of authentic language use, based on a corpus that is
representative of purposefully selected communicative situations.
Hasko, V. (2012)

Types of Corpus
HOW TO USE CORPUS TOOLS

https://voyant-tools.org/

OTHER CORPUS TOOLS

SPSS
NoteTab
AntConc


Steps of Doing Corpus Analysis

Data Collection

Collecting samples of speech (or other spoken language) and written texts.

Getting permission to use texts from speakers or writers.

Keeping careful records about the texts collected.

Encoding spoken and written data.

Meyer, 2023
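
A minimal sketch of the record-keeping and permission steps above, in Python. The TextRecord class, its field names, and the two sample entries are hypothetical and invented for illustration; they do not come from Meyer (2023).

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class TextRecord:
    """Metadata kept for one collected text (all field names are hypothetical)."""
    text_id: str
    mode: str           # "spoken" or "written"
    source: str         # who produced the text
    collected_on: str   # collection date, ISO format
    permission: bool    # True once the speaker or writer has agreed
    text: str           # the transcribed or written text itself


def usable_records(records):
    """Keep only the texts whose speakers or writers gave permission."""
    return [r for r in records if r.permission]


# Invented sample entries, for illustration only.
records = [
    TextRecord("S001", "spoken", "Speaker A", "2023-03-01", True,
               "well I think it is fine"),
    TextRecord("W001", "written", "Writer B", "2023-03-02", False,
               "The results are shown below."),
]

corpus = usable_records(records)

# Store the records as JSON so the metadata stays attached to each text.
catalogue = json.dumps([asdict(r) for r in corpus], indent=2)
print(catalogue)
```

Keeping the permission flag and the metadata in the same record as the text makes it harder for a text to enter the corpus without its documentation.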

Data Analysis

Frame research question


1. What are the linguistic features of Clinton’s and Trump’s speeches, and what
themes do they reflect in each candidate’s campaign?
2. What are the differences between Clinton’s and Trump’s linguistic styles, and
what political beliefs do they represent?

Select suitable corpora to address a particular research question

Example: speeches delivered by Trump and Clinton while they
were running against each other during the campaign.

Meyer, 2023
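
One possible way to compare the vocabulary of two corpora is sketched below in Python, using normalized frequencies and the log-likelihood keyness statistic. This is an assumption about method, not the procedure used in Meyer (2023). The two mini-corpora are invented sentences, not real campaign speeches, and the function names are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def per_thousand(count, total):
    """Normalized frequency: occurrences per 1,000 tokens."""
    return 1000 * count / total


def log_likelihood(a, b, c, d):
    """Log-likelihood keyness for one word.

    a, b: counts of the word in corpus 1 and corpus 2
    c, d: total token counts of corpus 1 and corpus 2
    """
    e1 = c * (a + b) / (c + d)  # expected count in corpus 1
    e2 = d * (a + b) / (c + d)  # expected count in corpus 2
    ll = 0.0
    if a > 0:
        ll += a * math.log(a / e1)
    if b > 0:
        ll += b * math.log(b / e2)
    return 2 * ll


# Invented toy texts standing in for two speakers' speeches.
corpus_a = tokenize("we will win and we will build and we will grow")
corpus_b = tokenize("together we can listen and together we can learn")

freq_a, freq_b = Counter(corpus_a), Counter(corpus_b)

for word in ("will", "together", "we"):
    a, b = freq_a[word], freq_b[word]
    score = log_likelihood(a, b, len(corpus_a), len(corpus_b))
    print(word,
          round(per_thousand(a, len(corpus_a)), 1),
          round(per_thousand(b, len(corpus_b)), 1),
          round(score, 2))
```

A higher score means the word is distributed more unevenly between the two corpora. Real corpora are far larger than these toy texts, so the numbers printed here carry no meaning beyond showing the calculation.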

Data Analysis

Extracting Information from a Corpus

Meyer, 2023
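
Two common ways of extracting information from a corpus are word-frequency lists and keyword-in-context (KWIC) concordances. The Python sketch below shows both under simple assumptions: the sample sentence is invented, the function names are hypothetical, and the tokenizer only handles plain lowercase English words.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(tokens):
    """Count how often each token occurs."""
    return Counter(tokens)


def kwic(tokens, keyword, window=3):
    """Return (left context, keyword, right context) for every hit."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            lines.append((left, tok, right))
    return lines


# Invented sample text, for illustration only.
sample = "We will build the future together. The future belongs to all of us."
tokens = tokenize(sample)
freq = word_frequencies(tokens)

# The most frequent words, then each occurrence of "future" in context.
print(freq.most_common(3))
for left, key, right in kwic(tokens, "future"):
    print(f"{left:>25} [{key}] {right}")
```

Tools such as AntConc and Voyant Tools provide these views through a graphical interface; the sketch only illustrates the idea behind them.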

Data Analysis

Integrating relevant theories into corpus results

Meyer, 2023

Application of Corpus Analysis in ELT
Language pattern and usage
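
One way to look at language patterns and usage is to count which words occur next to a chosen word (its collocates). The Python sketch below assumes a simple window-based count. The sample sentences are invented, the function names are hypothetical, and sentence boundaries are ignored for simplicity.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def bigrams(tokens):
    """Pair each token with the token that follows it."""
    return list(zip(tokens, tokens[1:]))


def collocates(tokens, node, window=2):
    """Count the words found within `window` tokens of each `node` hit."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            start = max(0, i - window)
            neighbours = tokens[start:i] + tokens[i + 1:i + 1 + window]
            counts.update(neighbours)
    return counts


# Invented sample text, for illustration only.
sample = ("Students make mistakes. Teachers make decisions. "
          "Students make progress when they make an effort.")
tokens = tokenize(sample)

# Most frequent two-word sequences, then the words found around "make".
print(Counter(bigrams(tokens)).most_common(2))
print(collocates(tokens, "make", window=1).most_common(3))
```

In a classroom setting, a list like this can show learners which nouns typically follow a verb such as "make".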


Question & Discussion
Reference
Fillmore, C. (1992). “Corpus linguistics” vs. “computer-aided armchair linguistics”. In J. Svartvik (Ed.),
Directions in Corpus Linguistics (pp. 35–60). Berlin: de Gruyter.
Firdaus, S. F., Soemantri, Y. S., & Yuliawati, S. (2021). A corpus-based study of self-mention markers in English
research articles. International Journal of Language Teaching and Education, 5(2), 37–46.
https://doi.org/10.22437/ijolte.v5i2.15695
Gilquin, G., & Gries, S. T. (2009). Corpora and experimental methods: A state-of-the-art review.
Corpus Linguistics and Linguistic Theory, 5, 1–26.
Greenbaum, S., & Svartvik, J. (1990). The London-Lund Corpus of Spoken English. In J. Svartvik (Ed.),
The London-Lund Corpus of Spoken English: Description and Research (pp. 11–45). Lund, Sweden: Lund
University Press.
Hasko, V. (2012). Qualitative corpus analysis. The Encyclopedia of Applied Linguistics.
Meyer, C. F. (2023). English corpus linguistics: An introduction. Cambridge University Press.
Tognini-Bonelli, E. (2001). Corpus Linguistics at Work (Vol. 6). Philadelphia: J. Benjamins.
Vaughan, E., & O'Keeffe, A. (2015). Corpus analysis. The International Encyclopedia of Language and Social
Interaction, 1–17.
