N Gram Model

This document discusses building an n-gram model in Python for natural language processing. It imports NLTK libraries, tokenizes a sample text into words, creates an n-gram dictionary to store 3-grams and their following words, and then uses the dictionary to generate new text by randomly selecting the next word based on the previous 3-gram. The result demonstrates the model generating additional text in a similar style to the original sample.

Uploaded by

Premjit Sengupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views2 pages

N Gram Model

Uploaded by

Premjit Sengupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

4/17/2020 Untitled12 - Jupyter Notebook

In [8]:

# Natural Language Processing using Python

# N-Gram Modelling - Character Grams

# Importing libraries
import random
import nltk

In [10]:

text = """Global warming or climate change has become a worldwide concern. It is gradually

In [11]:

n = 3

In [12]:

ngrams = {}

In [13]:

# Building the model

words = nltk.word_tokenize(text)
for i in range(len(words)-n):
gram = ' '.join(words[i:i+n])
if gram not in ngrams.keys():
ngrams[gram] = []
ngrams[gram].append(words[i+n])

In [14]:

# Testing the model

currentGram = ' '.join(words[0:n])
result = currentGram
for i in range(30):
if currentGram not in ngrams.keys():
break
possibilities = ngrams[currentGram]
nextItem = possibilities[random.randrange(len(possibilities))]
result += ' '+nextItem
rWords = nltk.word_tokenize(result)
currentGram = ' '.join(rWords[len(rWords)-n:len(rWords)])

print(result)

Global warming or climate change has become a worldwide concern . It is grad

ually developing into an unprecedented environmental crisis evident in melti
ng glaciers , changing weather patterns , rising sea levels ,

127.0.0.1:8888/notebooks/Untitled12.ipynb?kernel_name=python3 1/2
4/17/2020 Untitled12 - Jupyter Notebook

In [ ]:

127.0.0.1:8888/notebooks/Untitled12.ipynb?kernel_name=python3 2/2

NLP Lab Manual (R20)
50% (2)
NLP Lab Manual (R20)
24 pages
管新潮"语料库与Python应用"讲座课件
No ratings yet
管新潮"语料库与Python应用"讲座课件
39 pages
Ccs369 - Text and Speech Analysis - Lab Manual
100% (1)
Ccs369 - Text and Speech Analysis - Lab Manual
23 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
15 pages
NLP Exercises
No ratings yet
NLP Exercises
2 pages
English Test 11º Ano
100% (2)
English Test 11º Ano
4 pages
Natural Language Processing
No ratings yet
Natural Language Processing
17 pages
NLP Final
No ratings yet
NLP Final
26 pages
LP Vi Manual
No ratings yet
LP Vi Manual
77 pages
Ccs339 Text and Speech Analysis Lab Manual
No ratings yet
Ccs339 Text and Speech Analysis Lab Manual
51 pages
NLP Lecture2 Text Pre Processing
No ratings yet
NLP Lecture2 Text Pre Processing
54 pages
NLP Final Review
No ratings yet
NLP Final Review
32 pages
NLP Lab - Manual
No ratings yet
NLP Lab - Manual
33 pages
CCS369-Text and Speech Analysis Lab (1-9)
No ratings yet
CCS369-Text and Speech Analysis Lab (1-9)
37 pages
Final NLP Lab File
No ratings yet
Final NLP Lab File
28 pages
Experiment: 1
No ratings yet
Experiment: 1
28 pages
1 - Write A Python Program To Perform Following Tasks On Text A) Tokenization
No ratings yet
1 - Write A Python Program To Perform Following Tasks On Text A) Tokenization
13 pages
UBC Summer School in NLP - VSP 2019 Lecture 10
No ratings yet
UBC Summer School in NLP - VSP 2019 Lecture 10
33 pages
NLP Lab Complete
No ratings yet
NLP Lab Complete
23 pages
Tsa Labmanual
No ratings yet
Tsa Labmanual
26 pages
Tsarecord
No ratings yet
Tsarecord
22 pages
NLP Record
No ratings yet
NLP Record
23 pages
Ai & ML Week-11
No ratings yet
Ai & ML Week-11
32 pages
Aiproject Report
No ratings yet
Aiproject Report
11 pages
Exp7 A10 NLP
No ratings yet
Exp7 A10 NLP
16 pages
Đề Ôn Thi
No ratings yet
Đề Ôn Thi
17 pages
SK NLP Practical (FS)
No ratings yet
SK NLP Practical (FS)
22 pages
ANKUSH
No ratings yet
ANKUSH
20 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
32 pages
UBC Summer School in NLP - VSP 2019 Lecture 9
No ratings yet
UBC Summer School in NLP - VSP 2019 Lecture 9
17 pages
NLP Record
No ratings yet
NLP Record
15 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
17 pages
Climatext: A Dataset For Climate Change Topic Detection
No ratings yet
Climatext: A Dataset For Climate Change Topic Detection
13 pages
Batch 2
No ratings yet
Batch 2
13 pages
NLP - Exp 1 11
No ratings yet
NLP - Exp 1 11
29 pages
Aim - Procedure - Result - Single Side
No ratings yet
Aim - Procedure - Result - Single Side
18 pages
DSBD 7 Ass
No ratings yet
DSBD 7 Ass
9 pages
NLTK 1736134770
No ratings yet
NLTK 1736134770
13 pages
Naan Muthalvan Project - Jagannath
No ratings yet
Naan Muthalvan Project - Jagannath
15 pages
R22 NLP Python Programs
No ratings yet
R22 NLP Python Programs
15 pages
NLP Manual
No ratings yet
NLP Manual
15 pages
Ccs369-Lab Ex 3,4,5
No ratings yet
Ccs369-Lab Ex 3,4,5
8 pages
NLP
No ratings yet
NLP
12 pages
Semantic Networks, Frames, Scripts and Reasoning
No ratings yet
Semantic Networks, Frames, Scripts and Reasoning
8 pages
AI Lab File PDF
No ratings yet
AI Lab File PDF
9 pages
Synonym or Similar Word Detection in Assignment Papers: Gayatri Behera
No ratings yet
Synonym or Similar Word Detection in Assignment Papers: Gayatri Behera
2 pages
NLP Lab1
No ratings yet
NLP Lab1
6 pages
Python NLP Assignment
No ratings yet
Python NLP Assignment
9 pages
NLP Record
No ratings yet
NLP Record
6 pages
CSE 3652 Lab Record Format - PDF
No ratings yet
CSE 3652 Lab Record Format - PDF
13 pages
E11-U6-Practice Test 2: Language I. Pronunciation
No ratings yet
E11-U6-Practice Test 2: Language I. Pronunciation
4 pages
GenAI Shortened
No ratings yet
GenAI Shortened
8 pages
Template - 5th Homework 5th A
No ratings yet
Template - 5th Homework 5th A
4 pages
Ex 4 Harshan R
No ratings yet
Ex 4 Harshan R
3 pages
NLP EXP 3 (B) - Word Generation
No ratings yet
NLP EXP 3 (B) - Word Generation
2 pages
NLP Exp-123
No ratings yet
NLP Exp-123
6 pages
7 Exp
No ratings yet
7 Exp
6 pages
Big Data With Hadoop & Spark - Introduction
No ratings yet
Big Data With Hadoop & Spark - Introduction
28 pages
Countries Region Mapping
No ratings yet
Countries Region Mapping
9 pages
Binomialdistribution 190124111432 PDF
No ratings yet
Binomialdistribution 190124111432 PDF
26 pages
K Nearest Neighbor Algorithm in Python - Towards Data Science
No ratings yet
K Nearest Neighbor Algorithm in Python - Towards Data Science
7 pages
02-Stemming - Jupyter Notebook
No ratings yet
02-Stemming - Jupyter Notebook
4 pages
Additional Reading Material-Probability
No ratings yet
Additional Reading Material-Probability
11 pages
Bag of Words 03 and 04 Model
No ratings yet
Bag of Words 03 and 04 Model
4 pages