0% found this document useful (0 votes)

16 views8 pages

NLP - Project 2

Nlp project for final year

Uploaded by

sagar865241

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views8 pages

NLP - Project 2

Nlp project for final year

Uploaded by

sagar865241

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

AIML MODULE

PROJECT
©Great Learning. Proprietary content. All Rights Reserved. Unauthorised use or distribution prohibited
AIML MODULE PROJECT

5
AIML module projects are designed

1
to have a detailed hands on to
integrate theoretical knowledge with
actual practical implementations.

AIML module projects are designed

2 to enable you as a learner to work

on realtime industry scenarios,
problems and datasets.

AIML module projects are designed

to enable you simulating the

3 designed solution using AIML

techniques onto python technology
platform.

Takeaways 4
AIML module projects are designed
to be scored using a prede ined
rubric based system.

AIML module projects are designed

to enhance your learning above and

5 beyond. Hence, it might require you

to experiment, research, self learn
and implement.

AIML
MODULE
PROJECT
©Great Learning. Proprietary content. All Rights Reserved. Unauthorised use or distribution prohibited Page 1
f
AIML MODULE PROJECT

SEQUENTIAL
NLP
AIML module project Part I and II consists of industry based
NLP dataset which can be used to design a text classi ier using
sequential NLP models.

TOTAL
SCORE 60
©Great Learning. Proprietary content. All Rights Reserved. Unauthorised use or distribution prohibited Page 2
f
AIML MODULE PROJECT

PART
ONE PROJECT BASED TOTAL
SCORE 30
• DOMAIN: Digital content and entertainment industry

• CONTEXT: The objective of this project is to build a text classi ication model that
analyses the customer's sentiments based on their reviews in the IMDB database. The
model uses a complex deep learning model to build an embedding layer followed by
a classi ication algorithm to analyse the sentiment of the customers.

• DATA DESCRIPTION: The Dataset of 50,000 movie reviews from IMDB, labelled by
sentiment (positive/negative). Reviews have been preprocessed, and each review is
encoded as a sequence of word indexes (integers). For convenience, the words are
indexed by their frequency in the dataset, meaning the for that has index 1 is the
most frequent word. Use the irst 20 words from each review to speed up training,
using a max vocabulary size of 10,000. As a convention, "0" does not stand for a
speci ic word, but instead is used to encode any unknown word.

• PROJECT OBJECTIVE: Build a sequential NLP classi ier which can use input text
parameters to determine the customer sentiments.

Steps and tasks: [ Total Score: 30 points]

1. Import and analyse the data set.
Hint: - Use `imdb.load_data()` method
- Get train and test set
- Take 10000 most frequent words
2. Perform relevant sequence adding on the data
3. Perform following data analysis:
• Print shape of features and labels
• Print value of any one feature and it's label
4. Decode the feature value to get original sentence
5. Design, train, tune and test a sequential model.

Hint: The aim here Is to import the text, process it such a way that it can be taken as an inout to the ML/NN
classi iers. Be analytical and experimental here in trying new approaches to design the best model.

6. Use the designed model to print the prediction on any one sample.

©Great Learning. Proprietary content. All Rights Reserved. Unauthorised use or distribution prohibited Page 3
f
f
f
f
f
f
AIML MODULE PROJECT

PART
TWO PROJECT BASED TOTAL
SCORE 30
• DOMAIN: Social media analytics

• CONTEXT: Past studies in Sarcasm Detection mostly make use of Twitter datasets collected
using hashtag based supervision but such datasets are noisy in terms of labels and
language. Furthermore, many tweets are replies to other tweets and detecting sarcasm in
these requires the availability of contextual tweets.In this hands-on project, the goal is to
build a model to detect whether a sentence is sarcastic or not, using Bidirectional LSTMs.

• DATA DESCRIPTION:
The dataset is collected from two news websites, theonion.com and hu ingtonpost.com.
This new dataset has the following advantages over the existing Twitter datasets:
Since news headlines are written by professionals in a formal manner, there are no spelling mistakes and
informal usage. This reduces the sparsity and also increases the chance of inding pre-trained embeddings.
Furthermore, since the sole purpose of TheOnion is to publish sarcastic news, we get high-quality labels with
much less noise as compared to Twitter datasets.
Unlike tweets that reply to other tweets, the news headlines obtained are self-contained. This would help us in
teasing apart the real sarcastic elements
Content: Each record consists of three attributes:
is_sarcastic: 1 if the record is sarcastic otherwise 0
headline: the headline of the news article
article_link: link to the original news article. Useful in collecting supplementary data
Reference: https://github.com/rishabhmisra/News-Headlines-Dataset-For-Sarcasm-Detection

• PROJECT OBJECTIVE: Build a sequential NLP classi ier which can use input text parameters
to determine the customer sentiments.

Steps and tasks: [ Total Score: 30 points]

1. Read and explore the data

2. Retain relevant columns
3. Get length of each sentence
4. De ine parameters
5. Get indices for words
6. Create features and labels
7. Get vocabulary size
8. Create a weight matrix using GloVe embeddings
9. De ine and compile a Bidirectional LSTM model.
Hint: Be analytical and experimental here in trying new approaches to design the best model.
10. Fit the model and check the validation accuracy

©Great Learning. Proprietary content. All Rights Reserved. Unauthorised use or distribution prohibited Page 4
f
f
f
ff
f
AIML MODULE PROJECT

LEARNING
OUTCOME
Hands on experience on importing, pre-processing and computing a text
dataset using python.

Using your learnings on text embeddings.

Realtime experience working on designing, training, tuning and testing

sequential NLP classi iers.

“ Put yourself in the shoes of an actual ”

DATA SCIENTIST

THAT’s YOU
Assume that you are working at the company which
has received the above problem statement from
internal/external client. Finding the best solution for
the problem statement will enhance the business/
operations for your organisation/project. You are
responsible for the complete delivery. Put your best
analytical thinking hat to squeeze the raw data into
relevant insights and later into an AIML working model.

PLEASE NOTE
Designing a data driven decision product typically traces the following process:
1. Data and insights:

Warehouse the relevant data. Clean and validate the data as per the the functional requirements of the problem statement. Capture and validate

all possible insights from the data as per the the functional requirements of the problem statement. Please remember there will be numerous

ways to achieve this. Sticking to relevance is of utmost importance. Pre-process the data which can be used for relevant AIML model.

2. AIML training:

Use the data to train and test a relevant AIML model. Tune the model to achieve the best possible learnings out of the data. This is an iterative

process where your knowledge on the above data can help to debug and improvise. Di erent AIML models react di erently and perform

depending on quality of the data. Baseline your best performing model and store the learnings for future usage.

3. AIML end product:

Design a trigger or user interface for the business to use the designed AIML model for future usage. Maintain, support and keep the model/

product updated by continuous improvement/training. These are generally triggered by time, business or change in data.

IMPORTANT
POINTERS
Project should be submitted as a single “.html” and “.ipynb” ile. Follow the below
best practices where your submission should be:
• ”.html” and ".ipynb" iles should be an exact match.
• Pre-run codes with all outputs intact.
• Error free & machine independent i.e. run on any machine without adding any extra code.
• Well commented for clarity on code designed, assumptions made, approach taken, insights
found and results obtained.

Project should be submitted on or before the

deadline given by the program o ice.

Project submission should be an original work

from you as a learner. If any percentage of
Submission plagiarism found in the submission, the project
will not be evaluated and no score will be given.

CNN - Project
No ratings yet
CNN - Project
8 pages
NLP-2 - Problem Statement
No ratings yet
NLP-2 - Problem Statement
3 pages
NLP-2 - Problem Statement
No ratings yet
NLP-2 - Problem Statement
3 pages
Sentiment Analysis Using NLP
No ratings yet
Sentiment Analysis Using NLP
42 pages
Internship Presentation
No ratings yet
Internship Presentation
16 pages
Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
No ratings yet
Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
7 pages
Ashwin Prasanth PT1 Project
No ratings yet
Ashwin Prasanth PT1 Project
38 pages
Text Classification and Processing Using NLP
No ratings yet
Text Classification and Processing Using NLP
21 pages
Software Report - Final 10 Pages
No ratings yet
Software Report - Final 10 Pages
15 pages
NLP Long Que Ans
No ratings yet
NLP Long Que Ans
20 pages
NLP Project (Documentation)
No ratings yet
NLP Project (Documentation)
8 pages
Malignant Comments Classifier Project
No ratings yet
Malignant Comments Classifier Project
30 pages
Document From Atharva
No ratings yet
Document From Atharva
8 pages
Building An AI Model Capable of Judging User Sentiments
No ratings yet
Building An AI Model Capable of Judging User Sentiments
2 pages
Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
No ratings yet
Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
8 pages
Report Dhruv
No ratings yet
Report Dhruv
28 pages
NPL Assignment 1
No ratings yet
NPL Assignment 1
5 pages
Artificial and Intelligence
No ratings yet
Artificial and Intelligence
19 pages
Major Project-II Project Report AIML Batch 1
No ratings yet
Major Project-II Project Report AIML Batch 1
61 pages
Batch 17
No ratings yet
Batch 17
27 pages
Project
No ratings yet
Project
11 pages
Cream and Dark Brown Aesthetic Abstract Corner Project Presentation - 20250702 - 205800 - 0000
No ratings yet
Cream and Dark Brown Aesthetic Abstract Corner Project Presentation - 20250702 - 205800 - 0000
17 pages
New Lms Project 1
No ratings yet
New Lms Project 1
70 pages
Group 4 MovieReview
No ratings yet
Group 4 MovieReview
10 pages
Machine Learning Engineer Nanodegree Program Syllabus PDF
No ratings yet
Machine Learning Engineer Nanodegree Program Syllabus PDF
6 pages
Python - Project 2 Problem Statement
No ratings yet
Python - Project 2 Problem Statement
3 pages
Theolaaaa4273 Merged
No ratings yet
Theolaaaa4273 Merged
76 pages
CS663-2024-Executive NLP - Assignment Sentiment Analysis
No ratings yet
CS663-2024-Executive NLP - Assignment Sentiment Analysis
4 pages
AIML PGCP Project B21
No ratings yet
AIML PGCP Project B21
6 pages
Shivamani
No ratings yet
Shivamani
63 pages
Arsalan's Project
No ratings yet
Arsalan's Project
4 pages
Sentiment Analysis
100% (1)
Sentiment Analysis
35 pages
Quiz - Generation - Model - Using - Machine - Learning PPT AMIT KUMAR
No ratings yet
Quiz - Generation - Model - Using - Machine - Learning PPT AMIT KUMAR
8 pages
COMP 4650 6490 Assignment 3 2023-v1.1
No ratings yet
COMP 4650 6490 Assignment 3 2023-v1.1
6 pages
Report
No ratings yet
Report
18 pages
Analyzing Sentiment Using IMDb Dataset
No ratings yet
Analyzing Sentiment Using IMDb Dataset
4 pages
Presentation 16
No ratings yet
Presentation 16
8 pages
Final IBM-CBSE - AI - Project - Logbook
No ratings yet
Final IBM-CBSE - AI - Project - Logbook
52 pages
Detect Sarcastic
No ratings yet
Detect Sarcastic
34 pages
Arsalan's Project New
No ratings yet
Arsalan's Project New
4 pages
Natural Language Processing
No ratings yet
Natural Language Processing
5 pages
Youtube Analysis3
No ratings yet
Youtube Analysis3
58 pages
1-5 Cs PDH
No ratings yet
1-5 Cs PDH
5 pages
Sentiment Analysis Chatbot
No ratings yet
Sentiment Analysis Chatbot
8 pages
Martin, Adrián Rodríguez, Barcelona - 2018 - Toxic Comment Classification Using Convolutional and Recurrent Neural Networks-Annotated
No ratings yet
Martin, Adrián Rodríguez, Barcelona - 2018 - Toxic Comment Classification Using Convolutional and Recurrent Neural Networks-Annotated
4 pages
Deep Learning Nanodegree Syllabus: Project: Find Donors For Charityml
No ratings yet
Deep Learning Nanodegree Syllabus: Project: Find Donors For Charityml
13 pages
Report in ML
No ratings yet
Report in ML
9 pages
AI Harmful Content
No ratings yet
AI Harmful Content
4 pages
Mini Project
No ratings yet
Mini Project
16 pages
Text Classification - Movie Review - News Wires
No ratings yet
Text Classification - Movie Review - News Wires
5 pages
Mach Weird
No ratings yet
Mach Weird
8 pages
A18 CU6051NA A2 CW Coursework 16034872 Anjil Shrestha
No ratings yet
A18 CU6051NA A2 CW Coursework 16034872 Anjil Shrestha
34 pages
Harsh Internship
No ratings yet
Harsh Internship
18 pages
Deep Learning Journal
No ratings yet
Deep Learning Journal
6 pages
NLP Final Mini Project
No ratings yet
NLP Final Mini Project
17 pages
AI Report Shivam
No ratings yet
AI Report Shivam
8 pages
Sentiment Analysis IMDB Review - Presentation
No ratings yet
Sentiment Analysis IMDB Review - Presentation
19 pages
Prompt Engineering
100% (2)
Prompt Engineering
26 pages
AI for Everyone: An Intermediate Guide to Artificial Intelligence
From Everand
AI for Everyone: An Intermediate Guide to Artificial Intelligence
Nova Clarke
No ratings yet
Machine Learning in Production: Master the art of delivering robust Machine Learning solutions with MLOps (English Edition)
From Everand
Machine Learning in Production: Master the art of delivering robust Machine Learning solutions with MLOps (English Edition)
Suhas Pote
No ratings yet
Ads R2022
No ratings yet
Ads R2022
178 pages
Document
No ratings yet
Document
6 pages
Project 1name - Excel Activities in Email Automation - People - Email
No ratings yet
Project 1name - Excel Activities in Email Automation - People - Email
4 pages
ETECH WorkSheet 1
No ratings yet
ETECH WorkSheet 1
10 pages
D8129 Octo Relay Mod Installation Manual enUS 2538142603
No ratings yet
D8129 Octo Relay Mod Installation Manual enUS 2538142603
8 pages
Cics Mock Test III
No ratings yet
Cics Mock Test III
6 pages
B.tech Syllabus 3rd Semester For CSE IPU
No ratings yet
B.tech Syllabus 3rd Semester For CSE IPU
8 pages
Twinkle
No ratings yet
Twinkle
2 pages
KS18 Data Centres - An Introduction To Concepts and Design (2012)
No ratings yet
KS18 Data Centres - An Introduction To Concepts and Design (2012)
85 pages
De Pin
No ratings yet
De Pin
22 pages
(English (Auto-Generated) ) How To Scrape Leads From EVERY Social Media Platform (2025) (DownSub - Com)
No ratings yet
(English (Auto-Generated) ) How To Scrape Leads From EVERY Social Media Platform (2025) (DownSub - Com)
32 pages
A Wormhole Attack Detection and Prevention Techniq
No ratings yet
A Wormhole Attack Detection and Prevention Techniq
9 pages
05 Interfacing and Communication
No ratings yet
05 Interfacing and Communication
57 pages
Lec6 PDF
No ratings yet
Lec6 PDF
22 pages
2.PC Jotun Chart1011 PDF
No ratings yet
2.PC Jotun Chart1011 PDF
3 pages
Exit Process Deck - V1.16
No ratings yet
Exit Process Deck - V1.16
24 pages
Download: Solutions Intermediate Progress Tests Unit 1answer
No ratings yet
Download: Solutions Intermediate Progress Tests Unit 1answer
2 pages
DrTVelmurugan Profile 1
No ratings yet
DrTVelmurugan Profile 1
35 pages
Java Theory (9th Class)
No ratings yet
Java Theory (9th Class)
13 pages
Document 1
No ratings yet
Document 1
3 pages
Nihilize
No ratings yet
Nihilize
6 pages
1.2 How To Create Routes and AVCS Order
No ratings yet
1.2 How To Create Routes and AVCS Order
5 pages
Barkatullah University Online Migration Form
67% (6)
Barkatullah University Online Migration Form
34 pages
Lecture 1: Matrices and Systems of Linear Equations: Brandon Behring
No ratings yet
Lecture 1: Matrices and Systems of Linear Equations: Brandon Behring
37 pages
EQP S3 Software
No ratings yet
EQP S3 Software
57 pages
Open Source Intelligence Techniques Resources For Searching and Analyzing Online Information 6th Edition Michael Bazzell Download
No ratings yet
Open Source Intelligence Techniques Resources For Searching and Analyzing Online Information 6th Edition Michael Bazzell Download
86 pages
Cypress Programmer User Guide
No ratings yet
Cypress Programmer User Guide
28 pages
Air-to-Air Visual Detection of Micro-UAVs An Experimental Evaluation of Deep Learning
No ratings yet
Air-to-Air Visual Detection of Micro-UAVs An Experimental Evaluation of Deep Learning
8 pages
Sample Output To Test PDF Combine Only
No ratings yet
Sample Output To Test PDF Combine Only
122 pages
Resume 2
No ratings yet
Resume 2
1 page

NLP - Project 2

Uploaded by

NLP - Project 2

Uploaded by

AIML MODULE

AIML module projects are designed

2 to enable you as a learner to work

AIML module projects are designed

3 designed solution using AIML

AIML module projects are designed

5 beyond. Hence, it might require you

Steps and tasks: [ Total Score: 30 points]

Steps and tasks: [ Total Score: 30 points]

1. Read and explore the data

Using your learnings on text embeddings.

Realtime experience working on designing, training, tuning and testing

“ Put yourself in the shoes of an actual ”

3. AIML end product:

Project should be submitted on or before the

Project submission should be an original work

You might also like