
(An Autonomous Institution)

Department of Artificial Intelligence and Data Science

LABORATORY MANUAL

Course Code : UAD1511


Course Name : Machine Learning Techniques Laboratory
Year / Semester : Third Year / Fifth Semester
Department : Artificial Intelligence and Data Science
Faculty Name : Ms.R.Dhaaraani
Mrs.R.K.Ananthi
Academic year : 2022 – 2023, ODD
INDEX

Sl. No.  Name of the Exercise
1. Implementation of decision trees for real world problem

2. Detection of spam mails using Support Vector Machine

3. Implementation of facial recognition application with artificial neural network

4. Implementation of Amazon toolkit: SageMaker

5. Implementation of character recognition using Multilayer Perceptron

6. Implementation of the non-parametric Locally Weighted Regression algorithm

7. Implementation of sentiment analysis using random forest optimization algorithm

8. Construction of a Bayesian network considering medical data

9. Implementation of online fraud detection using best machine learning algorithm

10. Mini Project

11. Sentiment Analysis using Lexicon Classification Algorithm


Ex.No:1 IMPLEMENTATION OF DECISION TREES FOR REAL WORLD PROBLEM

Aim:
To implement the concept of decision trees with a suitable real-world data set and classify new samples with the trained tree.

Algorithm:
Step 1: Start the program

Step 2: Import the required modules

Step 3: Load the dataset in .csv format

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement the decision tree algorithm to solve the real-world problem

Step 6: Print the obtained results

Step 7: Stop the program

Program
import pandas
from sklearn import tree
from sklearn.tree import DecisionTreeClassifier
import matplotlib.pyplot as plt

# load the dataset
df = pandas.read_csv("C:\\Users\\IT\\Downloads\\data - Sheet1.csv")

# map the categorical columns to numeric values
d = {'UK': 0, 'USA': 1, 'N': 2}
df['Nationality'] = df['Nationality'].map(d)
d = {'YES': 1, 'NO': 0}
df['Go'] = df['Go'].map(d)

# separate the features and the target column
features = ['Age', 'Experience', 'Rank', 'Nationality']
X = df[features]
y = df['Go']

# train the decision tree and visualize it
dtree = DecisionTreeClassifier()
dtree = dtree.fit(X, y)
tree.plot_tree(dtree, feature_names=features)
plt.show()
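The aim also asks for classifying a new sample, which the listing above does not show; a minimal sketch, assuming the same feature order and encodings (the sample values are hypothetical):

# predict 'Go' for a hypothetical new sample: Age 40, Experience 10, Rank 7, Nationality USA (encoded as 1)
print(dtree.predict([[40, 10, 7, 1]]))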

Output:

Result:
Thus, the implementation of decision trees for a suitable real-world problem has been completed successfully.
Ex.No:2 DETECTION OF SPAM MAILS USING SUPPORT VECTOR MACHINE

Aim:
To implement the detection of spam mails using Support Vector Machine

Algorithm

Step 1: Start the program

Step 2: Import the required modules

Step 3: Load the dataset in .csv format

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement the Support Vector Machine algorithm to detect the spam mails

Step 6: Print the obtained results

Step 7: Stop the program

Program
import numpy as np
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn import svm
from sklearn.svm import SVC

# load the dataset
data = pd.read_csv("C:\\Users\\IT\\Downloads\\spam.csv")
print(data)
print(data.info())

X = data['EmailText'].values
print(X)
Y = data['Label'].values
print(Y)

# the email text must be turned into numeric features before the SVM can use it
cv = CountVectorizer()
X = cv.fit_transform(X)

# split into training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, Y, test_size=0.3, random_state=10)

# train the SVM classifier and report its accuracy on the test set
classifier = SVC(kernel='rbf', random_state=10)
classifier.fit(X_train, y_train)
print(classifier.score(X_test, y_test))
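The listing imports GridSearchCV but never uses it; a minimal sketch of how it could tune the SVC hyperparameters, assuming the vectorized X_train, y_train, X_test and y_test produced above (the parameter grid is illustrative):

param_grid = {'kernel': ['linear', 'rbf'], 'C': [0.1, 1, 10]}
grid = GridSearchCV(SVC(), param_grid, cv=5)
grid.fit(X_train, y_train)
print("Best parameters:", grid.best_params_)
print("Test accuracy:", grid.score(X_test, y_test))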

Output

Result:
Thus, the detection of spam mails using Support Vector Machine has been completed
successfully.

Ex.No:3 IMPLEMENTATION OF FACIAL RECOGNITION APPLICATION WITH ARTIFICIAL NEURAL NETWORK

Aim:
To implement facial recognition application with Artificial Neural Network.

Algorithm
Step 1: Start the program

Step 2: Import the required modules

Step 3: Load the dataset of face images

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement the artificial neural network to recognize the faces

Step 6: Print the obtained results

Step 7: Stop the program

Program
import numpy as np
import tensorflow as tf
import keras
from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Dense, Flatten, Dropout, BatchNormalization
import matplotlib.pyplot as plt
from tensorflow.keras.preprocessing import image
from tensorflow.keras.preprocessing.image import ImageDataGenerator

train_dir="../input/face-recognition-dataset/Original Images/Original Images/"


generator = ImageDataGenerator()
train_ds = generator.flow_from_directory(train_dir, target_size=(224, 224), batch_size=32)
classes = list(train_ds.class_indices.keys())

model = Sequential()
model.add(Conv2D(32, kernel_size=(3, 3), activation='relu', input_shape=(224, 224, 3)))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(BatchNormalization())
model.add(Conv2D(64, kernel_size=(3, 3), activation='relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(BatchNormalization())
model.add(Conv2D(64, kernel_size=(3, 3), activation='relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(BatchNormalization())
model.add(Conv2D(96, kernel_size=(3, 3), activation='relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(BatchNormalization())
model.add(Conv2D(32, kernel_size=(3, 3), activation='relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(BatchNormalization())
model.add(Dropout(0.2))
model.add(Flatten())
model.add(Dense(128, activation='relu'))
#model.add(Dropout(0.3))
model.add(Dense(len(classes), activation='softmax'))

model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=["accuracy"])

model.summary()

history = model.fit(train_ds, epochs=30, batch_size=32)

plt.plot(history.history['accuracy'])

plt.plot(history.history['loss'])
plt.xlabel('Time')

plt.legend(['accuracy', 'loss'])

plt.show()

def predict_image(image_path):
    img = image.load_img(image_path, target_size=(224, 224, 3))
    plt.imshow(img)
    plt.show()
    x = image.img_to_array(img)
    x = np.expand_dims(x, axis=0)
    images = np.vstack([x])
    pred = model.predict(images, batch_size=32)
    print("Actual: " + (image_path.split("/")[-1]).split("_")[0])
    print("Predicted: " + classes[np.argmax(pred)])

predict_image("../input/face-recognition-dataset/Original Images/Original Images/Brad Pitt/Brad Pitt_102.jpg")

Output:

Result:
Thus, the artificial neural network for facial recognition has been implemented successfully.
Ex.No:4 IMPLEMENTATION OF AMAZON TOOLKIT: SAGEMAKER

Aim:
To implement a machine learning workflow using the Amazon SageMaker toolkit

Procedure:
1. Setting up SageMaker

2. Uploading data to S3 storage

3. Writing the machine learning pipeline using Python

4. Creating the notebook interface for Amazon SageMaker

5. Creating the role setup for Amazon SageMaker

6. Notebook created in Amazon SageMaker

7. Selecting S3 storage in AWS

8. Creating a custom dataset in S3 storage in AWS

9. Data uploaded to the S3 storage in AWS

10. Creating a notebook in Amazon SageMaker

11. Python code for importing the dataset from S3 storage and using it in the machine learning pipeline in the Amazon SageMaker notebook (a sketch is given below)

12. Amazon's proprietary machine learning models available in Amazon SageMaker
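A minimal code sketch of steps 9 to 11 follows. It assumes the notebook runs on a SageMaker notebook instance with the sagemaker Python SDK available, and that a local file named data.csv (a hypothetical name) holds the custom dataset; reading the S3 URI directly with pandas additionally assumes the s3fs package is installed.

import pandas as pd
import sagemaker

# the SageMaker session and the IAM role attached to the notebook instance (step 5)
session = sagemaker.Session()
role = sagemaker.get_execution_role()

# upload the local CSV to an S3 bucket (steps 8 and 9); the default session bucket is used here
bucket = session.default_bucket()
s3_uri = session.upload_data("data.csv", bucket=bucket, key_prefix="mlt-lab-data")
print("Uploaded to:", s3_uri)

# read the dataset back from S3 inside the notebook and start the machine learning pipeline (step 11)
df = pd.read_csv(s3_uri)
print(df.head())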


Result:
Thus, the implementation of Amazon Toolkit Sagemaker was completed
successfully.

Ex.No:5 IMPLEMENTATION OF CHARACTER RECOGNITION USING MULTILAYER PERCEPTRON

Aim:
To implement character recognition using a Multilayer Perceptron

Algorithm
Step 1: Start the program

Step 2: Import the required modules

Step 3: Load the dataset of images

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement the multilayer perceptron to recognize the characters

Step 6: Print the obtained results

Step 7: Stop the program

Program
# importing modules
import tensorflow as tf
import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Flatten
from tensorflow.keras.layers import Dense
from tensorflow.keras.layers import Activation
import matplotlib.pyplot as plt
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
# Cast the records into float values
x_train = x_train.astype('float32')
x_test = x_test.astype('float32')
# normalize image pixel values by dividing
# by 255
gray_scale = 255
x_train /= gray_scale
x_test /= gray_scale
print("Feature matrix:", x_train.shape)
print("Target matrix:", x_test.shape)
print("Feature matrix:", y_train.shape)
print("Target matrix:", y_test.shape)
fig, ax = plt.subplots(10, 10)
k = 0
for i in range(10):
    for j in range(10):
        ax[i][j].imshow(x_train[k].reshape(28, 28), aspect='auto')
        k += 1
plt.show()
model = Sequential([
# reshape 28 row * 28 column data to 28*28 rows
Flatten(input_shape=(28, 28)),
# dense layer 1
Dense(256, activation='sigmoid'),
# dense layer 2
Dense(128, activation='sigmoid'),
# output layer
Dense(10, activation='sigmoid'),])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])
model.fit(x_train, y_train, epochs=10,batch_size=2000,validation_split=0.2)
results = model.evaluate(x_test, y_test, verbose = 0)
print('test loss, test acc:', results)
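A short usage sketch, assuming the trained model above, to predict the digit contained in a single test image:

# predict the class of the first test image and compare it with the true label
pred = model.predict(x_test[:1])
print("Predicted digit:", np.argmax(pred), "Actual digit:", y_test[0])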

Output:
Downloading data from https://storage.googleapis.com/tensorflow/tf-keras-
datasets/mnist.npz
11493376/11490434 [==============================] - 2s 0us/step
Train feature matrix: (60000, 28, 28)
Test feature matrix: (10000, 28, 28)
Train target vector: (60000,)
Test target vector: (10000,)
test loss, test acc: [0.27210235595703125, 0.9223999977111816]

Result:
Thus the character recognition using multilayer perceptron was implemented
successfully.

Ex.No:6 IMPLEMENTATION OF THE NON-PARAMETRIC LOCALLY WEIGHTED REGRESSION ALGORITHM
Aim:
To implement the non-parametric locally weighted regression algorithm

Algorithm
Step 1: Start the program

Step 2: Import the required modules

Step 3: Load the dataset

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement the locally weighted regression algorithm

Step 6: Print the obtained results

Step 7: Stop the program

Program
import numpy as np
from bokeh.plotting import figure, show, output_notebook
from bokeh.layouts import gridplot
from bokeh.io import push_notebook

def local_regression(x0, X, Y, tau):
    # add bias term so the intercept is learned
    x0 = np.r_[1, x0]
    X = np.c_[np.ones(len(X)), X]

    # fit model: normal equations with kernel weights
    xw = X.T * radial_kernel(x0, X, tau)       # X^T * W
    beta = np.linalg.pinv(xw @ X) @ xw @ Y     # @ is matrix multiplication (dot product)

    # predict the value at x0
    return x0 @ beta

def radial_kernel(x0, X, tau):
    # weight (radial kernel) function
    return np.exp(np.sum((X - x0) ** 2, axis=1) / (-2 * tau * tau))

n = 1000

# generate dataset
X = np.linspace(-3, 3, num=n)
print("The Data Set (10 Samples) X :\n", X[1:10])
Y = np.log(np.abs(X ** 2 - 1) + .5)
print("The Fitting Curve Data Set (10 Samples) Y :\n", Y[1:10])

# jitter X
X += np.random.normal(scale=.1, size=n)
print("Jittered (10 Samples) X :\n", X[1:10])

domain = np.linspace(-3, 3, num=300)
print("Xo Domain Space (10 Samples) :\n", domain[1:10])

def plot_lwr(tau):
    # prediction through locally weighted regression over the whole domain
    prediction = [local_regression(x0, X, Y, tau) for x0 in domain]
    plot = figure(plot_width=400, plot_height=400)
    plot.title.text = 'tau=%g' % tau
    plot.scatter(X, Y, alpha=.3)
    plot.line(domain, prediction, line_width=2, color='red')
    return plot

show(gridplot([[plot_lwr(10.), plot_lwr(1.)], [plot_lwr(0.1), plot_lwr(0.01)]]))
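For reference, local_regression and radial_kernel above implement the standard locally weighted least-squares fit. Each training point $x^{(i)}$ receives the weight

$w^{(i)} = \exp\left( -\dfrac{\lVert x^{(i)} - x_0 \rVert^2}{2\tau^2} \right)$

and the local coefficients at the query point $x_0$ are

$\hat{\beta}(x_0) = (X^{T} W X)^{-1} X^{T} W Y$

where $W$ is the diagonal matrix of the weights; the prediction is $x_0^{T}\hat{\beta}(x_0)$. The bandwidth tau controls how quickly the weights decay with distance from $x_0$, which is why the four plots with tau = 10, 1, 0.1 and 0.01 range from an almost global linear fit to a very wiggly local fit.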

Output
Result:
Thus, the implementation of the non-parametric locally weighted regression algorithm has been completed successfully.

Ex.No:7 IMPLEMENTATION OF SENTIMENT ANALYSIS USING RANDOM FOREST OPTIMIZATION ALGORITHM

Aim:
To implement the sentiment analysis using Random Forest optimization Algorithm

Algorithm:

Step 1: Start the program

Step 2: Import the required modules

Step 3: Load the dataset in .csv format

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement the Random Forest algorithm to analyse the sentiments


Step 6: Print the obtained results

Step 7: Stop the program

Program:
import numpy as np  # linear algebra
import pandas as pd  # data processing, CSV file I/O (e.g. pd.read_csv)
import seaborn as sns
import matplotlib.pyplot as plt
import re
import nltk

# Input data files are available in the read-only "../input/" directory
# For example, running this (by clicking run or pressing Shift+Enter) will list all files under the input directory
import os
for dirname, _, filenames in os.walk('/kaggle/input'):
    for filename in filenames:
        print(os.path.join(dirname, filename))

imdb = pd.read_csv("C:\\Users\\produ\\Downloads\\new\\Train.csv")
print(imdb.info())
print(imdb.shape)
print(imdb.head(10))

imdb['label'].value_counts().plot.pie(figsize=(6, 6), title="Distribution of reviews per sentiment", labels=['', ''], autopct='%1.1f%%')
labels = ["Positive", "Negative"]
plt.legend(labels, loc=3)
plt.gca().set_aspect('equal')

from sklearn.model_selection import train_test_split

features = imdb.drop("label", axis=1)
labels = imdb["label"]

X_train, X_test, y_train, y_test = train_test_split(features, labels, test_size=0.90, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X_train, y_train, test_size=0.5, random_state=42)
X_val, X_test, y_val, y_test = train_test_split(X_test, y_test, test_size=0.5, random_state=42)

print("Data distribution:\n- Train: {} \n- Validation: {} \n- Test: {}".format(len(y_train), len(y_val), len(y_test)))

Output:
Result:
Thus, the implementation of sentiment analysis using the random forest optimization algorithm was performed successfully using a Python program.

Ex.No:8 CONSTRUCTION OF A BAYESIAN NETWORK CONSIDERING MEDICAL DATA


Aim:
To write a python program to construct a Bayesian network considering Medical
data

Algorithm:
Step 1: Start the program

Step 2: Import the required modules

Step 3: Load the dataset in .csv format

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement the Bayesian Network considering medical data

Step 6: Print the obtained results

Step 7: Stop the program

Program:
import pandas as pd
from pgmpy.estimators import MaximumLikelihoodEstimator
from pgmpy.models import BayesianNetwork
from pgmpy.inference import VariableElimination
data = pd.read_csv("C:\\Users\\produ\\Desktop\\ds4.csv")
heart_disease = pd.DataFrame(data)
print(heart_disease)
model = BayesianNetwork([
    ('age', 'Lifestyle'),
    ('Gender', 'Lifestyle'),
    ('Family', 'heartdisease'),
    ('Lifestyle', 'diet'),
    ('diet', 'cholestrol'),
    ('cholestrol', 'heartdisease')])
model.fit(heart_disease, estimator=MaximumLikelihoodEstimator)
HeartDisease_infer = VariableElimination(model)
print('For Age enter SuperSeniorCitizen:0, SeniorCitizen:1, MiddleAged:2, Youth:3, Teen:4')
print('For Gender enter Male:0, Female:1')
print('For Family History enter Yes:1, No:0')
print('For Diet enter High:0, Medium:1')
print('for LifeStyle enter Athlete:0, Active:1, Moderate:2, Sedentary:3')
print('for Cholesterol enter High:0, BorderLine:1, Normal:2')
q = HeartDisease_infer.query(variables=['heartdisease'], evidence={
'age': int(input('Enter Age: ')),
'Gender': int(input('Enter Gender: ')),
'Family': int(input('Enter Family History: ')),
'diet': int(input('Enter Diet: ')),
'Lifestyle': int(input('Enter Lifestyle: ')),
'cholestrol': int(input('Enter Cholestrol: '))})
print(q)
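An optional extra query, assuming the fitted model above, showing that the network can also be asked for the probability of heart disease from partial evidence only (the evidence values are illustrative):

# probability of heart disease given only the age group and the cholesterol level
q2 = HeartDisease_infer.query(variables=['heartdisease'], evidence={'age': 1, 'cholestrol': 0})
print(q2)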
Output:
Result:
Thus, the construction of Bayesian network considering medical data was
implemented successfully using python.

Ex.No:9 IMPLEMENTATION OF ONLINE FRAUD DETECTION USING BEST MACHINE LEARNING ALGORITHM

Aim:
To implement online fraud detection using best machine learning algorithm

Algorithm:
Step 1: Start the program

Step 2: Import the required modules

Step 3: Load the dataset in .csv format

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement the Random Forest algorithm for online fraud detection

Step 6: Print the obtained results

Step 7: Stop the program

Program:
import pandas as pd
data=pd.read_csv("credit.csv",encoding='windows-1252').dropna()
print(data)
X=data.drop(['Class'],axis=1)
print(X)
Y=data['Class']
print(Y)
from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(X, Y, test_size=0.30)
print(x_train)
print(x_test)
print(y_train)
print(y_test)
from sklearn.ensemble import RandomForestClassifier
clf=RandomForestClassifier(n_estimators=100)
clf.fit(x_train,y_train)
y_pred=clf.predict(x_test)
print(y_pred)
from sklearn.metrics import accuracy_score
acc1=accuracy_score(y_pred,y_test)
print(acc1)
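Fraud data is usually highly imbalanced, so accuracy alone can hide poor detection of the fraud class; a short optional addition, assuming the variables above, to inspect per-class performance:

from sklearn.metrics import confusion_matrix, classification_report

print(confusion_matrix(y_test, y_pred))
print(classification_report(y_test, y_pred))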

Output:
Result:
Thus, online fraud detection using the best machine learning algorithm was implemented successfully.

Ex.No:10 MINI PROJECT

Aim:
To write a Python program that applies a machine learning solution to a socially relevant problem.

Algorithm:
Step 1: Start the program
Step 2: Import the required modules

Step 3: Load the dataset in .csv format

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement a suitable machine learning model for the chosen problem (here, an LSTM network for stock price prediction)

Step 6: Print the obtained results

Step 7: Stop the program

Program:

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
from matplotlib.pylab import rcParams
rcParams['figure.figsize'] = 20, 10
from keras.models import Sequential
from keras.layers import LSTM, Dropout, Dense
from sklearn.preprocessing import MinMaxScaler

# load the stock price data and index it by date
df = pd.read_csv("NSE-TATA.csv")
df.head()
df["Date"] = pd.to_datetime(df.Date, format="%Y-%m-%d")
df.index = df['Date']

plt.figure(figsize=(16, 8))
plt.plot(df["Close"], label='Close Price history')

# build a Date/Close dataset sorted by date
data = df.sort_index(ascending=True, axis=0)
new_dataset = pd.DataFrame(index=range(0, len(df)), columns=['Date', 'Close'])
for i in range(0, len(data)):
    new_dataset["Date"][i] = data['Date'][i]
    new_dataset["Close"][i] = data['Close'][i]

new_dataset.index = new_dataset.Date
new_dataset.drop("Date", axis=1, inplace=True)

# scale the closing prices to the range [0, 1]
final_dataset = new_dataset.values
train_data = final_dataset[0:987, :]
valid_data = final_dataset[987:, :]
scaler = MinMaxScaler(feature_range=(0, 1))
scaled_data = scaler.fit_transform(final_dataset)

# build sequences of the 60 previous closing prices for each training sample
x_train_data, y_train_data = [], []
for i in range(60, len(train_data)):
    x_train_data.append(scaled_data[i-60:i, 0])
    y_train_data.append(scaled_data[i, 0])
x_train_data, y_train_data = np.array(x_train_data), np.array(y_train_data)
x_train_data = np.reshape(x_train_data, (x_train_data.shape[0], x_train_data.shape[1], 1))

# LSTM model: two LSTM layers followed by a dense output layer
lstm_model = Sequential()
lstm_model.add(LSTM(units=50, return_sequences=True, input_shape=(x_train_data.shape[1], 1)))
lstm_model.add(LSTM(units=50))
lstm_model.add(Dense(1))

lstm_model.compile(loss='mean_squared_error', optimizer='adam')
lstm_model.fit(x_train_data, y_train_data, epochs=1, batch_size=1, verbose=2)

# prepare the inputs for the validation period and predict the closing prices
inputs_data = new_dataset[len(new_dataset) - len(valid_data) - 60:].values
inputs_data = inputs_data.reshape(-1, 1)
inputs_data = scaler.transform(inputs_data)
X_test = []
for i in range(60, inputs_data.shape[0]):
    X_test.append(inputs_data[i-60:i, 0])
X_test = np.array(X_test)
X_test = np.reshape(X_test, (X_test.shape[0], X_test.shape[1], 1))
predicted_closing_price = lstm_model.predict(X_test)
predicted_closing_price = scaler.inverse_transform(predicted_closing_price)

lstm_model.save("saved_model.h5")
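A short optional sketch, assuming the variables above, to compare the predicted closing prices with the actual ones over the validation period:

# plot actual vs. predicted closing prices for the validation rows (index 987 onwards)
valid = new_dataset[987:].copy()
valid['Predictions'] = predicted_closing_price.flatten()
plt.figure(figsize=(16, 8))
plt.plot(new_dataset['Close'][:987], label='Training history')
plt.plot(valid['Close'], label='Actual close')
plt.plot(valid['Predictions'], label='Predicted close')
plt.legend()
plt.show()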

Output:
Result:
Thus, the machine learning solution to a socially relevant problem was implemented successfully using a Python program.

Ex.No:11 CONTENT BEYOND SYLLABUS

SENTIMENT ANALYSIS USING LEXICON CLASSIFICATION ALGORITHM

Aim:
To write a Python program to implement sentiment analysis using the Lexicon Classification Algorithm

Algorithm:
Step 1: Start the program

Step 2: Import the required modules

Step 3: Load the dataset in .csv format

Step 4: Perform basic preprocessing on the loaded dataset to handle missing values

Step 5: Implement the Lexicon Classification Algorithm for sentiment analysis

Step 6: Print the obtained results

Step 7: Stop the program

Program
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
import re
%matplotlib inline
import warnings
warnings.filterwarnings("ignore")

import os
print(os.listdir("../input"))

pd.set_option('display.max_columns', None)

US_comments = pd.read_csv('../input/youtube/UScomments.csv', error_bad_lines=False)
US_videos = pd.read_csv('../input/youtube/USvideos.csv', error_bad_lines=False)

US_videos.head()
US_videos.shape
US_videos.nunique()
US_videos.info()
US_videos.head()

US_comments.head()
US_comments.shape
US_comments.isnull().sum()
US_comments.dropna(inplace=True)
US_comments.isnull().sum()
US_comments.shape
US_comments.nunique()
US_comments.info()

US_comments.drop(41587, inplace=True)
US_comments = US_comments.reset_index().drop('index', axis=1)
US_comments.likes = US_comments.likes.astype(int)
US_comments.replies = US_comments.replies.astype(int)
US_comments.head()

# basic cleaning: keep only alphabetic words longer than 3 characters, in lower case
US_comments['comment_text'] = US_comments['comment_text'].str.replace("[^a-zA-Z#]", " ", regex=True)
US_comments['comment_text'] = US_comments['comment_text'].apply(lambda x: ' '.join([w for w in x.split() if len(w) > 3]))
US_comments['comment_text'] = US_comments['comment_text'].apply(lambda x: x.lower())

tokenized_tweet = US_comments['comment_text'].apply(lambda x: x.split())
tokenized_tweet.head()

from nltk.stem import WordNetLemmatizer
from nltk.corpus import stopwords
wnl = WordNetLemmatizer()
tokenized_tweet.apply(lambda x: [wnl.lemmatize(i) for i in x if i not in set(stopwords.words('english'))])
tokenized_tweet.head()

US_comments['comment_text'] = tokenized_tweet

# lexicon-based sentiment scoring with the VADER analyzer
import nltk
nltk.download('vader_lexicon')
from nltk.sentiment.vader import SentimentIntensityAnalyzer
sia = SentimentIntensityAnalyzer()

US_comments['Sentiment Scores'] = US_comments['comment_text'].apply(lambda x: sia.polarity_scores(x)['compound'])
US_comments.head()

US_comments['Sentiment'] = US_comments['Sentiment Scores'].apply(lambda s: 'Positive' if s > 0 else ('Neutral' if s == 0 else 'Negative'))
US_comments.head()
US_comments.Sentiment.value_counts()

# percentage of positive comments per video
videos = []
for i in range(0, US_comments.video_id.nunique()):
    a = US_comments[(US_comments.video_id == US_comments.video_id.unique()[i]) & (US_comments.Sentiment == 'Positive')].count()[0]
    b = US_comments[US_comments.video_id == US_comments.video_id.unique()[i]]['Sentiment'].value_counts().sum()
    Percentage = (a / b) * 100
    videos.append(round(Percentage, 2))

Positivity = pd.DataFrame(videos, US_comments.video_id.unique()).reset_index()
Positivity.columns = ['video_id', 'Positive Percentage']
Positivity.head()

# attach the channel title of each video
channels = []
for i in range(0, Positivity.video_id.nunique()):
    channels.append(US_videos[US_videos.video_id == Positivity.video_id.unique()[i]]['channel_title'].unique()[0])
Positivity['Channel'] = channels
Positivity.head()

Positivity[Positivity['Positive Percentage'] == Positivity['Positive Percentage'].max()]
Positivity[Positivity['Positive Percentage'] == Positivity['Positive Percentage'].min()]

# word clouds for all, positive, negative and neutral comments
from wordcloud import WordCloud

all_words = ' '.join([text for text in US_comments['comment_text']])
wordcloud = WordCloud(width=800, height=500, random_state=21, max_font_size=110).generate(all_words)
plt.figure(figsize=(10, 7))
plt.imshow(wordcloud, interpolation="bilinear")
plt.axis('off')
plt.show()

all_words_posi = ' '.join([text for text in US_comments['comment_text'][US_comments.Sentiment == 'Positive']])
wordcloud_posi = WordCloud(width=800, height=500, random_state=21, max_font_size=110).generate(all_words_posi)
plt.figure(figsize=(10, 7))
plt.imshow(wordcloud_posi, interpolation="bilinear")
plt.axis('off')
plt.show()

all_words_nega = ' '.join([text for text in US_comments['comment_text'][US_comments.Sentiment == 'Negative']])
wordcloud_nega = WordCloud(width=800, height=500, random_state=21, max_font_size=110).generate(all_words_nega)
plt.figure(figsize=(10, 7))
plt.imshow(wordcloud_nega, interpolation="bilinear")
plt.axis('off')
plt.show()

all_words_neu = ' '.join([text for text in US_comments['comment_text'][US_comments.Sentiment == 'Neutral']])
wordcloud_neu = WordCloud(width=800, height=500, random_state=21, max_font_size=110).generate(all_words_neu)
plt.figure(figsize=(10, 7))
plt.imshow(wordcloud_neu, interpolation="bilinear")
plt.axis('off')
plt.show()

Output
['youtube']
(7992, 11)
video_id 2364
title 2398
channel_title 1230
category_id 16
tags 2204
views 7939
likes 6624
dislikes 2531
comment_total 4152
thumbnail_link 2364
date 40
dtype: int64
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 7992 entries, 0 to 7991
Data columns (total 11 columns):
video_id 7992 non-null object
title 7992 non-null object
channel_title 7992 non-null object
category_id 7992 non-null int64
tags 7992 non-null object
views 7992 non-null int64
likes 7992 non-null int64
dislikes 7992 non-null int64
comment_total 7992 non-null int64
thumbnail_link 7992 non-null object

date 7992 non-null float64
dtypes: float64(1), int64(5), object(5)
memory usage: 686.9+ KB
(691400, 4)
video_id 0
comment_text 25
likes 0
replies 0
dtype: int64
video_id 0
comment_text 0
likes 0
replies 0
dtype: int64
(691375, 4)
video_id 2266
comment_text 434076
likes 1284
replies 479
dtype: int64
<class 'pandas.core.frame.DataFrame'>
Int64Index: 691375 entries, 0 to 691399
Data columns (total 4 columns):
video_id 691375 non-null object
comment_text 691375 non-null object
likes 691375 non-null object
replies 691375 non-null object
dtypes: object(4)
memory usage: 26.4+ MB

0 [logan, paul]
1 [been, following, from, start, your, vine, cha...
2 [kong, maverick]
3 [attendance]
4 [trending]
Name: comment_text, dtype: object
0 [logan, paul]
1 [been, following, from, start, your, vine, cha...
2 [kong, maverick]
3 [attendance]
4 [trending]
Name: comment_text, dtype: object
Positive 305358
Neutral 260986
Negative 125030
Name: Sentiment, dtype: int64

POSITIVE COMMENTS

NEGATIVE COMMENTS
NEUTRAL COMMENTS

Result:
Thus, the sentiment analysis using the Lexicon Classification Algorithm has been implemented successfully.
