10 III March 2022

https://doi.org/10.22214/ijraset.2022.41045
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue III Mar 2022- Available at www.ijraset.com

Big Data Analytics to Predict Breast Cancer

Hardi Patel¹, Dr. Mehul P. Barot²
¹Research Scholar, ²Associate Professor, Department of Computer Engineering, LDRP ITR, KSV

Abstract: Breast cancer is the second leading cause of death among women. Early prediction of breast cancer improves the
chances of survival of breast cancer patients. Machine learning and data mining have been widely used in the prediction and
early detection of breast cancer. This paper compares the machine learning techniques that are used for the prediction of
breast cancer.
Keywords: Breast Cancer, Malignant, Benign, Machine Learning, Big Data Analytics.

I. INTRODUCTION
Worldwide, breast cancer is the most common cancer in women and one of the most dangerous. According to a 2020 WHO report,
it is estimated that over 685,000 women worldwide died of breast cancer.
Data mining and machine learning have been widely used in the diagnosis of breast cancer. They help medical researchers
identify relationships between different variables and predict the outcome of the disease from datasets. Machine learning
can be applied to improve breast cancer detection and can also support accurate decision making. Therefore, the aim of this
research is to analyse the data mining and machine learning techniques used in breast cancer detection. The paper is
organized as follows. Section II introduces breast cancer. Section III explains the algorithms and tools of data mining and
machine learning that are used to predict breast cancer. Section IV describes the breast cancer dataset. Section V presents
the literature survey. Section VI explains the proposed architecture for comparing the accuracy of different algorithms.
Finally, Section VII concludes the survey.

II. BREAST CANCER

Normally, cells in the body divide (reproduce) only when new cells are needed. Sometimes, cells grow and divide out of
control, which creates a mass of tissue called a tumor. If the cells that are growing out of control are normal cells, the
tumor is called benign. If, however, the cells that are growing out of control are abnormal and do not function like the
body's normal cells, the tumor is called malignant.
Cancers are named after the body part in which they originate. Cancer that originates in the breast tissue is called breast
cancer. Like other cancers, breast cancer can grow into the tissue surrounding the breast. It can also travel from the
breast to other parts of the body and create new tumors, a process called metastasis [2].

A. Types of Tumors
Tumors can be benign or malignant.
1) Benign: Benign tumors stay in their primary location without invading other parts of the body. They do not spread to
distant parts of the body. Benign tumors usually grow slowly and have distinct borders [4]. They are usually not
problematic. However, they can become large and compress nearby structures, causing pain or other medical complications.
For example, a large benign lung tumor could cause difficulty in breathing, which would call for urgent surgery to remove
the tumor. Benign tumors are unlikely to recur once removed. Two common benign tumors are fibroids in the uterus and
lipomas in the skin. Some benign tumors can turn into malignant tumors. These tumors are monitored closely and may require
surgical removal. For example, colon polyps can become malignant and are therefore usually removed surgically [4].
2) Malignant: Malignant tumors have cells that grow uncontrollably and spread to other parts of the body. These tumors are
cancerous. They spread to other parts of the body through the bloodstream or the lymphatic system. This spread is called
metastasis. Metastasis can occur anywhere in the body and is most often found in the liver, lungs, breast, brain, and
bone [4]. Malignant tumors can spread quickly and require surgery or other treatment to avoid further spread. If a
malignant tumor is found early, further spread can be prevented by treatment such as chemotherapy or radiotherapy. If the
cancer has spread, the treatment is likely to be systemic, such as chemotherapy or immunotherapy.

©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 2004

B. Symptoms of Breast Cancer

Different people have different symptoms of breast cancer. Some people do not have any symptoms at all [8].
Some of the symptoms are as follows:
1) A new lump in the breast or underarm.
2) Thickening or swelling of part of the breast.
3) Irritation or dimpling of the breast skin.
4) Redness or flaky skin in the nipple area or the breast.
5) Pulling in of the nipple or pain in the nipple area.
6) Nipple discharge other than breast milk, including blood.
7) A change in the size or the shape of the breast.
8) Pain in any area of the breast.

C. Stages of Breast Cancer

The tumor (T) category used in breast cancer staging describes the size and extent of the primary tumor and ranges from T0
to T4 [3].
1) T0: There is no evidence of cancer in the breast [3].
2) T1: The tumor in the breast is 20 millimetres (mm) or smaller at its widest point. This is a little less than an inch.
This category is broken into 4 subcategories depending on the size of the tumor [3]:
a) T1mi is a tumor that is 1 mm or smaller.
b) T1a is a tumor that is larger than 1 mm but 5 mm or smaller.
c) T1b is a tumor that is larger than 5 mm but 10 mm or smaller.
d) T1c is a tumor that is larger than 10 mm but 20 mm or smaller.
3) T2: The tumor is larger than 20 mm but not larger than 50 mm.
4) T3: The tumor is larger than 50 mm.
5) T4: The tumor falls into 1 of the following groups:
a) T4a means the tumor has grown into the chest wall.
b) T4b means the tumor has grown into the skin.
c) T4c is cancer that has grown into the chest wall and the skin.
d) T4d is inflammatory breast cancer.
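
The size thresholds listed above can be sketched as a small lookup function. This is only an illustration of the size-based categories: the function name `t_category` is hypothetical, and T4 is deliberately not returned because it depends on growth into the chest wall or skin rather than on size.

```python
def t_category(size_mm: float) -> str:
    """Map the widest tumor diameter (in mm) to a size-based T category.

    Only the size thresholds from the list above are encoded. T4 is never
    returned, because it is defined by invasion of the chest wall or skin.
    """
    if size_mm < 0:
        raise ValueError("tumor size cannot be negative")
    if size_mm == 0:
        return "T0"  # no evidence of a tumor
    if size_mm <= 1:
        return "T1mi"
    if size_mm <= 5:
        return "T1a"
    if size_mm <= 10:
        return "T1b"
    if size_mm <= 20:
        return "T1c"
    if size_mm <= 50:
        return "T2"
    return "T3"


for size in (0.5, 4, 10, 20, 35, 60):
    print(size, "->", t_category(size))
```

Each boundary value belongs to the smaller category, matching the "larger than X but Y or smaller" wording of the list.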

III. BIG DATA ANALYTICS AND MACHINE LEARNING

1) Big Data
Big data analytics is the use of advanced analytic techniques against large, diverse data sets that include structured,
semi-structured and unstructured data, from different sources, and in different sizes [5].
Big data is a term applied to datasets whose size or type is beyond the ability of relational databases to capture, manage
and process the data with low latency. Big data has the following characteristics: high volume, high velocity, high
variety, veracity, and value.
In healthcare, applications of big data analytics can improve patient-based services, detect diseases earlier, generate new
insights into disease mechanisms, monitor the quality of medical and healthcare institutions, and provide better methods
of treatment [6].

2) Machine Learning
Machine learning is the study of programs that learn from experience to improve their performance without explicit human
instruction.
There are two types of learning:
a) Supervised Learning
b) Unsupervised Learning

A. Data Mining Algorithms

There are many algorithms, such as Naïve Bayes, K-Nearest Neighbor, k-means, and Random Forest. They are used for analysing
large amounts of data.

Some popular Data Mining Algorithms are discussed as follows:

1) Naïve Bayes: It is a probabilistic classifier [10] and one of the efficient classification algorithms. It is based on
applying Bayes' theorem with strong (naïve) independence assumptions: it assumes that the value of a feature is
independent of the value of any other feature, given the class variable. It assigns a given tuple to the class with the
maximum probability.
2) K-Nearest Neighbor: The KNN algorithm is also called instance-based learning. KNN is one of the simplest approaches for
classifying samples, and different distance measures can be used. KNN finds the K samples in the training data that are
nearest to the test sample and assigns the most frequent class label among them [14]. In this algorithm, the training
samples themselves serve as the classification rules, without any extra information. It performs well when similar cases
belong to the same class [14]. The value of K is always a positive integer.
3) Support Vector Machine: The Support Vector Machine (SVM) was developed in the 1990s. It is a simple and prominent method
for machine learning tasks. In this technique, a set of training samples is given, and each sample is marked as belonging
to one of the categories. SVM is mainly used for classification and regression problems.
4) Decision Tree Algorithm (J48): Decision tree algorithms are successful machine learning classification techniques. They
are supervised learning methods that use information gain and pruning to improve results. Decision tree algorithms are
commonly used for classification in many research areas, for example in medicine and health care. There are many types
of decision tree algorithms, such as ID3 and C4.5. J48 is a popular and widely used decision tree algorithm. It is an
implementation of an improved version of C4.5, which is itself an extension of ID3.
5) Random Forest: A random forest is a machine learning technique used to solve regression and classification problems. It
uses ensemble learning, a technique that combines many classifiers to provide solutions to complex problems. A random
forest contains many decision trees. The 'forest' is trained through bagging, or bootstrap aggregating. The algorithm
determines the outcome from the predictions of the individual trees: for regression it averages the outputs of the
trees, and for classification it takes a majority vote. Increasing the number of trees generally improves the precision
of the outcome. A random forest mitigates the limitations of a single decision tree: it reduces overfitting and
increases precision. It also generates predictions with little configuration in software packages.
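
As an illustration of the K-Nearest Neighbor idea described above, the following is a minimal sketch in plain Python. It assumes Euclidean distance and a simple majority vote; the function name `knn_predict` and the toy samples are hypothetical and are not taken from the paper's experiments, which were run in WEKA.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training samples.

    `train` is a list of (feature_vector, label) pairs; distance is Euclidean.
    """
    if k <= 0 or k > len(train):
        raise ValueError("k must be between 1 and the number of training samples")
    # Sort the training samples by distance to the query and keep the k closest.
    nearest = sorted(train, key=lambda pair: math.dist(pair[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy data (hypothetical): two features on a 1-10 scale.
train = [
    ((1, 1), "benign"),
    ((2, 1), "benign"),
    ((1, 2), "benign"),
    ((8, 9), "malignant"),
    ((9, 8), "malignant"),
    ((10, 10), "malignant"),
]

print(knn_predict(train, (2, 2), k=3))  # benign
print(knn_predict(train, (9, 9), k=3))  # malignant
```

There is no training step: the stored samples are the model, which is why the method is called instance-based. An odd K avoids tied votes in a two-class problem.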

B. Data Mining Tools

Data mining tools provide ready-to-use implementations of the mining algorithms. Most of them are free, open-source software.
Some of the popular data mining tools are discussed below:
1) WEKA: WEKA is a collection of machine learning algorithms and data pre-processing tools. WEKA stands for Waikato
Environment for Knowledge Analysis. It was developed at the University of Waikato (New Zealand). The program is written
in Java and runs on almost any operating system. It is free data mining software. WEKA supports preparing, visualizing,
and evaluating the input data. It supports different machine learning tasks such as classification, clustering, and
regression.
2) Tanagra: Tanagra is free machine learning software for research and academic purposes. It was developed by Ricco
Rakotomalala at the Lumière University, France. Tanagra supports different types of data mining tasks such as
visualization, descriptive statistics, regression, clustering, classification, and association rule learning.
3) Orange: Orange is a Python-based tool for machine learning and data mining. Its visual programming interface is clean and
easily understood. Orange may be better suited to novice researchers and small projects [7].
4) MATLAB: As a data mining tool, MATLAB offers an interpreted language and graphical user interfaces. It also has hundreds
of mathematical functions that support multi-paradigm numerical computation, which makes it a suitable computing
environment.

IV. BREAST CANCER DATASET

For the prediction of breast cancer, we used the Breast Cancer Wisconsin (Original) dataset. The dataset includes 699
instances and 11 attributes, including the class label. Of these, 458 instances belong to the benign class and the other
241 instances belong to the malignant class.

Data Description

Breast Cancer Dataset [9]

1) Sample code number: the id number of the sample.
2) Clump thickness: indicates whether the sample contains single-layered or multi-layered cells.
3) Uniformity of cell size: indicates how consistent the size of the cells in the sample is.
4) Uniformity of cell shape: indicates how consistent the cell shapes are and identifies marginal differences.
5) Marginal adhesion: estimates how many cells on the outside of the epithelium stick together.
6) Single epithelial cell size: identifies epithelial cells that are significantly enlarged and describes the uniformity of
the cells.
7) Bare nuclei: estimates the proportion of nuclei that are not surrounded by cytoplasm.
8) Bland chromatin: rates the texture of the nucleus from fine to coarse.
9) Normal nucleoli: indicates whether the nucleoli are small and barely visible or large and clearly visible.
10) Mitoses: describes the level of mitotic activity.
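
To make the attribute list above concrete, here is a hedged sketch of parsing one record of the dataset. It assumes the comma-separated layout in which the UCI repository distributes the Wisconsin (original) data: an id, nine integer features, and a class code of 2 for benign or 4 for malignant, with missing values written as "?". The field names in `FEATURES`, the function `parse_record`, and the two sample lines are illustrative and are not taken from the paper.

```python
FEATURES = [
    "clump_thickness", "cell_size_uniformity", "cell_shape_uniformity",
    "marginal_adhesion", "single_epithelial_cell_size", "bare_nuclei",
    "bland_chromatin", "normal_nucleoli", "mitoses",
]
# Assumed class coding of the UCI file: 2 = benign, 4 = malignant.
CLASS_NAMES = {2: "benign", 4: "malignant"}


def parse_record(line: str) -> dict:
    """Parse one comma-separated record: id, nine features, class code."""
    fields = line.strip().split(",")
    if len(fields) != 11:
        raise ValueError(f"expected 11 fields, got {len(fields)}")
    sample_id, *values, class_code = fields
    record = {"id": sample_id}
    for name, value in zip(FEATURES, values):
        # Missing values appear as "?" and are kept as None.
        record[name] = None if value == "?" else int(value)
    record["label"] = CLASS_NAMES[int(class_code)]
    return record


# Illustrative records in the assumed layout.
print(parse_record("1000025,5,1,1,1,2,1,3,1,1,2"))
print(parse_record("1057013,8,4,5,1,2,?,7,3,1,4"))
```

Keeping missing values as `None` leaves the choice between dropping and imputing them to a later preprocessing step.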

V. LITERATURE SURVEY
A. Mining Big Data: Breast Cancer Prediction using DT-SVM Hybrid Model
In this paper, K. Sivakami uses a hybrid of Decision Tree and Support Vector Machine (DT-SVM) methods to predict disease
status. The experiment was performed with the Weka tool. The author used the Wisconsin breast cancer dataset, which
includes 699 instances: 458 instances belong to the not cancer (benign) class and the other 241 instances belong to the
cancer (malignant) class. Finally, the author compared the output of the DT-SVM model with Naive Bayes, instance-based
learning (IBK), and sequential minimal optimization (SMO) and concluded that DT-SVM gives better accuracy, i.e., 91%,
compared to NB, IBK, and SMO.

B. Big Data Analytics to Predict Breast Cancer Recurrence on SEER Dataset using MapReduce Approach
In this paper, D.R. Umesh and B. Ramachandra [1] use the Expectation Maximization (EM) algorithm to identify breast cancer
recurrence. To measure the classification accuracy they use the SEER dataset, which contains 220,811 instances with 17
attributes. The authors ran their experiment in the Amazon cloud computing environment (EC2) and report that the
expectation maximization algorithm gives an accuracy of 88.54%.

C. Breast Cancer Diagnosis and Prediction Using Machine Learning and Data Mining Techniques: A Review
In this paper, Hiba Asri et al. [7] compare the efficiency and effectiveness of algorithms such as Support Vector Machine
(SVM), K-Nearest Neighbor (K-NN), Decision Tree (C4.5), and Naive Bayes (NB). They use the Wisconsin breast cancer
(original) dataset from the UCI machine learning repository, which contains 699 instances with 11 attributes. The
experiment is performed with the WEKA tool, and the results show that SVM gives a higher accuracy of 97.13% compared with
K-NN and C4.5, at 95.27% and 95.13% respectively.

D. Prediction of Breast Cancer using Big Data Analytics

In this paper, K. Shailaja et al. [12] use the KNN algorithm to classify a tumor as either benign or malignant. The
approach is evaluated and compared using the Wisconsin Breast Cancer dataset. The authors apply feature selection to the
dataset to remove duplicate and irrelevant features. The experimental results show that accuracy, precision, recall and
F-measure are higher for the proposed method than for the other models. Accuracy is 96.6% before feature selection and
98.14% after feature selection.

E. Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis
In this paper, Hiba Asri et al. [11] employ four main algorithms, SVM, Naïve Bayes, KNN, and C4.5, on the Wisconsin Breast
Cancer (original) dataset. The authors compare the efficiency and effectiveness of these algorithms in terms of accuracy,
precision, sensitivity, and specificity to find the best classifier. SVM reaches the highest accuracy, 97.13%. In
conclusion, SVM proves its efficiency in breast cancer prediction and diagnosis and achieves the best performance in terms
of precision and low error rate.

F. Early Diagnosis of Breast Cancer Prediction using Random Forest Classifier

In this paper, P. R. Anisha et al. [16] compare six machine learning algorithms for predicting and diagnosing breast
cancer: Logistic Regression, Decision Tree, K-Nearest Neighbor, Naïve Bayes, Support Vector Classifier, and Random Forest
Classifier. The Random Forest classifier achieved the highest accuracy, 98%.

G. Performance Analysis of Different Classifiers in Prediction of Breast Cancer

In this paper, S. Roobini et al. [14] analyse the performance of different classifiers in the prediction of breast cancer.
In this research, 10-fold cross-validation is used to validate the results. The dataset is randomly divided into ten equal
subsets. One of the partitions acts as the testing set, whereas the remaining partitions act as the training set used to
train the model. A comparative study of the performance of the existing and proposed classification models is discussed
based on accuracy, error rate, F-measure, precision, and recall. Accuracy measures how well the given tuples are classified
correctly. TP refers to positive tuples and TN refers to negative tuples that are correctly classified by the classifiers.
Similarly, FP refers to tuples incorrectly classified as positive and FN refers to tuples incorrectly classified as
negative.
Fuzzy C-Means clustering (FCM) combined with a Naive Bayesian classifier provides better prediction than the other
classifiers.
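
The 10-fold cross-validation procedure described above can be sketched as follows. This is a generic illustration, not the surveyed authors' code: the function name `k_fold_splits` and the fixed seed are assumptions, and because 699 is not divisible by 10 the folds here are only nearly equal in size.

```python
import random


def k_fold_splits(n_samples, k=10, seed=0):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation.

    Indices are shuffled once and dealt into k folds whose sizes differ by at
    most one; each fold serves as the test set exactly once.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# 699 samples, as in the Wisconsin (original) dataset.
for fold_no, (train, test) in enumerate(k_fold_splits(699, k=10), start=1):
    print(fold_no, len(train), len(test))
```

A model would be trained on `train` and scored on `test` in each round, and the ten scores averaged to give the reported figure.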

VI. PROPOSED ARCHITECTURE

To compare the efficiency of different algorithms, we construct the confusion matrix for each of Naïve Bayes, SVM (Support
Vector Machine), KNN and Random Forest.

A. Confusion Matrix

Algorithm        Class        Benign    Malignant    Accuracy

Naïve Bayes      Benign       436       22           95.99%
                 Malignant    6         235
SVM              Benign       445       13           96.71%
                 Malignant    10        231
KNN              Benign       445       13           97.6%
                 Malignant    20        221
Random Forest    Benign       443       15           96.85%
                 Malignant    7         234
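
The accuracy column can be recomputed from the counts in the confusion matrix above. The sketch below assumes that each row is the actual class and each count column is the class assigned by the classifier, which is consistent with the row totals of 458 benign and 241 malignant instances. The function names are hypothetical. With the counts as printed, the formula reproduces the reported accuracy for Naïve Bayes, SVM and Random Forest, while the KNN counts give about 95.28% rather than the reported 97.6%; the sketch does not try to resolve that difference.

```python
# Counts copied from the confusion matrix above, in the order:
# (benign as benign, benign as malignant, malignant as benign, malignant as malignant)
MATRICES = {
    "Naive Bayes": (436, 22, 6, 235),
    "SVM": (445, 13, 10, 231),
    "KNN": (445, 13, 20, 221),
    "Random Forest": (443, 15, 7, 234),
}


def accuracy(bb, bm, mb, mm):
    """Fraction of samples on the diagonal of a 2x2 confusion matrix."""
    return (bb + mm) / (bb + bm + mb + mm)


def malignant_recall(bb, bm, mb, mm):
    """Share of malignant samples that were classified as malignant."""
    return mm / (mb + mm)


for name, counts in MATRICES.items():
    print(f"{name}: accuracy={accuracy(*counts):.2%}, "
          f"malignant recall={malignant_recall(*counts):.2%}")
```

Recall on the malignant class is shown alongside accuracy because a missed malignant case is usually the more costly error in a screening setting.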

VII. CONCLUSION
In this paper, we compared different types of machine learning algorithms to find the most accurate one for classifying
the breast cancer dataset into two classes, benign and malignant. We ran these algorithms in the WEKA tool. The experiment
shows that the algorithms differ in accuracy. KNN achieved the highest accuracy, 97.6%.

REFERENCES
[1] D.R. Umesh et al., “Big Data Analytics to Predict Breast Cancer Recurrence on SEER Dataset using MapReduce Approach”, International Journal of Computer Applications, volume 7, 2016.
[2] https://my.clevelandclinic.org/health/diseases/3986-breast-cancer
[3] https://www.cancer.net/cancer-types/breast-cancer/stages
[4] https://jamanetwork.com/journals/jamaoncology/fullarticle/2768634
[5] https://www.ibm.com/in-en/analytics/hadoop/big-data-analytics
[6] https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6340124/
[7] Saria Eltalhi, “Breast Cancer Diagnosis and Prediction Using Machine Learning and Data Mining Techniques: A Review”, IOSR Journal of Dental and Medical Sciences (IOSR JDMS), vol. 18, no. 04, 2019, pp. 85-94.
[8] https://www.cdc.gov/cancer/breast/basic_info/symptoms.htm
[9] https://www.researchgate.net/figure/Breast-cancer-dataset_tbl1_323952426
[10] G. Sumalatha et al., “A Study on Early Prevention and Detection of Breast Cancer using Data Mining Techniques”, International Journal of Innovative Research in Computer and Communication Engineering, volume 5, 2017.
[11] Hiba Asri, “Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis”, The 6th International Symposium on Frontiers in Ambient and Mobile Systems, pp. 1064-1069.
[12] K. Shailaja, “Prediction of Breast Cancer Using Big Data Analytic”, International Journal of Engineering & Technology, volume 7, 2018.
[13] Eltalhi, Saria & Kutrani, Huda. (2019). Breast Cancer Diagnosis and Prediction using Machine Learning and Data Mining Techniques: A Review. IOSR Journal of Dental and Medical Sciences. 18. 85-94.
[14] S. Roobini and J. Fenila Naomi, “Performance Analysis of Different Classifier in Prediction of Breast Cancer”, International Journal of Science and Technology, volume 12(8), 2019.
[15] Emanelwerfally, & Kutrani, Huda & Eltalhi, Saria & Ashleik, Naeima. (2021). Predicting Breast Cancer Treatment Using Decision Tree Algorithms and Statistical Metrics. IOSR Journal of Dental and Medical Sciences. 20. 48-54.
[16] V. Sivakumar et al., “Feasibility Study on Data Mining Techniques in Diagnosis of Breast Cancer”, International Journal of Machine Learning and Computing, volume 9, 2019.

No ratings yet
T Categories For Breast Cancer
3 pages
Nursing Care Plan 106
No ratings yet
Nursing Care Plan 106
25 pages
Breast Cancer Surgery & Treatment
100% (1)
Breast Cancer Surgery & Treatment
38 pages
Zheng 2021
No ratings yet
Zheng 2021
12 pages
Cancer Detection Using Machine Learning
No ratings yet
Cancer Detection Using Machine Learning
8 pages
NICE RG Submission
No ratings yet
NICE RG Submission
8 pages
CTR Conference Booklet
No ratings yet
CTR Conference Booklet
150 pages
IDOR 2012 OncologyImaging Lowres
No ratings yet
IDOR 2012 OncologyImaging Lowres
40 pages
Breast Cancer Thesis
100% (1)
Breast Cancer Thesis
4 pages
International Manual of Oncology Practice iMOP Principles of Medical Oncology 1st Edition Ramon Andrade de Mello
100% (1)
International Manual of Oncology Practice iMOP Principles of Medical Oncology 1st Edition Ramon Andrade de Mello
59 pages
HormonesBalance Superfoods Powerherbs Guides FINAL
100% (5)
HormonesBalance Superfoods Powerherbs Guides FINAL
28 pages
The Obesity Breast Cancer Link A Multidisciplinary Perspective Rev Cancer Mets 2022
No ratings yet
The Obesity Breast Cancer Link A Multidisciplinary Perspective Rev Cancer Mets 2022
19 pages
4 5989917285429543644 PDF
No ratings yet
4 5989917285429543644 PDF
14 pages
Senior Director Oncology Marketing in USA Resume Mark Matthews
No ratings yet
Senior Director Oncology Marketing in USA Resume Mark Matthews
2 pages
Improving Outcomes For Breast Cancer Survivors
100% (1)
Improving Outcomes For Breast Cancer Survivors
280 pages
Dr. Manoj Pandey - Oncology BHU
No ratings yet
Dr. Manoj Pandey - Oncology BHU
65 pages
University Physics For The Physical and Life Sciences Volume 2 Philip R. Kesten Philip R. Kesten &amp David L. Tauck
No ratings yet
University Physics For The Physical and Life Sciences Volume 2 Philip R. Kesten Philip R. Kesten &amp David L. Tauck
34 pages
Surgery Review Manual
No ratings yet
Surgery Review Manual
158 pages
Lesson F A
No ratings yet
Lesson F A
5 pages


II. BREAST CANCER

B. Symptoms of Breast Cancer

C. Stages of Breast Cancer

III. BIG DATA ANALYTICS AND MACHINE LEARNING

A. Data Mining Algorithms

Some popular Data Mining Algorithms are discussed as follows:

B. Data Mining Tools

IV. BREAST CANCER DATASET

Breast Cancer Dataset [9]

1) Sample code number indicates id number.

D. Prediction of Breast Cancer using Big Data Analytics

F. Early Diagnosis of Breast Cancer Prediction using Random Forest Classifier

G. Performance Analysis of Different Classifiers in Prediction of Breast Cancer

VI. PROPOSED ARCHITECTURE

Algorithm Benign Malignant Class Accuracy