
CHAPTER 4

DEVELOPMENT PROCESS
4.1. REQUIREMENT ANALYSIS
Requirements are features of a system, or descriptions of what the system must be capable of doing in order to fulfil its purpose. Requirement analysis provides the mechanism for understanding what the customer wants, analyzing the needs, assessing feasibility, negotiating a reasonable solution, specifying the solution unambiguously, validating the specification, and managing the requirements as they are translated into an operational system.

4.1.1. PYTHON:

Python is a dynamic, high-level, free, open-source, interpreted programming language. It supports object-oriented programming as well as procedural programming. In Python, we do not need to declare the type of a variable, because it is a dynamically typed language.

For example, after x = 10, the name x can later hold anything, such as a string, an int, etc.
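
A short illustrative snippet of this behaviour (the variable name x is just an example):

# Dynamic typing: the same name can be rebound to values of different types.
x = 10            # x currently refers to an int
print(type(x))    # <class 'int'>

x = "hello"       # x now refers to a str; no type declaration is needed
print(type(x))    # <class 'str'>
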
Python is an interpreted, object-oriented programming language, similar to Perl, that has gained popularity because of its clear syntax and readability. Python is considered relatively easy to learn and portable, meaning its statements can be interpreted on a number of operating systems, including UNIX-based systems, Mac OS, MS-DOS, OS/2, and various versions of Microsoft Windows. Python was created by Guido van Rossum, a former resident of the Netherlands, whose favourite comedy group at the time was Monty Python's Flying Circus. The source code is freely available and open for modification and reuse. Python has a significant number of users.

Features in Python

There are many features in Python, some of which are discussed below:
• Easy to code
• Free and open source
• Object-oriented language
• GUI programming support
• High-level language
• Extensible
• Portable language
• Integrated language
• Interpreted language

4.2. ANACONDA

The Anaconda distribution comes with over 250 packages automatically installed, and over 7,500 additional open-source packages can be installed from PyPI, as well as with the conda package and virtual environment manager. It also includes a GUI, Anaconda Navigator, as a graphical alternative to the command-line interface (CLI).

The big difference between conda and the pip package manager is in how package
dependencies are managed, which is a significant challenge for Python data science and the
reason conda exists.

When pip installs a package, it automatically installs any dependent Python packages without checking whether these conflict with previously installed packages. It will install a package and any of its dependencies regardless of the state of the existing installation. Because of this, a user with a working installation of, for example, Google TensorFlow can find that it stops working after using pip to install a different package that requires a different version of the dependent NumPy library than the one used by TensorFlow. In some cases, the package may appear to work but produce subtly different results.

In contrast, conda analyses the current environment, including everything currently installed, and, together with any version limitations specified (e.g. the user may wish to have TensorFlow version 2.0 or higher), works out how to install a compatible set of dependencies, and shows a warning if this cannot be done.

Open-source packages can be individually installed from the Anaconda repository, Anaconda Cloud (anaconda.org), or the user's own private repository or mirror, using the conda install command. Anaconda, Inc. compiles and builds the packages available in the Anaconda repository itself, and provides binaries for Windows 32/64-bit, Linux 64-bit and macOS 64-bit. Anything available on PyPI may be installed into a conda environment using pip, and conda will keep track of what it has installed itself and what pip has installed.

Custom packages can be made using the conda build command, and can be shared with others by uploading them to Anaconda Cloud, PyPI or other repositories.

The default installation of Anaconda2 includes Python 2.7 and Anaconda3 includes
Python 3.7. However, it is possible to create new environments that include any version of
Python packaged with conda.

4.2.1. ANACONDA NAVIGATOR

Anaconda Navigator is a desktop graphical user interface (GUI) included in the Anaconda distribution that allows users to launch applications and manage conda packages, environments and channels without using command-line commands. Navigator can search for packages on Anaconda Cloud or in a local Anaconda repository, install them into an environment, run them and update them. It is available for Windows, macOS and Linux.

The following applications are available by default in Navigator:

• JupyterLab
• Jupyter Notebook
• QtConsole
• Spyder
• Glue
• Orange
• RStudio
• Visual Studio Code

4.2.2. JUPYTER NOTEBOOK

Jupyter Notebook (formerly IPython Notebook) is a web-based interactive computational environment for creating Jupyter notebook documents. The term "notebook" can colloquially refer to several different entities, mainly the Jupyter web application, the Jupyter Python web server, or the Jupyter document format, depending on context. A Jupyter notebook document is a JSON document, following a versioned schema, containing an ordered list of input/output cells which can contain code, text (using Markdown), mathematics, plots and rich media, and usually has the ".ipynb" extension.

Jupyter Notebook can connect to many kernels to allow programming in different languages. By default, Jupyter Notebook ships with the IPython kernel. As of the 2.3 release (October 2014), there were 49 Jupyter-compatible kernels for many programming languages, including Python, R, Julia and Haskell.

The notebook interface was added to IPython in the 0.12 release (December 2011) and renamed to Jupyter Notebook in 2015 (IPython 4.0 – Jupyter 1.0). Jupyter Notebook is similar to the notebook interfaces of other programs such as Maple, Mathematica, and SageMath, a computational interface style that originated with Mathematica in the 1980s. According to The Atlantic, interest in Jupyter overtook the popularity of the Mathematica notebook interface in early 2018.

4.3. RESOURCE REQUIREMENTS

SOFTWARE REQUIREMENTS:

Operating System     : Windows 7 or later
Simulation Tool      : Anaconda (Jupyter Notebook)
Documentation        : MS Office

HARDWARE REQUIREMENTS:

CPU type             : Intel Pentium
RAM size             : 4 GB
Hard disk capacity   : 80 GB
Keyboard type        : Internet keyboard
Monitor type         : 15-inch colour monitor
CD drive type        : 52x max
4.4. SYSTEM ARCHITECTURE

Figure: System architecture. The collected dataset is pre-processed, segmented and classified; training and testing are performed with a CNN using dense layers in deep learning, leading to the prediction of a brain tumor.

4.4.1. USE CASE DIAGRAM

Figure: Use case diagram. The admin actor interacts with the Image Processing, Pre-Processing, Segmentation and Classification use cases.

4.5. PROPOSED SYSTEM

• Our proposed system uses the dense layer of a Convolutional Neural Network (CNN) algorithm, a deep learning technique, to train on the dataset. A minimal code sketch is given after this list.
• In the dense connectivity pattern, each layer obtains additional inputs from all preceding layers and passes on its own feature maps to all subsequent layers.
• The dense layer therefore uses features of all complexity levels and tends to give smoother decision boundaries.
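
The following is a minimal sketch of such a network in TensorFlow/Keras. It is not the project's exact configuration: the 128x128 grayscale input size, the layer widths and the binary tumor/no-tumor output are illustrative assumptions, and "dense layers" are read here as fully-connected layers placed after the convolutional feature extractor (the densely connected pattern described above, where feature maps are concatenated across layers, is the DenseNet variant of the same idea).

# A minimal, illustrative CNN whose head uses fully-connected (Dense) layers.
# Input size, layer widths and the binary output are assumptions, not the
# project's actual configuration.
import tensorflow as tf
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(128, 128, 1)),            # assumed grayscale MRI slice size
    layers.Conv2D(32, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),         # dense layer combining all extracted features
    layers.Dense(1, activation="sigmoid"),        # tumor probability
])

model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.summary()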

4.5.1. ADVANTAGES

• Easy detection of a brain tumor with the proposed technique.
• Reduced time consumption.
• The most accurate model helps in better and earlier treatment.
• Quick detection by the best model speeds up treatment, which can be life-saving.
SYSTEM MODULES:
• Module 1: Image Processing
• Module 2: Pre-Processing
• Module 3: Segmentation
• Module 4: Classification

Module 1: Dataset Collection and Pre-processing

A dataset (or data set) is a collection of data, usually presented in tabular form. Each column represents a particular variable, and each row corresponds to a given member of the dataset in question. It lists values for each of the variables, such as the height and weight of an object. Each value is known as a datum.

We have chosen to use a publicly available healthcare dataset which contains a relatively small number of inputs and cases. The data is arranged in such a way that those trained in medical disciplines can easily draw parallels between familiar statistical techniques and novel ML techniques. Additionally, the compact dataset enables short computational times on almost all modern computers.

The sklearn.preprocessing package provides several common utility functions and transformer classes to change raw feature vectors into a representation that is more suitable for the downstream estimators.

In general, learning algorithms benefit from standardization of the data set. If some outliers are present in the set, robust scalers or transformers are more appropriate. The behaviour of the different scalers, transformers and normalizers on a dataset containing marginal outliers is highlighted in the scikit-learn example "Compare the effect of different scalers on data with outliers".

Standardization, or Mean Removal and Variance Scaling

Standardization of datasets is a common requirement for many machine learning estimators implemented in scikit-learn; they might behave badly if the individual features do not more or less look like standard normally distributed data: Gaussian with zero mean and unit variance.

In practice we often ignore the shape of the distribution and just transform the data to center it by removing the mean value of each feature, then scale it by dividing non-constant features by their standard deviation.

For instance, many elements used in the objective function of a learning algorithm (such as the RBF kernel of support vector machines or the l1 and l2 regularizers of linear models) assume that all features are centered around zero and have variance of the same order. If a feature has a variance that is orders of magnitude larger than the others, it might dominate the objective function and make the estimator unable to learn from the other features correctly.
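
A minimal sketch of this standardization step with scikit-learn's StandardScaler, applied to a small made-up feature matrix rather than the project's dataset:

# Standardization: remove the mean and scale each feature to unit variance.
import numpy as np
from sklearn.preprocessing import StandardScaler

X = np.array([[1.0, 200.0],
              [2.0, 300.0],
              [3.0, 400.0]])

scaler = StandardScaler()
X_std = scaler.fit_transform(X)

print(X_std.mean(axis=0))   # approximately [0. 0.]
print(X_std.std(axis=0))    # approximately [1. 1.]
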
Scaling Features to a Range

An alternative standardization is scaling features to lie between a given minimum and maximum value, often between zero and one, or so that the maximum absolute value of each feature is scaled to unit size. This can be achieved using MinMaxScaler or MaxAbsScaler, respectively.

The motivations for using this scaling include robustness to very small standard deviations of features and the preservation of zero entries in sparse data.

MaxAbsScaler works in a very similar fashion, but scales the data so that the training set lies within the range [-1, 1], by dividing through the largest maximum absolute value of each feature. It is meant for data that is already centered at zero, or for sparse data.
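
A minimal sketch of both range scalers on a small made-up matrix (not the project's data):

# Range scaling: MinMaxScaler maps each feature to [0, 1];
# MaxAbsScaler maps each feature to [-1, 1] and preserves zero entries.
import numpy as np
from sklearn.preprocessing import MinMaxScaler, MaxAbsScaler

X = np.array([[1.0, -2.0],
              [2.0,  0.0],
              [4.0,  2.0]])

print(MinMaxScaler().fit_transform(X))
print(MaxAbsScaler().fit_transform(X))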

Normalization
Normalization is the process of scaling individual samples to have unit norm. This
process can be useful if you plan to use a quadratic form such as the dot-product or any other
kernel to quantify the similarity of any pair of samples.
This assumption is the basis of the Vector Space Model often used in text classification and clustering contexts.
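
A minimal sketch of per-sample L2 normalization with scikit-learn's Normalizer, again on made-up data:

# Normalization: rescale each sample (row) to unit Euclidean norm.
import numpy as np
from sklearn.preprocessing import Normalizer

X = np.array([[3.0, 4.0],
              [1.0, 0.0]])

X_norm = Normalizer(norm="l2").fit_transform(X)
print(X_norm)   # first row becomes [0.6, 0.8], which has unit length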

Module 3: Segmentation
Image segmentation is the process of dividing an image into non-overlapping, meaningful regions. The main objective of image segmentation is to divide an image into many sections for further analysis, so that we obtain only the necessary segment of information. We use various image segmentation algorithms to split and group certain sets of pixels together within the image. By doing so, we are actually assigning labels to pixels, and pixels with the same label fall under a category in which they have something in common.

Using these labels, we can specify boundaries, draw lines, and separate the most important objects in an image from the less important ones. For example, given an image of a room, we can extract the major components, e.g. chairs and tables, and colour all the chairs uniformly; when individual instances are detected instead, each chair receives a different colour.

This is how different methods of image segmentation work, with varying degrees of complexity, and yield different levels of output.
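
As one simple illustration of the idea (the report does not state which segmentation algorithm is used), the following sketch applies Otsu thresholding with scikit-image to label bright regions of a grayscale slice; the random array merely stands in for a real MRI slice:

# Threshold-based segmentation of a grayscale slice (illustrative only).
import numpy as np
from skimage.filters import threshold_otsu
from skimage.measure import label, regionprops

def segment_slice(img):
    """Return a labelled mask of bright regions in a 2-D grayscale image."""
    thresh = threshold_otsu(img)   # global Otsu threshold
    mask = img > thresh            # binary foreground mask
    return label(mask)             # assign an integer label to each connected region

slice_ = np.random.rand(128, 128)  # stand-in for a real MRI slice
labels = segment_slice(slice_)
for region in regionprops(labels):
    print(region.label, region.area)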

Module 4: Classification
Image classification is to identify and portray, as a unique gray level (or color), the
features occurring in an image in terms of the object or type of land cover these features
actually represent on the ground. Image classification is perhaps the most important part of
digital image analysis.
K-Nearest Neighbours
Neighbours-based classification is a type of lazy learning, as it does not attempt to construct a general internal model but simply stores instances of the training data. Classification is computed from a simple majority vote of the k nearest neighbours of each point.
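
A minimal sketch of this voting scheme with scikit-learn's KNeighborsClassifier, using made-up two-dimensional feature vectors and hypothetical labels (0 = no tumor, 1 = tumor) rather than real image features:

# k-nearest-neighbours classification: store the training data and classify
# a new point by a majority vote among its k nearest training points.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

X_train = np.array([[0.1, 0.2], [0.2, 0.1], [0.9, 0.8], [0.8, 0.9]])
y_train = np.array([0, 0, 1, 1])   # hypothetical labels

knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X_train, y_train)          # "lazy" learning: the data is simply stored
print(knn.predict([[0.85, 0.75]])) # -> [1]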

Support Vector Machine

A support vector machine represents the training data as points in space, separated into categories by a clear gap that is as wide as possible. New examples are then mapped into the same space and predicted to belong to a category based on which side of the gap they fall.
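
A minimal sketch with scikit-learn's SVC on the same kind of made-up feature vectors; the RBF kernel and regularization value are illustrative choices, not the project's settings:

# Support vector machine: fit a maximum-margin separator between the classes.
import numpy as np
from sklearn.svm import SVC

X_train = np.array([[0.1, 0.2], [0.2, 0.1], [0.9, 0.8], [0.8, 0.9]])
y_train = np.array([0, 0, 1, 1])   # hypothetical labels

svm = SVC(kernel="rbf", C=1.0)
svm.fit(X_train, y_train)
print(svm.predict([[0.15, 0.15]])) # -> [0]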
