Big Data Methodology
Introduction
Big Data means exposing most of an organization's data to a diverse variety of users: it provides easy access to data sources, attaches security to the database, and opens private information up for analysis. Big Data itself is certainly a challenge, but not as great a challenge as the use, control, and security of that data. Cyber security is one of the most pressing concerns on the rise. An example is the security attack on Target: hackers stole all the data kept inside Target's database, including customer information such as card details, names, addresses, identities, social security numbers, and much more. So regardless of how large the data is, the issue of cyber security and control remains extremely significant. Data is expensive, but it also helps you make decisions faster, speeding up the business more than ever before.
Wrong data can be a major problem, since decisions depend on it: inaccurate data can lead to large-scale failure of the system. It is like having the correct name but the wrong address. Suppose a product is delivered in such a case; the buyer can complain that he never received it. Another example: one customer has paid all of his bills, but the payment is mistakenly recorded against another customer, which could lead to the first customer being cut off from his assets. Every decision made on faulty data leads to a failure.
Data is also expensive to maintain, and it is never going to stop growing. The data grows every day, and so does the cost of upkeep. The organization also needs to plan policies for who may access the data, who may modify it, and how it is organized. The data can come from various sources in various languages and can be redundant as well, so the organization should have mechanisms to handle huge amounts of data.
Low-quality data, for example redundant copies of the same item, increases the cost and effort needed to maintain it. Poorly organized data likewise leads to wasted effort and unreliable analysis.
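Faulty records, like the wrong-address example above, can be caught early with simple record-level checks before any decision is made on them. A minimal sketch in Python; the field names and rules here are illustrative assumptions, not part of any standard schema:

```python
# Minimal record-validation sketch. The field names and rules are
# illustrative assumptions, not a standard schema.

REQUIRED_FIELDS = ("name", "address", "card")

def validate(record):
    """Return a list of problems found in a customer record."""
    problems = []
    for field in REQUIRED_FIELDS:
        if not record.get(field):
            problems.append(f"missing {field}")
    return problems

records = [
    {"name": "A. Buyer", "address": "12 Main St", "card": "****1111"},
    {"name": "B. Buyer", "address": "", "card": "****2222"},  # bad address
]

for r in records:
    issues = validate(r)
    print(f"{r['name']}: {'ok' if not issues else ', '.join(issues)}")
```

Rejecting or flagging such records at ingestion time is far cheaper than undoing a delivery or a billing action made on them.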
Data Sets
Big data includes several types of data: structured, unstructured, and semi-structured. Big data is further characterized by variety, velocity, and volume.
Structured Data
Structured data is data that can be stored, processed, and retrieved in a fixed format. It refers to data that is already organized, so it can be used without any further processing.
Unstructured Data
Unstructured data refers to data that does not possess a predefined structure. Traditional databases and spreadsheets only answer questions about what happened; they give insight into a problem only at a small scale. To improve an organization's capability and gain deeper insight, it must also analyze the unstructured data gathered from various sources such as customers, audiences, or subscribers.
Semi-structured Data
Semi-structured data combines elements of both arrangements: strictly speaking, it is data that does not conform to the fixed schema of structured data, yet it contains important markers or tags that separate the individual elements within the data.
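As an illustration, JSON is a common semi-structured format: there is no fixed schema, but keys act as the tags that separate individual elements within the data. A small sketch; the record contents are invented for illustration:

```python
import json

# A semi-structured record: no fixed schema, but keys act as the
# "tags that separate individual elements" within the data.
raw = """
{
  "customer": "C-1001",
  "orders": [
    {"item": "laptop", "qty": 1},
    {"item": "mouse", "qty": 2}
  ],
  "note": "deliver after 5pm"
}
"""

record = json.loads(raw)

# Elements can be addressed by their tags even without a schema.
print(record["customer"])
for order in record["orders"]:
    print(order["item"], order["qty"])
```

Fields like "note" can appear in some records and not others, which is exactly what distinguishes semi-structured data from a fixed-schema table.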
Variety
Variety refers to data accumulated from different sources. Data once had to be gathered from spreadsheets and databases; today it arrives in a variety of forms, for example emails, PDFs, photos, videos, audio, social media posts, and much more.
Velocity
Velocity refers to the speed at which data is generated and collected, often continuously. In a broader sense, it covers the rate of change and the linking of incoming data sets.
Volume
Volume is one of the defining attributes of big data. As noted, big data denotes huge volumes of data generated constantly through channels such as social media platforms, business processes, machines, networks, human interactions, and so on, and this large amount of data is stored in data warehouses. This concludes the characteristics. Efficient storage of such large amounts of data reduces storage costs and helps provide business intelligence.
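To make the volume point concrete, here is a back-of-the-envelope growth estimate; the event rate and record size below are assumed figures for illustration, not measurements from any real system:

```python
# Back-of-the-envelope storage-growth estimate.
# Assumed inputs -- purely illustrative:
events_per_second = 5_000   # e.g. posts, transactions, sensor readings
bytes_per_event = 2_000     # ~2 KB per stored record

seconds_per_day = 24 * 60 * 60
bytes_per_day = events_per_second * bytes_per_event * seconds_per_day

gb_per_day = bytes_per_day / 1e9
print(f"~{gb_per_day:.0f} GB/day, ~{gb_per_day * 365 / 1000:.0f} TB/year")
```

Even at these modest assumed rates the warehouse grows by hundreds of gigabytes a day, which is why storage cost and upkeep were flagged as ongoing concerns earlier.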
Research Methodology
The research presented here is secondary, and the methodology used is both qualitative and quantitative: the first involves non-numerical data and information, while the latter uses numerical data, analyzed to find out what problems organizations face and how they deal with them.
Research Philosophy
The philosophy of this research rests on two types of data. First, theoretical information collected from the work of others, in which the data challenges and security risks of big data are explained in detail. Second, numeric (quantitative) data gathered through primary research on companies: a questionnaire was filled in by a manager of every firm, the data was organized on that basis, and pie charts were drawn to present the information.
Research Approach
The approach of the research follows from the above: data gathered from the literature review is used for the qualitative part and is termed secondary data, while on the other hand quantitative data is gathered through questionnaires, collected by surveying organizations that use big data.
Research Strategy
Several strategies can be used in a research process; the one used here is the questionnaire. Questions related to big data security were asked of high-level managers of the organizations, and responses were collected through this channel. The data comprises both primary and secondary data: primary data is gathered first-hand, here with the help of the questionnaires, while secondary data is collected from the work of others.
Data Analysis
In this phase, data is collected by the method discussed above: about 10 organizations were selected and the research was conducted with them. The necessary information was gathered using a survey built from closed-ended questions. This allows for less ambiguity and ensures that the data collected is accurate and therefore fit for the purpose of the study. By contrast, the research design of the paper consulted for this study has an empirical methodology, and only a small pool of cases was analyzed.
To gather the necessary data, particular SMEs were chosen, typically those who have already implemented big data analytics in their business processes. The questions were aimed at understanding the decrease or increase in their spending on the security and privacy upkeep of their data systems. The data so gathered was then analyzed and presented using pie charts.
Survey Questions
Yes = 36.8
No = 19.8

Hadoop = 21
Cloud computing = 54
Monitoring = 18
Auditing = 7

Q3. What are the challenges of big data faced by your organization?
Integrating data = 37
Securing data = 19
Privacy problem = 35

Yes = 45
No = 55

No = 45
Maybe = 20

Yes = 75
No = 25

Q8. Keeping in view the threats to big data, is this really the future?
Yes = 85
No = 10
Maybe = 5
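Percentages like those above can be reproduced from raw closed-ended responses with a simple tally. The sketch below uses the Q8 split reported here (Yes = 85, No = 10, Maybe = 5, assuming 100 respondents) and prints a plain-text bar per option; a pie chart, as used in the study itself, would need a plotting library:

```python
from collections import Counter

# Raw closed-ended answers for Q8; counts chosen to match the
# reported 85 / 10 / 5 split, assuming 100 respondents.
responses = ["Yes"] * 85 + ["No"] * 10 + ["Maybe"] * 5

tally = Counter(responses)
total = sum(tally.values())

for option in ("Yes", "No", "Maybe"):
    pct = 100 * tally[option] / total
    print(f"{option:<6}{pct:5.1f}%  {'#' * round(pct / 2)}")
```

With real survey data, `responses` would be read from the collected questionnaires instead of being constructed to match the reported split.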