
International Journal of Technical Innovation in Modern
Engineering & Science (IJTIMES)
Impact Factor: 3.45 (SJIF-2015), e-ISSN: 2455-2585
Volume 4, Issue 5, May-2018

A Review Paper on Big Data Analytics Tools

Ms. Komal¹
¹Department of Computer Science, Amity University Haryana, [email protected]

Abstract: Big data analytics has become essential for academia, research and the IT industry. Exponentially growing
digital information moves at high speed over the internet infrastructure, mostly in unstructured form: Facebook posts
and likes, tweets, blogs, news articles, YouTube videos, website clicks and so on. Every day, billions of people fetch,
upload and share information on social media and other platforms through mobile phones, laptops and PDAs. This
information comprises pictures, blobs, Google Maps locations, videos, text and voice messages, which together form a
collection of structured, unstructured and complex data objects. Traditional data processing techniques are insufficient
to handle such enormous, heterogeneous and fast-paced data. E-commerce and digital marketing have gained so much
popularity in recent years that the business sector has become more dependent on online transactions and services. Big
data analytics has proven to be a boon for this sector, as it helps to extract useful patterns and previously unknown
correlations about potential consumer markets, client preferences, buying attributes and much other information from
intricate data sources. This paper aims to provide a detailed review and comparative assessment of the latest tools and
frameworks used for big data analytics.

Keywords: Big Data, Data Analytics, Hadoop, MapReduce, Cassandra, MongoDB.

I. INTRODUCTION

The term 'Big Data' is characterized by three things: 1) it is highly voluminous; 2) it is created, shared and removed
online in fractions of a second; and 3) it comes in varied forms, i.e. a collection of structured, unstructured and
complex datasets. Big data analytics has quickly drawn the attention of the IT industry due to its application in a
majority of areas such as healthcare, business, social media, education and banking [1]. Traditional means of processing
and analyzing data mainly rely on limited data sets organized in a structured form. Such tools and techniques add little
value in the big data context. The six parameters of big data [2] (volume, variety, velocity, veracity, variability and
complexity) make data processing cumbersome for older data management tools and techniques.

The volume of data can still be managed, as digital storage capacities have increased over time, leading to cheap hard
disks, large and extensible storage in mobile phones and, above all, cloud services offered by many providers.
Management of this huge repository of data is another challenging aspect. Though cloud storage has eased data storage
issues, it carries an additional information security risk. The biggest challenge remains the analysis and mining of
unstructured data on the go, as it is generated over the internet. Thus, big data analytics plays a crucial role in
today's scenario.

This promising field of big data analytics comes with many challenges for professionals. Data inconsistency, integrity,
privacy, timeliness, storage and representation, and unstructured heterogeneous data sources all pose difficulties.
Efficient organization and representation of this huge repository of data is quite challenging. Various data
pre-processing techniques such as filtering, noise elimination, classification and transformation have their own
challenges [4]. These aspects make the field of big data analytics even more interesting. Many tools and techniques have
been developed so far to ease the process of data analysis. This paper provides a summarized review of these tools.

The rest of the paper is organized as follows: Section II explains the lifecycle of big data analytics. Section III
contains a comparative analysis of tools used at various stages of big data analytics. Section IV concludes the work
with the findings.

IJTIMES-2018@All rights reserved 1012


II. METHODOLOGY OF BIG DATA ANALYTICS

This section explains the stages of the big data analytics lifecycle [3]:
A. Data identification and collection: In this phase, a wide variety of data sources are identified depending upon the
severity of the problem. More data sources mean a better chance of finding hidden correlations and patterns. Tools
are needed to capture keywords, data and information from these heterogeneous data sources.
B. Data storage: The captured structured and unstructured data need to be stored in databases or data warehouses.
NoSQL databases are needed to accommodate big data. Various frameworks and databases have been developed
by organizations such as Apache and Oracle that allow analytics tools to fetch and process data from these
repositories.
C. Data filtering and noise elimination: This phase is dedicated to removing replicated, corrupt, null and irrelevant
data objects from the gathered information. However, data that is filtered out might be of some importance in
another context or analysis. Hence, it is advisable to keep a copy of the original data sets, in compressed form to
save storage space [3].
D. Data classification and extraction: This phase is responsible for extracting disparate data and converting it
into a common data format that the underlying analytics tool can use. This may also involve extracting only the
relevant fields or text to reduce the volume of data submitted to the analytics engine.
E. Data cleansing, validation and aggregation: This stage applies validation rules based on the business case to
confirm the necessity and relevance of the data extracted for analysis, although the complexity of the extracted
data can sometimes make validation constraints difficult to apply. Aggregation combines multiple data sets into
fewer data sets based on common fields, which simplifies further data processing.
F. Data analysis and processing: This stage carries out the actual data mining and analysis to establish unique and
hidden patterns for making business decisions. The data analytics technique may vary depending upon the scenario,
i.e. exploratory, confirmatory, predictive, prescriptive, diagnostic or descriptive [3].
G. Data visualization: This phase involves representing the analysis results in a visual or graphical form that is
easier for the audience to understand.
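
As a minimal sketch of stages C to E above, the following Python fragment filters, normalizes, validates and aggregates a handful of records. The record layout, the field names (`user`, `amount`) and the validation rule are hypothetical and chosen only for illustration; they are not taken from any tool reviewed in this paper.

```python
from collections import defaultdict

# Hypothetical raw records gathered from two sources with different layouts.
RAW = [
    {"user": "a", "amount": "10"},
    {"user": "a", "amount": "10"},    # exact duplicate
    {"user": "b", "amount": None},    # null value
    {"USER": "B", "AMOUNT": "5.5"},   # different key and value casing
    {"user": "c", "amount": "-3"},    # fails the validation rule below
]


def filter_noise(records):
    """Stage C: drop exact duplicates and records containing null values."""
    seen, kept = set(), []
    for rec in records:
        key = tuple(sorted(rec.items()))
        if key in seen or any(v is None for v in rec.values()):
            continue
        seen.add(key)
        kept.append(rec)
    return kept


def to_common_format(rec):
    """Stage D: convert a record to one common format (lower-case keys, float amount)."""
    low = {k.lower(): v for k, v in rec.items()}
    return {"user": low["user"].lower(), "amount": float(low["amount"])}


def is_valid(rec):
    """Stage E, validation: assumed business rule that amounts are non-negative."""
    return rec["amount"] >= 0


def aggregate(records):
    """Stage E, aggregation: combine records on the common field `user`."""
    totals = defaultdict(float)
    for rec in records:
        totals[rec["user"]] += rec["amount"]
    return dict(totals)


def run_pipeline(raw):
    filtered = filter_noise(raw)
    common = [to_common_format(r) for r in filtered]
    valid = [r for r in common if is_valid(r)]
    return aggregate(valid)


if __name__ == "__main__":
    print(run_pipeline(RAW))
```

Keeping each stage as a separate function mirrors the lifecycle: the untouched `RAW` list plays the role of the original data set that stage C recommends retaining.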

III. COMPARATIVE ASSESSMENT OF BIG DATA ANALYTICS TOOLS

Since the advent of big data, a number of tools have been developed by programmers and agencies to assist in the process
of data analysis. These tools are categorized here by the stage of the big data lifecycle in which they are used. This
section classifies and compares some of the most popular and widely used tools.
A. Data Collection tools
Data collection depends on the business case and on the type of data sources identified. Unstructured data is captured
mostly from social networking sites. There are some popular tools that collect data from embedded websites with the
help of semantic and text analysis. The table below compares such data collection tools.

TABLE I
COMPARISON OF POPULAR DATA COLLECTION TOOLS [4]

| Tool | Type of analysis | Analysis engine | Deployability | Open/License/Enterprise solution |
|---|---|---|---|---|
| Semantria | Text and sentiment analysis | NLP based | Web, cloud API, Excel | Proprietary license |
| Opinion Crawl | Sentiment analysis | SenseBot | Web | Open website |
| OpenText | Content management and analysis | Red Dot, Captiva | Windows-based server application | Enterprise |
| Trackur | Influence and sentiment analysis | Trackur | Web (social media) | Proprietary license |
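
Several of the tools in Table I perform sentiment analysis on collected text. To illustrate the general idea only, the sketch below scores a sentence against two small word lists. The word lists and function names are hypothetical, and this lexicon-counting approach is an assumption made for illustration; it does not describe how Semantria, Opinion Crawl or Trackur work internally.

```python
import re

# Tiny hypothetical sentiment lexicons; real tools use far richer models.
POSITIVE = {"good", "great", "love", "excellent"}
NEGATIVE = {"bad", "poor", "hate", "terrible"}


def sentiment_score(text):
    """Return (#positive words) minus (#negative words) found in the text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    positives = sum(token in POSITIVE for token in tokens)
    negatives = sum(token in NEGATIVE for token in tokens)
    return positives - negatives


def label(text):
    """Map the numeric score to a coarse sentiment label."""
    score = sentiment_score(text)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    for post in ("I love this phone, great battery",
                 "Terrible service and poor support",
                 "The parcel arrived on Monday"):
        print(label(post), "<-", post)
```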


B. Data Storage tools and frameworks

Most data processing and analysis tools work on top of a database framework. Hence, several well-known companies and
organizations now provide database solutions and frameworks. The following table provides a summarized assessment of
popular NoSQL databases.

TABLE II
COMPARISON OF POPULAR DATA STORAGE TOOLS [5]

| NoSQL database | Data model | Zero downtime (on node failure) | Concurrency | Secondary indexes |
|---|---|---|---|---|
| Apache HBase (Hadoop database) | Column-oriented | Yes | Yes (optimistic concurrency) | No |
| CouchDB | Document-oriented | No | Yes (optimistic concurrency) | Yes |
| MongoDB | Document-oriented | No | Yes | Yes |
| Apache Cassandra | Column-oriented | Yes | Yes | No |
| Apache Ignite | Multi-model | Yes | Yes | Yes |
| Oracle NoSQL Database | Key-value based | No | Yes | Yes |
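
The storage comparison lists optimistic concurrency for some databases. As a rough sketch of that general technique, the in-memory store below attaches a version number to each value and rejects a write whose expected version is stale, instead of locking the record. The class and method names are hypothetical and do not correspond to the API of any database in the table.

```python
class VersionConflict(Exception):
    """Raised when a write is based on a version that is no longer current."""


class VersionedStore:
    """In-memory key-value store in which every value carries a version number."""

    def __init__(self):
        self._data = {}  # key -> (version, value)

    def read(self, key):
        """Return (version, value); a missing key reads as version 0 and value None."""
        return self._data.get(key, (0, None))

    def write(self, key, value, expected_version):
        """Store the value only if nobody has written since `expected_version` was read."""
        current_version, _ = self._data.get(key, (0, None))
        if current_version != expected_version:
            raise VersionConflict(
                f"{key!r}: expected version {expected_version}, found {current_version}"
            )
        self._data[key] = (current_version + 1, value)
        return current_version + 1


if __name__ == "__main__":
    store = VersionedStore()
    version, _ = store.read("cart")          # two clients both read version 0
    store.write("cart", ["pen"], version)    # the first write succeeds
    try:
        store.write("cart", ["ink"], version)  # the second write is stale
    except VersionConflict as err:
        print("rejected:", err)
```

A rejected writer is expected to re-read the value and retry, which is why this scheme suits workloads where conflicting writes are rare.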

C. Data filtering and extraction tools

Data filtering and extraction tools are used to create structured output from unstructured data gathered in previous
stages. Some of these tools are compared below.

TABLE III
COMPARISON OF POPULAR DATA FILTERING AND EXTRACTION TOOLS [7]

| Tool | Free/Paid version | Extensible | Feature | Output |
|---|---|---|---|---|
| Pentaho | Both free and enterprise paid version | Yes | ETL and data mining capabilities | Structured data |
| OctoParse | Both free and paid version | No | Web scraping | Structured spreadsheets |
| ParseHub | Both free and paid version | No | Cloud-based desktop app | Excel, CSV, Google Sheets |
| Mozenda | Paid Enterprise and Professional version | Yes | Web scraper | Structured data (JSON, XML and CSV) |
| Content Grabber | Paid version | Yes | Web scraping with debugging and error handling | Structured data (XML, CSV and databases) |
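
To make the idea of turning unstructured web content into structured output concrete, the sketch below parses a small HTML fragment with Python's standard `html.parser` module and emits CSV. The HTML layout, the class names `name` and `price`, and the `ProductParser` class are assumptions made for this example; the scraping tools compared above use their own, far more capable engines.

```python
import csv
import io
from html.parser import HTMLParser

# Hypothetical page fragment; the class names are assumptions for this sketch.
SAMPLE = """
<ul>
  <li><span class="name">Pen</span> <span class="price">1.20</span></li>
  <li><span class="name">Notebook</span> <span class="price">3.50</span></li>
</ul>
"""


class ProductParser(HTMLParser):
    """Collect the text of <span class="name"> and <span class="price"> elements."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._field = None  # field currently being read, if any

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "span" and css_class in ("name", "price"):
            self._field = css_class

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None

    def handle_data(self, data):
        if self._field == "name":
            self.rows.append({"name": data.strip(), "price": ""})
        elif self._field == "price" and self.rows:
            self.rows[-1]["price"] = data.strip()


def html_to_csv(html):
    """Extract product rows from the HTML and return them as CSV text."""
    parser = ProductParser()
    parser.feed(html)
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=["name", "price"], lineterminator="\n")
    writer.writeheader()
    writer.writerows(parser.rows)
    return out.getvalue()


if __name__ == "__main__":
    print(html_to_csv(SAMPLE))
```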


D. Data cleaning and validation tools

Data cleaning tools are extremely helpful in reducing the processing time and computational load of data analytics
tools and engines, though they are not used as often as other tools. A comparison of the latest data cleaning tools
is provided in the table below.
TABLE IV
COMPARISON OF POPULAR DATA CLEANING TOOLS [6]

| Tool | Processing model | Additional features | Data source |
|---|---|---|---|
| DataCleaner | Record and field processing | Data transformation, validation and reporting | Integration with Hadoop database |
| MapReduce | Parallel data processing | Searching, sorting, clustering and translation | Hadoop database |
| RapidMiner | GUI and batch processing | Filtering, aggregation and merging | Internal database integration |
| OpenRefine | Batch processing | Transforming data from one form to another | Web services and external data |
| Talend | Streaming, batch processing | Data integration | Numerous databases |
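
MapReduce appears in this comparison with a parallel processing model. The single-process sketch below shows only the shape of that model (map, shuffle by key, reduce) on a word-count task. The function names are hypothetical and this is not the Hadoop API; in a real cluster the map and reduce calls would run on different nodes rather than sequentially as they do here.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle and reduce in sequence over all documents."""
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    return dict(reduce_phase(key, values) for key, values in shuffle(mapped).items())


if __name__ == "__main__":
    print(word_count(["big data", "Big tools"]))
```

Because each `map_phase` call sees only its own document and each `reduce_phase` call sees only one key, both phases can be distributed without coordination; only the shuffle needs to move data between workers.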

E. Data analysis tools

Most of the tools in this category are not only analysis tools but perform other functions too. For the analysis itself,
they deploy data mining, artificial intelligence and other techniques. A summarized review of these tools is provided in
the table below.

TABLE V
COMPARISON OF POPULAR DATA ANALYSIS TOOLS [8][9]

| Tool | Processing model | Language support | Latency |
|---|---|---|---|
| Hive | Streaming | SQL-like | High |
| Apache Spark | Mini/micro batches, streaming | Scala, Java, Python | Seconds |
| Apache Storm | A record at a time | Any | Milliseconds |
| MapReduce | Parallel processing | Java, Ruby, Python, C++ | More (seconds) |
| Qubole | Stream processing, ad-hoc queries | Python, Scala, R, Go | Seconds |
| Flink | Batch and stream processing | Scala, Java, Python | Seconds |
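
The comparison distinguishes record-at-a-time processing from micro-batch processing. The sketch below contrasts the two on a small in-memory stream; the function names and the use of `sum` as the handler are assumptions for illustration, not the API of Storm or Spark. It hints at why the latencies differ: a micro-batch result cannot be produced until the whole batch has been collected.

```python
from itertools import islice


def record_at_a_time(stream, handle):
    """Invoke the handler once per record, as soon as each record arrives."""
    return [handle([record]) for record in stream]


def micro_batches(stream, batch_size, handle):
    """Collect up to `batch_size` records, then invoke the handler once per batch."""
    iterator = iter(stream)
    results = []
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            break
        results.append(handle(batch))
    return results


if __name__ == "__main__":
    events = [1, 2, 3, 4, 5]
    print(record_at_a_time(events, sum))   # one result per event
    print(micro_batches(events, 2, sum))   # one result per batch of two
```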


F. Data Visualization tools

There are plenty of data visualization tools available in the market. Most of them are integrated versions of data
extraction, analysis and visualization tools. The following table compares the most popular and widely used data
visualization tools.

TABLE VI
COMPARISON OF POPULAR DATA VISUALIZATION TOOLS [11][12]

| Tool | Licensed/Open-source | Data source compatibility | Coding/Programming language need | Output features |
|---|---|---|---|---|
| DataWrapper | Open-source | CSV, PDF, Excel, CMS | Ready-to-use codes | Bar chart, line chart, map, graphs |
| Tableau | Open-source | Database, API | No coding | Maps, bar charts, scatter plots |
| Orange | Open-source | Files, SQL tables and data tables, or can paint random data | No programming needed | Scatter plots, bar charts, trees, dendrograms, networks and heat maps |
| Qlik | Licensed | Database, spreadsheet, website | Programming language and SQL knowledge needed | Dashboard, apps |
| Google Fusion Tables | Google's web service | Comma-separated value file formats | No programming needed | Pie charts, bar charts, line plots, scatter plots, timelines |
| CartoDB | Open-source | Location data, plenty of data types | CartoCSS language | Maps |
| Chartio | Open-source | Multiple data sources | Own visual query language | Line/bar/pie charts, dashboard sharing as PDF reports |
| Gephi | Open-source | CSV, GraphML, GML, GDF, spreadsheet | No programming | Graphs and networks |
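
Bar charts are the output feature most of these tools share. As a minimal illustration of mapping aggregated results to a visual form, the sketch below renders a horizontal bar chart as plain text. The function name and the sample counts are hypothetical, and the tools above produce graphical rather than textual output.

```python
def text_bar_chart(counts, width=20):
    """Render a horizontal bar chart as text, scaling the largest value to `width` marks."""
    if not counts:
        return ""
    peak = max(counts.values())
    label_width = max(len(label) for label in counts)
    lines = []
    for label, value in counts.items():
        bar = "#" * round(width * value / peak) if peak else ""
        lines.append(f"{label.ljust(label_width)} | {bar} {value}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical analysis result, e.g. the number of posts per sentiment label.
    print(text_bar_chart({"positive": 4, "negative": 2}, width=8))
```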

IV. CONCLUSIONS
The rate of development of information processing tools is much slower than the rate at which information is generated.
Tools currently available in the market do not address all the issues of big data analytics. Even advanced tools and
techniques like Hadoop, Cassandra and Ignite cannot deliver real-time analysis in the true sense, though they have
considerably increased the ease of handling diverse data sets and reduced data processing time. There are still some
unaddressed issues related to effective storage, searching, analysis, sharing and security. This paves the way for
future improvements and developments of big data analytics tools.


REFERENCES

[1] M. Chen, S. Mao, and Y. Liu, “Big data: a survey”, Mobile Networks and Applications, vol. 19, no. 2, pp. 171-209,
2014.
[2] S. Mujawar, S. Kulkami, “Big Data: Tools and Applications”, International Journal of Computer Applications, vol.
115, no. 23, pp. 7-11, 2015.
[3] T. Erl, W. Khattak, and P. Buhler, Big Data Fundamentals: Concepts, Drivers & Techniques, Prentice Hall, India,
pp. 65-88, 2015.
[4] N. Khan et al., “Big Data: Survey, Technologies, Opportunities, and Challenges”, The Scientific World Journal,
vol. 2014, issue 4, pp. 1-18, 2014.
[5] Online source, [Available] https://www.import.io/post/all-the-best-big-data-tools-and-how-to-use-them/, 2018.
[6] Online source, [Available] https://www.guru99.com/big-data-tools.html, 2018.
[7] Online source, [Available] https://www.octoparse.com/blog/yes-there-is-such-thing-as-a-free-web-scraper/, 2018.
[8] Online source, [Available] https://data-flair.training/blogs/apache-storm-vs-spark-streaming/
[9] A. Narang, “A review-Cloud and cloud security”, International Journal of Computer Science and Mobile Computing,
vol. 6, issue 1, pp. 178-181, 2017.
[10] K. Komal, “Cognitive Science: Bridging the Gap between Machine and Human Intelligence”, International Journal
of Computer Applications, vol. 114, issue 5, pp. 16-19, 2015.
[11] S. Kaushal, J.K. Bajwa, “Analytical Review of User Perceived Testing Techniques”, International Journal of
Advanced Research in Computer Science and Software Engineering, vol. 2, issue 10, 2012.
[12] S. M. Ali et al., “Big Data Visualization: Tools and Challenges”, 2nd International Conference on Contemporary
Computing and Informatics, 2016.
