The document discusses the rapid growth of data generation, particularly through the Internet and smartphones, leading to the emergence of Big Data characterized by its volume, velocity, variety, veracity, and value. Traditional data processing tools are inadequate for handling the complexity and size of Big Data, which includes unstructured data from various sources. There is a focus on developing new methods and software to effectively process and analyze Big Data due to its potential high business value.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
2 views2 pages
Big Data
The document discusses the rapid growth of data generation, particularly through the Internet and smartphones, leading to the emergence of Big Data characterized by its volume, velocity, variety, veracity, and value. Traditional data processing tools are inadequate for handling the complexity and size of Big Data, which includes unstructured data from various sources. There is a focus on developing new methods and software to effectively process and analyze Big Data due to its potential high business value.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 2
With technology making
an inroad into almost every
sphere of our lives, data is being produced at a colossal rate. Today, there are over a billion Internet users, and a majority of the world’s web traffic is coming from smartphones. Figure 2.8 shows that at the current pace, around 2.5 quintillion bytes of data are created each day, and the pace is increasing with the continuous evolution of the Internet of Things (IoT). This results in the generation of data sets of enormous volume and complexity called Big Data. Such data cannot be processed and analysed using traditional data processing tools as the data is not only voluminous, but also unstructured like our posts, instant messages and chats, photographs that we share through various sites, our tweets, blog articles, news items, opinion polls and their comments, audio/video chats, etc. Big data not only represents voluminous data, it also involves various challenges like integration, storage, analysis, searching, processing, transfer, querying and visualisation of such data. Big data sometimes hold rich information and knowledge which is of high business value, and therefore there is a keen effort in developing software and methods to process and analyse big data. Characteristics of Big Data Big data exhibits following five characteristics shown in Figure 2.9, that distinguish it from traditional data. (A) Volume The most prominent characteristic of big data is its enormous size. If a particular data set is of such large size that it is difficult to process it with traditional DBMS tools, it can be termed as big data. (B) Velocity It represents the rate at which the data under consideration is being generated and stored. Big data has an exponentially higher rate of generation than traditional data sets. (C) Variety It asserts that a data set has varied data, such as structured, semi-structured and unstructured data. Some examples are text, images, videos, web pages and so on. (D) Veracity Big data can be sometimes inconsistent, biased, noisy or there can be abnormality in the data or issues with the data collection methods. Veracity refers to the trustworthiness of the data because processing such incorrect data can give wrong results or mislead the interpretations. (E) Value Big data is not only just a big pile of data, but also possess to have hidden patterns and useful knowledge which can be of high business value. But as there is cost of investment of resources in processing big data, we should make a preliminary enquiry to see the potential of the big data in terms of value discovery or else our efforts could be in vain.