0% found this document useful (0 votes)
11 views22 pages

290 Mis L5

Uploaded by

fclase28
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views22 pages

290 Mis L5

Uploaded by

fclase28
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 22

Technology in Business

- LECTURE FIVE -

The Current Issues and Challenge Facing


the Development of Big-data
Related Chapters in the Text:
Chpt. 6, 7, 8

Ruben Xing, PhD


Copyright ©2019 Dow Jones & Company, Inc. All Rights Reserved
Historic Reviews for the Cognitive Errors,
Issues & Strategic Omissions Facing Big-data
《 Cognitive-Error 1 》

“ Big-data Era ”
《 Cognitive-Error 2 》

“ Year of Big-data ”
《 Cognitive-Error 3 》

“ Big-data is Irrelevant with Blockchain ”


《 Strategic Issues 》

“Strategic Omissions, Challenge & Opportunities”


《 Reconsidering WIC’s Direction of Sci-Tech Development 》

“The World is Moving from IT Stage to DT Era ”


《 Cognitive-Error 1》

Top IT Strategic Targets to be Focused


Determined by CIO Summit

What is the Position of Big-data ?


CIO 2020 Summit Future Predictions
Big-data is a Continuation of the Carbon-data
The Internet Development is the Key Driving Force for the Quick Growth of
the Silicon-Data
《 Cognitive-Error 2 》

Amazing Symmetry of Ancient Natural Numbers


Invented 5000 Years Ago Has Revealed
Incredible Digital Connotations

1x1=1
11 x 11 = 121
111 x 111 = 12321
1111 x 1111 = 1234321
11111 x 11111 = 123454321
111111 x 111111 = 12345654321
1111111 x 1111111 = 1234567654321
11111111 x 11111111 = 123456787654321
111111111 x 111111111 = 12345678987654321
The So-Called Big-Data was Undoubtedly
Originated and Evolved from the Carbon Data
The Miracle of Pyramid

1 x 9 +2= 11
12 x 9 +3= 111
123 x 9 +4= 1111
1234 x 9 +5= 11111
12345 x 9 +6= 111111
123456 x 9 +7= 1111111
1234567 x 9 +8= 11111111
12345678 x 9 +9= 111111111
123456789 x 9 +10= 1111111111
The Magic Power of Natural Numbers is
Comparable to Today's Silicon-based Big Data
The Most Mysterious Natural Number Found
in Ancient Pyramids“142857”
142857 X 1 = 142857, 142857 X 2 = 285714
142857 X 3 = 428571, 142857 X 4 = 571428
142857 X 5 = 714285, 142857 X 6 = 857142
142857 X 7 = 999999 ,
1,2,3,4,5,6,8,9 / 7 = The Most Amazing Infinite Loop Decimal !!

The Most Powerful Natural Number that Leads to the Universe - Tesla
Code: 3 6 9
Stage of Web 0.1 –
Modern OS Based Computing Data Storage
Big-Data wasn’t recognized, tagged only because of
the limited storage capacity confined by old OS

Verb of the Stage of Web 0.1 - COPY


Big Data is NOT a New Field, But the Inevitable
Product Along With the Internet Movement
Glance of Silicon-data Growth History

Bit (b) 1 or 0 Binary digit, computers use it to compose, store


and process data; One hole per bit on Punch-card
Byte (B) 8 bits An English letter or number in computer code.
70 bytes max. created per punch-card

Kilobyte 1024 B, One regular typed text page is 2KB; An 51/4


(KB) or floppy disk stores 80~800 KB data max.
103 bytes
Megabyte 1024 KB, In 1950’s, IBM developed first computer hard-
(MB) or drive contains 5MB of data, which covers the
106 bytes complete works of Shakespeare. By 1960’s, 650
MB CD was developed to store multimedia data
Gigabyte 1024 MB, By 1990, IBM first developed a 1-GB computer
(GB) or disk drive. By 2005, 100s-GB based DVD came
109 bytes out A two-hour film contains 1~2 GB
Google Search Based Multimedia
Web 1.0 – Silicon Data Starts Quick Growth
Web1.0 is a Read-only Internet Platform, Created the Term of Bigdata

Verb of the Stage of Web 1.0 - GOOGLE

Multimedia Sampling Rate > = (2) * (BW)


BW= Bandwidth ,250 Word pages ~ 1MB;
Down/Upload rate: 25mbps/5mbps
Web 2.0 - Social Network
A Writeable Internet Platform with Large Number of
Mobile and Interactive Users
Data Volume Growth Never Ends
Terabyte 1024 GB, or 2009, SONY first developed 2 TB memory card. US
(TB) 1012 bytes Congress Library collections are digitized and publicly
available on the Internet is about 74 TB

Petabyte 1024 TB, or Total 6 PB of letters US postal service delivered in 2012;


(PB) 1015 bytes Google process 20 PB data per day; A human’s brain
memory capacity ≈ 2.5 PB
Exabyte (EB) 1024 PB, or By 2010, computer storage reached 1-EB capacity .
1018 bytes

Zettabyte (ZB) 1024 EB, or Data grows double in every two years from now on. By 2020, data
1021 bytes will reach 40 ZB. 2014 global amount of data has reached 5-ZB, (as
all carved into the DVD (5G/Pd) discs can be superimposed with a
total length of two round-trip distance from the Earth to the Moon, a
total of about 1.6 million km).

Yottabyte 1024 ZB, or Big data grows will continue, and be stored on the Cloud
(YB) 1024 bytes with virtually unlimited space.
Brontobyte 1024 YB or In the near future, BB will be the measurement to describe
(BB) 1027 Bytes the type of sensor data that will be generated from the IoT

22p(n)
In the age of quantum communications, data will grow at a
double exponential power
Web 3.0: Internet of Thing (IoT)
NB-IoT Focused on Fog Computing Featured with AI and VR
A New Internet that not Just Integrate Data but Everything
AI-Driven Emerging Trends of IoT:
Quitting the Cloud;
MATTER Matters Unified AI-Environment

1)XML framed 2)NB-IoT structured with 3)Featured in 3-I,


with RDF, Wireless Sensor Network Functioned with
Ontology making (WSN) and M2M based AI, VR, AR, MR,
Web semantic Fog Computing LA, PA, and FA
《 Cognitive-Error 3 》

Blockchain Technology Makes Essential


Changes to Traditional Big-data

Modern Global Consensus - Digital Survival


While the traditional Internet digitized Information, Multimedia, and
Everything, the Blockchain creates Digital Value based on its
Decentralized and Concensus mechanism .
Bitcoin The Most Successful Application of Blockchain
• The emergence of decentralized Digital encryption currency on the
blockchain Internet has completely subverted the traditional value concept
of the Gold or Dollar Standards currency.
• Built on Blockchain platform, Bitcoin is the earliest successful Digital
Cryptocurrency. With total 21M bitcoins launched by (Satoshi-Nakamoto)
defined in 2008, the first official transaction in 2010 (400: $1) to 2023
(1:$42,600, and the total market value reached $1,156 billion in 2021)
• Blockchain has become a major hope to resolve some key issues facing the
current AI development. “It could track in granular detail the data that AI is
trained on, and could be useful when AI churns out dubious results” (WSJ 1/11/2024)
《 Strategic Issues 》

Strategic Challenges Facing Big-Data


- The 5-V Strategies -
• VOLUME of Big-data created every moment
• VARIETIES of Big-data generation

• VELOCITY of Big-data being captured, structured and


processed
• VALUE challenge facing Big-data’s usability

• VERACITY challenge facing Big-data security, reliability


and originality
The Main Sources of Current Big Data
(VARIETY)
More than 80% of big data comes from social networking sites,
postings, digital images, videos, photos, and information on
weather, traffic, safety, public place management monitors, online
shopping transaction records, mobile phone information, GPS
signals. In addition to the following varieties, the massive data
generated by the IoT, Fog, AI/VR will soon become other big-data
sources, which are most unstructured, unmarked, and unanalyzed.
This is the content of big data today.
Challenge Facing Big-data
Capturing, Structuring, and Processing
Traditional Database is Being Replaced by the Rise of Data-Lake
Verb of the Stage of Web 3.0: Hadoop/Python/Tableau…
• Big-Data should be structured, formatted, tapped that makes it suitable for
data mining and subsequent analysis. Hadoop is the key.
• Hadoop and other data analytics technologies are becoming a major Driving
Force for developing M2M based Big-data and the key measures to collect,
clean and organize Data-Lake contained data
The Current Competitions of Big-data
Integration Processing Technology
Challenge Facing Big-data’s Value
Availability, Quality, Accuracy & Intelligence
 More than 80% of data captured : Sensors used to gather climate, traffic, security information,
and public administrations; Posts to social media sites, Digital pictures and videos, purchase
transaction Records, and Cell phone messages, GPS signals... All of these are unstructured,
untapped, unanalyzed, and is so-called Big Data today.
 Big-Data is useful ONLY IF they are tapped, structured and analyzed (IBM)
 However, even with a generous estimate, the amount of information in the digital universe that is
"tagged" accounts for only about 3% of the digital universe in 2012, and that which is analyzed
is half a percent of the digital universe. (IDC)
 5K scientific and technological revolution theoretical data, 64K lunar
landing data triggered reflection on Smaller and precise data
Challenge Facing Big-data’s Value
Safety, Quality, Unbiasedness, Accuracy & Intelligence

• In addition to large amount of garbage data Spurious Correlations becomes


another potential tragedy of big data: The more variables, the more correlations
that can show significance. Falsity also grows faster than information;
 Algorithm Business (AB) is an essential strategy for the Ingenuity orientated
algorithm to deal with big-data issues, and quickly become the hot spot for all
businesses;
 UPS Algorithm Revolutions – a typical AB example
 The bottleneck of ensuring AI algorithm is safe, unbiased and accurate has
brought Blockchain a hopeful solution for business
The Current Bottleneck of the Internet/AI
and the Promises that can Hardly to Ensure

Information Security
The massive C/S based servers make IoT
vulnerable to DDoS attacks

Information Reliability
Ensured by Hash Algorithm based Blocks

Information Originality
All the information delivered on traditional Internet
is a replica. The Block-based data is original,
Traceable,, and verifiable
CONCLUSIONS
 Big data does not represent a new era, nor a new field, but an
important resource in the four domains of the contemporary
technological industry revolution. Big data is the historical
development process of carbon-silicon-based and even
quantitative-based data generated by human civilization
 The traditional Internet digitizes information, the media, and
every existing objects. The mutual trust mechanism based
Blockchain Internet creates value digitization.
 Big data should be comprehensively reviewed with a 5-V strategy.
Integrated technologies with high-speed processing, intelligent
algorithms, secure, reliable, high quality and efficiency are the
major challenges and the development opportunities

Tackling the Problems Associated with Big-Data


Takes More than Intelligence: It Takes Ingenuity !
Time to Redefine what IT and BI Should Mean Today !!

You might also like