0% found this document useful (0 votes)
15 views13 pages

Topic 1a - Introduction To Data Mining

The document introduces data mining, its history, techniques, and applications, emphasizing the importance of extracting meaningful patterns from large datasets. It discusses the relationship between data, information, knowledge, and wisdom, and highlights the growth of data availability in the modern era. Additionally, it touches on the transition from data mining to big data mining and its implications for discovering insights from social media and other platforms.

Uploaded by

2024793147
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views13 pages

Topic 1a - Introduction To Data Mining

The document introduces data mining, its history, techniques, and applications, emphasizing the importance of extracting meaningful patterns from large datasets. It discusses the relationship between data, information, knowledge, and wisdom, and highlights the growth of data availability in the modern era. Additionally, it touches on the transition from data mining to big data mining and its implications for discovering insights from social media and other platforms.

Uploaded by

2024793147
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 13

Topic 1:

Introducti
on to Data
Mining
Ts. Dr. Tuan Norhafizah Tuan
Zakaria
Objectives

To introduce about To discuss the history, To discuss Data Mining


Data Mining and its evolution and techniques, tasks,
relationship with data motivation of Data applications and some
and knowledge Mining major issues
Pattern Recognition and Data Mining
PATTERN RECOGNITION
a process of recognizing a pattern using machine (computer), it can be viewed through several aspects

Pattern Recognition by Computer Pattern Recognition from Data


Pattern Recognition by Human  benefit of automated pattern  learn or observe from large
 perceptual (emotions,
recognition amounts of data
feelings)  advantage in complex  study the dependencies and
 specialized – decision making extract knowledge from data
calculations
Data, Information, Knowledge & Wisdom

Data

• the basic facts such as names, numbers or characters Pak Kassim needs to taking care his food
Applied
that come in different forms (like text or image). intake and body health.
• Raw fact with no meaning. Wisdom

Information Context Pak Kassim has high blood pressure


Knowledge
• Fact that has meaning

Knowledge 140mmHg is the blood pressure


Information Meaning
of Pak Kassim
• Information that has context

Wisdom
Data Raw 140mmHg

• Applied of knowledge
What is Data?
• Table 1 - a sample of data with five (5) variables, where the last column indicates the outcome
of that sample.

# Names Studies Education Work_performance Income (D)

1 Amni Ali Poor High School Poor None


2 Chuah Ah Lan Moderate High School Poor Low
3 Daria Danial Poor High School Poor None
4 Marisa Malik Moderate Diploma Poor Low
5 Nur Aini Mat Poor High School Good Low
6 Suria Mohd Moderate Diploma Poor Low
7 Ozaila Othman Good Master Good Medium

99 Muhd Haris Aziz Poor High School Good Low

100 Zulhairi Yatim Moderate Diploma Poor Low


What is
Knowledge?
• the processed or organized data (information) that is given some values to uncover the relationship for deeper understanding.
• Sample of knowledge in the form of IF then ELSE rules:

studies(Poor) AND work(Poor)  income(None)


studies(Poor) AND work(Good)  income(Low)
education(Diploma)  income(Low)
education(Master)  income(Medium)
OR income(High)
studies(Moderate)  income(Low)
studies(Good)  income(Medium)
OR income(High)
education(SPM) AND work(Good)  income(Low)
https://www.ontotext.com/knowledgehub/fundamentals/dikw-pyramid/
Definition:
What is Data • extraction of interesting (non-trivial, implicit, previously
unknown and potentially useful) patterns or knowledge
Mining? from huge amount of data
• exploration and analysis, by automatic or semi-
automatic means, of large quantities of data in order to
discover meaningful patterns

Alternatives names:
• Knowledge discovery (mining) in databases (KDD),
knowledge extraction, data/pattern analysis, data
archeology, data dredging, information harvesting

Is everything “data mining”?


• Simple search and query processing, like query of
information about “Shopee products”
Why Data Mining?

• Today, massive growth of data availability, from Terabyte to Yottabyte, it is everywhere and anywhere
• “There were 5 exabytes of information created between the dawn of civilization through 2003, but that much
information is now created every 2 days” – Eric Schmidt, Executive Chairman of Google
• “Information is the oil of 21st century, and analytics is the combustion engine.” – Peter Sondergaard, Gartner
Research.
• Source of data ?

Facebook, Instagram, Telegram Blogs, News Amazon, Shopee, Lazada


(Social Media) (Society) (E-commerce)
From Data Mining to Big Data Mining

• What is big data?


• A term which refers to a large amount of data where the concept is related to
the characteristics of the data itself.
• Implies 5V

https://www.techentice.com/the-data-vera
city-big-data/
From Data Mining to Big Data
Mining: Examples
Big data mining

• referred to the collective data mining or extraction techniques that is performed on large volume of data or the
big data.

Goal

• to discover insights from the social media platforms (Instagram, Twitter, Facebook) with thousand of postings.

Classifying youth emotions based on Twitter data Sentiment analysis on reviews of Proton Cars in
Malaysia using Facebook postings
Conclusions

DATA MINING is simply…

Finds relationship
(that exist within the dataset)
and
makes prediction
References
1. Pang-Ning Tan, Michael Steinbach & Vipin Kumar, Introduction to Data Mining, Addison Wesley, 2019.
2. Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, 3rd Edition, Morgan Kaufmann,
2012.
3. Che D., Safran M., Peng Z. (2013) From Big Data to Big Data Mining: Challenges, Issues, and Opportunities.
In: Hong B., Meng X., Chen L., Winiwarter W., Song W. (eds) Database Systems for Advanced Applications.
DASFAA 2013. Lecture Notes in Computer Science, vol 7827. Springer, Berlin, Heidelberg.
https://doi.org/10.1007/978-3-642-40270-8_1
4. Razak, Z. I., & Mutalib, S. (2018). Web Mining In Classifying Youth Emotions. Malaysian Journal of
Computing, 3(1), 1-11.
5. Wah, Y. B., Abdullah, N., Abdul-Rahman, S., & Tan, M. L. P. (2018). text mining and sentiment analysis on
reviews of proton cars in malaysia. Malaysian Journal of Science, 37(2), 137-153.

You might also like