0% found this document useful (0 votes)

9 views4 pages

CSC425 Data Mining

The document discusses various techniques for web usage mining, including association rules, clustering, and classification, which analyze user behavior on the web. It also explains the concept of a transfer function in neural networks, providing examples such as linear, sigmoid, and ReLU functions. Additionally, it differentiates between supervised and unsupervised learning, describes information retrieval in text mining, and highlights pitfalls in data mining.

Uploaded by

filee010

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views4 pages

CSC425 Data Mining

Uploaded by

filee010

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

2a. Web usage mining focuses on analyzing user behavior on the web.

List and explain briefly three (3) techniques that can be used to achieve
this.

Techniques for Web Usage Mining:

1. Association Rules – This technique discovers relationships between web pages visited
together frequently. It helps in identifying patterns in user navigation.
2. Clustering – Users with similar browsing behavior are grouped together to analyze trends. It
helps in personalization and recommendation systems.
3. Classification – This technique involves categorizing user behavior into predefined groups
based on their web activities. It helps in predicting future user actions.

2b(i). What is “Transfer Function” in a Neural Network model?

A transfer function in a neural network is a mathematical function that determines

how input data is transformed into output at each neuron. It helps in defining the
activation level of a neuron.

2b(ii). Give three (3) examples of a transfer function.

1. Linear Transfer Function – The output is directly proportional to the input.

2. Sigmoid Transfer Function – Produces an S-shaped curve and is commonly used in
classification problems.
3. ReLU (Rectified Linear Unit) Function – Outputs zero for negative inputs and the same
value for positive inputs, improving training efficiency.

2c(i). What is Information Retrieval in the context of text mining?

Information Retrieval (IR) is the process of obtaining relevant textual data from a
large collection of unstructured text. It helps in retrieving useful information from
databases, documents, or search engines.
2c(ii). Illustrate with a diagram, the general Information Retrieval
system architecture.

Your course manual contains a diagram of the General Information Retrieval

System Architecture, which includes:

 Document Collection (Source Data)

 Indexing System (Processing and Storage)
 Query Processor (User Interaction)
 Retrieval Engine (Matching and Ranking)
 User Interface (Results Display)

Refer to the diagram in your manual for the correct structure.

3a. Differentiate between Supervised and Unsupervised Learning in a

tabular form.

Aspect Supervised Learning Unsupervised Learning

Learn from labeled data to predict Identify patterns and structures in
Goal
outputs. data.
Uses labeled datasets (input-output Uses unlabeled datasets without
Data
pairs). predefined outputs.
Examples: Decision Trees, Neural
Examples: K-Means Clustering,
Algorithms Networks, Support Vector
DBSCAN, Hierarchical Clustering.
Machines.

3b. In a neural network training, when is a network said to be

Overfitting?

A neural network is overfitting when it learns the training data too well, including
noise and irrelevant details. This results in poor generalization to new data, meaning
the model performs well on training data but poorly on unseen data.
3c. State three (3) differences between Classification and Clustering.

Feature Classification Clustering

Assigns predefined labels to Groups data based on similarity
Definition
data. without predefined labels.
Supervision Supervised learning. Unsupervised learning.
Example Decision Trees, SVM, K-Means, DBSCAN, Hierarchical
Algorithms Neural Networks. Clustering.

4a. k-means is not a suitable algorithm for clustering alphabetic data.

Discuss.

 K-Means clustering is based on numerical distance measures (such as Euclidean distance),

which are ineffective for alphabetic data.
 Alphabetic data, such as words or text, do not have a natural numeric representation for
distance computation.
 Alternative approaches like Hierarchical Clustering or Latent Semantic Analysis (LSA)
are better suited for text data.

4b. List and discuss five (5) data mining pitfalls.

1. Overfitting – The model performs well on training data but fails on unseen data.
2. Ignoring Data Quality Issues – Poor data leads to inaccurate predictions.
3. Selection Bias – Using non-representative data can mislead conclusions.
4. Improper Feature Selection – Using irrelevant or redundant features reduces model
efficiency.
5. Misinterpretation of Results – Correlation does not imply causation, leading to incorrect
insights.
4c. Use the following statements to draw a neural network structure.

To complete this, refer to your course manual’s example of a Neural Network

Structure Diagram, which typically consists of:

 Input Layer (Features or Inputs)

 Hidden Layers (Processing Layers with Neurons)
 Output Layer (Final Decision or Classification)

Use the provided statements to accurately structure your diagram.

Unit 2 Advanced Concepts of Modeling in AI
50% (2)
Unit 2 Advanced Concepts of Modeling in AI
5 pages
DPWH Guidelines On Value Engineering PDF
100% (7)
DPWH Guidelines On Value Engineering PDF
103 pages
King's 2.0 - Iq
100% (1)
King's 2.0 - Iq
8 pages
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
DM VSAQ
No ratings yet
DM VSAQ
8 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Artificial Intelligence Algorithms
From Everand
Artificial Intelligence Algorithms
akosnemeth
No ratings yet
DWDM SR2
No ratings yet
DWDM SR2
21 pages
Mmds
No ratings yet
Mmds
12 pages
Updated_AAM_QB_(1)[1]
No ratings yet
Updated_AAM_QB_(1)[1]
6 pages
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
César Pérez López
No ratings yet
QUESTION BANK BCA_IDS
No ratings yet
QUESTION BANK BCA_IDS
3 pages
DM_QB
No ratings yet
DM_QB
3 pages
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
No ratings yet
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
31 pages
Data Analytics
No ratings yet
Data Analytics
6 pages
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
César Pérez López
No ratings yet
Data Analytics - Unit-1,2,3, & 4 questions - Assignment
No ratings yet
Data Analytics - Unit-1,2,3, & 4 questions - Assignment
6 pages
Iv Semester: Data Mining Question Bank: Unit 2 2 Mark Questions)
No ratings yet
Iv Semester: Data Mining Question Bank: Unit 2 2 Mark Questions)
5 pages
Data Science
No ratings yet
Data Science
13 pages
ADVANCE AIML CIE3 ANS
No ratings yet
ADVANCE AIML CIE3 ANS
5 pages
ML MQP1 Solved
No ratings yet
ML MQP1 Solved
22 pages
CS1004 DWM 2marks 2013
No ratings yet
CS1004 DWM 2marks 2013
22 pages
Book Exercises NayelliAnswers
No ratings yet
Book Exercises NayelliAnswers
3 pages
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
Mid Objective
No ratings yet
Mid Objective
5 pages
HW1
No ratings yet
HW1
4 pages
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
César Pérez López
No ratings yet
data mining Unitwise imp questions
No ratings yet
data mining Unitwise imp questions
3 pages
Datamining Quiz
No ratings yet
Datamining Quiz
173 pages
DWDM
No ratings yet
DWDM
18 pages
STAT243 Chapter 1 Tutorial Questions With Solutions_23
No ratings yet
STAT243 Chapter 1 Tutorial Questions With Solutions_23
3 pages
Unit 2
No ratings yet
Unit 2
57 pages
Data Mining and Warehousing
100% (3)
Data Mining and Warehousing
30 pages
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Important questions in WDM 9.2.25
No ratings yet
Important questions in WDM 9.2.25
4 pages
Stma Answers Set 1 - Converted
No ratings yet
Stma Answers Set 1 - Converted
3 pages
DWH-DM Assignment
No ratings yet
DWH-DM Assignment
5 pages
DWM Mid 2 Question Bank
No ratings yet
DWM Mid 2 Question Bank
5 pages
DWM IA-2 QB
No ratings yet
DWM IA-2 QB
10 pages
Datamining Bits
No ratings yet
Datamining Bits
16 pages
Questions and Answers[1]
No ratings yet
Questions and Answers[1]
7 pages
DM Shorts
No ratings yet
DM Shorts
2 pages
MLT Syllabus
No ratings yet
MLT Syllabus
3 pages
Consolidated Cse Question Bank1
No ratings yet
Consolidated Cse Question Bank1
170 pages
data analytics-1
No ratings yet
data analytics-1
21 pages
Grade 10 AI_Part -B_Unit-2 Notes
No ratings yet
Grade 10 AI_Part -B_Unit-2 Notes
3 pages
Data Mining and Warehousing (1)
No ratings yet
Data Mining and Warehousing (1)
7 pages
DM Question Bank
No ratings yet
DM Question Bank
5 pages
Question paper pattern_SET - Answer Key (1)
No ratings yet
Question paper pattern_SET - Answer Key (1)
23 pages
Wa0009.
No ratings yet
Wa0009.
4 pages
Exploring the World of Data Science and Machine Learning
From Everand
Exploring the World of Data Science and Machine Learning
NIBEDITA Sahu
No ratings yet
Datamining
No ratings yet
Datamining
3 pages
Basic Concepts in Data Structures
From Everand
Basic Concepts in Data Structures
K.Meenendranath Reddy
No ratings yet
Pyqp - Cs402-Qp-Jun21
No ratings yet
Pyqp - Cs402-Qp-Jun21
3 pages
Q.1. What Is Data Mining?
No ratings yet
Q.1. What Is Data Mining?
15 pages
Ranjit_jadhav Term Work Sem-II
No ratings yet
Ranjit_jadhav Term Work Sem-II
31 pages
Sample Question DMW
No ratings yet
Sample Question DMW
4 pages
Important Questions From All Units
No ratings yet
Important Questions From All Units
3 pages
ML CAT 1 sol (1)
No ratings yet
ML CAT 1 sol (1)
8 pages
Copy of Unit 2 Notes - Google Docs
No ratings yet
Copy of Unit 2 Notes - Google Docs
6 pages
Machine Learning ISA-2 Answer Bank
No ratings yet
Machine Learning ISA-2 Answer Bank
28 pages
CHAPTER 1 TOPIC 3 A Brief History of Motor Control and Motor Learning
No ratings yet
CHAPTER 1 TOPIC 3 A Brief History of Motor Control and Motor Learning
3 pages
Conduct of Remedial Classes During Summer For The K To 12 Basic Education Program
50% (2)
Conduct of Remedial Classes During Summer For The K To 12 Basic Education Program
8 pages
AAA Attendance Policy
No ratings yet
AAA Attendance Policy
3 pages
Module 4 - Company Situation Analysis
No ratings yet
Module 4 - Company Situation Analysis
58 pages
Part-Of-Speech Tagging: A Simple But Useful Form of Linguistic Analysis
No ratings yet
Part-Of-Speech Tagging: A Simple But Useful Form of Linguistic Analysis
18 pages
Cyber Laws in Pakistan
No ratings yet
Cyber Laws in Pakistan
28 pages
Journal Article Review by Gemma Rose T. Cunamay
No ratings yet
Journal Article Review by Gemma Rose T. Cunamay
3 pages
RK 3-Day Split, 2024
No ratings yet
RK 3-Day Split, 2024
3 pages
Service Manual L32B1120
No ratings yet
Service Manual L32B1120
57 pages
Simple machines
No ratings yet
Simple machines
3 pages
Cyber Crime and Punishments
No ratings yet
Cyber Crime and Punishments
19 pages
Android:An Open Handset Alliance Project
No ratings yet
Android:An Open Handset Alliance Project
23 pages
Acute Complications During Hemodialysis - UpToDate
No ratings yet
Acute Complications During Hemodialysis - UpToDate
19 pages
Conversations in English
No ratings yet
Conversations in English
41 pages
Chacarera Del Rancho - Flauta
No ratings yet
Chacarera Del Rancho - Flauta
1 page
Health Safety Health: Key Performance Indicator Month of October
100% (2)
Health Safety Health: Key Performance Indicator Month of October
1 page
b1 20 1
No ratings yet
b1 20 1
31 pages
PRESENTATION Chapter 13, 14, 15
No ratings yet
PRESENTATION Chapter 13, 14, 15
43 pages
Plasmolysis
No ratings yet
Plasmolysis
2 pages
Sarah Nordgren
No ratings yet
Sarah Nordgren
2 pages
Newsletter 2011 August 10
No ratings yet
Newsletter 2011 August 10
2 pages
Spinny Intro Deck - 2023
No ratings yet
Spinny Intro Deck - 2023
20 pages
Oop 1
No ratings yet
Oop 1
17 pages
F.J. S.J.: Shuqaiq 3 Independent Water Project
No ratings yet
F.J. S.J.: Shuqaiq 3 Independent Water Project
1 page
CH 2
No ratings yet
CH 2
4 pages
4.3.1 Journal - Your Susceptibility To Disease (Journal)
No ratings yet
4.3.1 Journal - Your Susceptibility To Disease (Journal)
3 pages
War in The Balkans - Richard C. Hall Edi PDF
75% (4)
War in The Balkans - Richard C. Hall Edi PDF
436 pages
Production Management - BBA Notes
50% (2)
Production Management - BBA Notes
6 pages

CSC425 Data Mining

Uploaded by

CSC425 Data Mining

Uploaded by

2a. Web usage mining focuses on analyzing user behavior on the web.

Techniques for Web Usage Mining:

2b(i). What is “Transfer Function” in a Neural Network model?

A transfer function in a neural network is a mathematical function that determines

2b(ii). Give three (3) examples of a transfer function.

1. Linear Transfer Function – The output is directly proportional to the input.

2c(i). What is Information Retrieval in the context of text mining?

Your course manual contains a diagram of the General Information Retrieval

 Document Collection (Source Data)

Refer to the diagram in your manual for the correct structure.

3a. Differentiate between Supervised and Unsupervised Learning in a

Aspect Supervised Learning Unsupervised Learning

3b. In a neural network training, when is a network said to be

Feature Classification Clustering

4a. k-means is not a suitable algorithm for clustering alphabetic data.

 K-Means clustering is based on numerical distance measures (such as Euclidean distance),

4b. List and discuss five (5) data mining pitfalls.

To complete this, refer to your course manual’s example of a Neural Network

 Input Layer (Features or Inputs)

Use the provided statements to accurately structure your diagram.

You might also like