0% found this document useful (0 votes)
110 views4 pages

CSC649 Group Project and Presentation

The document outlines a group project for the course CSC649: Special Topics in Computer Science, focusing on data mining and related technologies. It details the project components, including a proposal, model/system development, report, and presentation, along with various data mining techniques and their applications. Students are required to work in groups, select unique topics, and utilize appropriate tools to analyze datasets with over 1000 records.

Uploaded by

2024793147
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
110 views4 pages

CSC649 Group Project and Presentation

The document outlines a group project for the course CSC649: Special Topics in Computer Science, focusing on data mining and related technologies. It details the project components, including a proposal, model/system development, report, and presentation, along with various data mining techniques and their applications. Students are required to work in groups, select unique topics, and utilize appropriate tools to analyze datasets with over 1000 records.

Uploaded by

2024793147
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

CSC649: Special Topics in Computer Science

Semester March 2025– August 2025

GROUP PROJECT (70%)

(5% PROPOSAL, 25% MODEL/SYSTEM, 25% REPORT & 15% SLIDES-PRESENTATION)

TASK in Data Mining/Machine Learning/Data Analysis/Game/ChatBot/Any New


technology skills

Data mining is used for various purposes, including:

1. Predictive Modeling: Predictive modeling involves using data mining techniques to


create models that can predict future outcomes based on historical data. This can
be used for forecasting customer behavior, predicting market trends, and
detecting fraud.
2. Market Basket Analysis: Market basket analysis involves analyzing customer
purchase patterns to identify which products are often purchased together. This
can be used to recommend products to customers or to optimize product
placement in stores.
3. Customer Segmentation: Customer segmentation involves dividing customers
into groups based on shared characteristics, such as age, gender, or purchasing
behavior. This can be used to tailor marketing campaigns and customer
experiences to specific groups.
4. Anomaly Detection: Anomaly detection involves identifying unusual patterns or
outliers in data. This can be used for fraud detection, fault detection, and quality
control.
5. Text Mining: Text mining involves extracting insights from unstructured data, such
as customer feedback, social media posts, and news articles. This can be used
for sentiment analysis, topic modeling, and content recommendation.
6. Recommendation Systems: Recommendation systems involve using data mining
techniques to suggest products or services to customers based on their past
behavior or preferences. This can be used for product recommendations on e-
commerce websites or for personalized content recommendations on streaming
platforms.
7. Image and Video Analysis: Image and video analysis involves using data mining
techniques to extract insights from visual data, such as identifying objects,
detecting faces, and tracking movements. This can be used for surveillance,
object recognition, and autonomous vehicles.
8. Customer Churn Analysis: Customer churn analysis involves predicting which
customers are likely to leave a company or service, based on their past behavior
and preferences. This can be used to identify customers who need additional
attention or to improve customer retention strategies.
9. Fraud Detection: Fraud detection involves identifying fraudulent activities, such
as credit card fraud or insurance fraud, based on patterns and anomalies in the
data.
10. Health Care Analytics: Health care analytics involves using data mining
techniques to analyze medical data, such as electronic health records, to identify
trends, patterns, and insights that can be used to improve patient care and
outcomes.

These are just a few examples of the purposes of data mining. With the increasing
availability of data and the development of more advanced data mining techniques, the
potential applications of data mining are constantly expanding.

1. Prediction
2. Association
3. Optimization
4. Clustering
5. Classification
6. Visualization
7. Others (new method or skill on current technology)

Instructions:

• Construct a group of maximum 4 members. Develop a data mining project based


on the group interest.
• Choose different technique and domain and data for each group. The title must
be unique and different from other groups.
• Get the Data Set (database, image, sounds, video, time series, text or other
relevant type).
• Choose one domain with any type of data set with more than 1000 – 5000
records.
• Tools: Can use Python/Rapidminer/ Tableau /Weka / Matlab / PowerBI or other
suitable tool or combination of tools and can use any available libraries in the
tools
Data mining / Data Analysis Technique

Here's a list of data mining techniques and algorithms:

1. Regression Analysis: Linear regression, logistic regression, polynomial


regression, ridge regression, Lasso regression, and elastic net regression.
2. Clustering: K-means, hierarchical clustering, DBSCAN, OPTICS, and fuzzy
clustering.
3. Decision Trees: Classification and regression trees (CART), Chi-square
Automatic Interaction Detection (CHAID), and Random Forest.
4. Neural Networks: Multilayer perceptron, convolutional neural networks,
recurrent neural networks, and deep belief networks.
5. Association Rules: Apriori, Eclat, and FP-growth.
6. Text Mining: Natural Language Processing (NLP), sentiment analysis, topic
modeling, and entity recognition.
7. Support Vector Machines: Linear SVM, polynomial SVM, radial basis function
(RBF) SVM, and support vector regression.
8. Time Series Analysis: Autoregression, moving average, ARIMA, seasonal
decomposition, and spectral analysis.
9. Principal Component Analysis: Singular value decomposition (SVD),
principal component analysis (PCA), and non-negative matrix factorization
(NMF).
10. Ensemble Methods: Bagging, boosting, and stacked generalization.
11. Deep Learning: Convolutional Neural Networks (CNN), Recurrent Neural
Networks (RNN), and Generative Adversarial Networks (GAN).
12. Bayesian Networks: Naive Bayes, Bayesian Belief Networks, and Dynamic
Bayesian Networks.
13. Genetic Algorithms: Genetic Programming and Evolutionary Strategies.
14. Collaborative Filtering: User-based, item-based, and matrix factorization.

These are just a few examples of data mining techniques and algorithms that are widely
used in industry and academia. The selection of a particular technique depends on the
specific problem being addressed and the characteristics of the dataset being
analyzed.

Any suitable technique can be found by searching, researching & reading literature
review of journals and relevant documents in the internet from the reliable sources.
Student must understand the concept of technique to be able to apply it to solve real
world problem.
Task/Assignment:

1. Prepare the proposal of the project: Introduction, Problem Statements,


Objectives, Scope, Significance – (Due: 28 April 2025)
2. Develop the system or algorithm (executable) with relevant GUI to show the
output
3. Prepare the final report on the project: Abstract, Introduction, Background
Studies, Objectives/Scope, Project Significance, Methodology/Experiment,
Discussion & Analysis, Conclusion (APA format – max 50 pages) – to be
submitted in Week 14
4. Conduct the presentation by each member of group (Power Point or Youtube
or video or other suitable app) – Week 14

References
www.datacamp.com
Intel corporation
Tableau
Rapidminer
WEKA
Tensorflow
Python
TED Talk
Journals (UiTM Library – online journals)
Other useful links

You might also like