Advanced Analytics - Course Outline
Advanced Analytics - Course Outline
Course Objective
This course is planned to provide students with:
Given the large amounts of unstructured data flooding the Internet, mining high-quality information
from text and web becomes increasingly critical. The actionable knowledge extracted from text data
facilitates effective decision making in a broad spectrum of areas, including business intelligence,
information acquisition, social behaviour analysis and strategization. This course will cover important
topics in text mining, web mining and image mining leading to text and web analytics. Students will
also be exposed to use of software for text and web mining. The course places emphasis on the use
of techniques for different aspects of text and web mining and strategization based on the mining
results. In general, exposure to each topic will be supported by a real life case study. In order to
make the course more relevant, and practice oriented, participants will be using the very popular
analytics package, WEKA and applying the techniques on different corpora (text databases).
Course Outline
Text Summarization
12 Chapter 11.2 Source Code summarization
techniques
Text Summarization
13
Techniques (Contd.)
Analyzing e-Commerce website -
14 Introduction to web mining Hand-out
Flipcart
Effectiveness of web search
15 Web structure mining Chapter 8
engines
Case – Analyzing Customer
16 Web content mining Chapter 9 and 10 Reviews (audio)
Making automated
17 Web usage mining Chapter 12 recommendations and
personalization
20 Project Presentations
1. Farhad Soleimanian Gharehchopogh and Zeinab Abbasi Khalifehlou, Analysis and evaluation
of unstructured data: Text mining versus natural language processing; DOI:
10.1109/ICAICT.2011.6111017 · Source: IEEE Xplore
2. Andreas Hotho, A Brief Survey of Text Mining, available at http://www.kde.cs.uni-
kassel.de/hotho/pub/2005/hotho05TextMining.pdf
3. James Pustejovsky and Branimir Boguraev “Lexical knowledge representation and natural
language processing”, Artificial Intelligence 63 (1993) 193-223
5. Nan Li a, Desheng Dash Wu, Using text mining and sentiment analysis for online forums
hotspot detection and forecast
6. Rayid Ghani et. al. Text Mining for Product Attribute Extraction
10. Rudy Prabowo and Mike Thelwall, Sentiment analysis: A combined approach, Journal of
Informetrics · March 2013
11. Alexander Pak and Patrick Paroubek, Twitter as a Corpus for Sentiment Analysis and Opinion
Mining, available at http://lexitron.nectec.or.th/public/LREC-010_Malta/pdf/385_Paper.pdf
12. Anurag Bejju, Sales Analysis of E-Commerce Websites using Data Mining Techniques,
International Journal of Computer Applications (0975 – 8887) Volume 133 – No.5, January
2016
13. Bernard J. Jansen, Paulo R. Molina, The effectiveness of Web search engines for retrieving
relevant ecommerce links, Information Processing and Management 42 (2006) 1075–109
14. Nadine Höchstötter and Dirk Lewandowski What users see – Structures in search engine
results pages, Information Sciences 179 (2009) 1796–1812
15. Khribi, M. K., Jemni, M., & Nasraoui, O. (2009). Automatic Recommendations for E-Learning
Personalization Based on Web Usage Mining Techniques and Information Retrieval.
Educational Technology & Society, 12 (4), 30–42.
17. Badrish Chandramouli, Jonathan Goldstein, Songyun Duan, Temporal Analytics on Big Data
for Web Advertising
18. Antonio Plaza et al, Recent advances in techniques for hyperspectral image processing,
Remote Sensing of Environment 113 (2009) S110–S122
Learning Outcomes
At the end of this course, a participant should be able to
Mine and analyse the data from text data, web data and image data
Identify the appropriate techniques for analysing the data drawn from text, web and image
sources
Extract sentiment from social media and strategize based on sentiment analysis
Use open source mining software package
Strategize based on the analysis of unstructured data
Evaluation: