0% found this document useful (0 votes)
32 views

Content of Data Analytics

Uploaded by

3131Nair Athira
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
32 views

Content of Data Analytics

Uploaded by

3131Nair Athira
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Unit Unit Description Weighta

s ge
1 Analytics Fundamentals 15%
1.1.
Data Analytics and Data Science: Introduction Characteristics and Need

1.2.
Attribute Measurement Levels: Ordinal, Nominal, Ratio, Interval

1.3.
Data Analytics Life Cycle : Discovery, Data Preparation, Model Planning, Model Building,
Communicate Results, Operationalize
1.4.
Targeted domains for data Analytics applications

2 Data Acquisition and Web Scrapping 20%


2.1. Needs and Process

2.2. Primary and Secondary Data Sources: Repositories and Approaches

2.3. Data Acquisition Techniques: Surveys, Dat a Scraping, Biometric Techniques, Sensing

2.4. Data Scraping Methods: Screen Scraping, Web Scraping, Report Mining

3 Data Transformation 15%


3.1. Needs and Impacts
3.2. Handling Missing Values: Removal, Imputations
3.3. Reshaping data frames and restructuring data
3.4. Feature engineering and extraction techniques
4 Data Representation and Visualization 20%
4.1. Essentials of Data Representation and Visualization
4.2. Data Summarization Approaches: Cross-Tabulation, Frequency, and Distribution
4.3. Data Description Measures: Central Tendency, Variations, Shape
4.5. Techniques for Data Visualization: Description, Outliers Identification, Normalization, Trend
Representation
4.6. Advanced Visualization Concepts: Heatmaps, Scatter Plots, Box Plots
5 Data and Text Processing 15%
5.1. Characteristics of Data and Text
5.2. Text Processing Applications: Summarization, Recommendation Systems
5.3. Introduction to Corpus and Dictionaries: Types and Usage
6 Document Representation 15%
6.1. Vocabulary of terms :Tokenization, Stopwords
6.2. Document Representation
Term Document Matrix, Inverted Index
6.3. Statistical Properties of Terms
6.4. Term Frequency and Weighting
6.5 Overview of Document Similarity

Text Books

1. Python for Data Analysis by Wes McKinney


2. Fundamentals of Data Visualization by Claus O. Wilke
3 An Introduction to Statistical Learning: with Applications in R by Gareth James et al.
References :
1. Introduction to Data Science" by Jeffrey Stanton and Robert De Graa
2. Web Scraping with Python" by Ryan Mitchel
3. Introduction to Information Retrieval by Christopher D. Manning
4. A Guide to Building Dependable Distributed Systems" by Ross Anderson

You might also like