0% found this document useful (0 votes)
4 views2 pages

Untitled Document - G

The document provides an introduction to Big Data and Data Science, outlining their meanings, challenges, and benefits. It covers the data science process, including goal setting, data preparation, and model building, as well as the features of different types of data. Additionally, it discusses the importance of data cleansing and integration, along with various transformation strategies.

Uploaded by

Sukhwinder Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views2 pages

Untitled Document - G

The document provides an introduction to Big Data and Data Science, outlining their meanings, challenges, and benefits. It covers the data science process, including goal setting, data preparation, and model building, as well as the features of different types of data. Additionally, it discusses the importance of data cleansing and integration, along with various transformation strategies.

Uploaded by

Sukhwinder Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

‭ ATA SCIENCE‬

D
‭USING PYTHON‬
‭SECTION-A‬
I‭ntroduction to Big data and Data Science:-‬‭Meaning of Big data and Data‬
‭Science, Challenges of big data, Relationship between Big Data and Data‬
‭Science, Benefits and uses of data science and big data.‬
‭Facts of data:-‬‭Structured versus Unstructured data,‬‭natural language,‬
‭machine-generated data, graph-based data, audio, image and video data.‬
‭Data Science Process:-‬‭Goal setting, retrieving data, data preparation , data‬
‭cleansing, data integration and transformation, exploratory data analysis, data‬
‭visualization, Model building and performance evaluation, presentation.‬
‭Data Set and its features:-Meaning of the terms:-‬‭observations and‬
‭variables, Discrete and continuous variables, quantitative and qualitative‬
‭variables, dependent and independent variables, variables classified on scale:‬
‭Nominal, Ordinal, Interval and Ratio variables.‬
‭Data Preparation:-‬‭Need for data preparation, Data‬‭cleansing, Methods of data‬
‭cleansing – data entry errors, sanity checks, outlier detection, treatment of‬
‭missing values, discrepancies in data, use of metadata, codes and rules. Data‬

‭SECTION-A‬
‭Integration, Types of data integration. Data Transformation strategies –‬
‭Normalization, Data Discretization and discretization methods.‬

You might also like