DataScience Path Guide

Uploaded by

Ashish Guleria
Copyright
© All Rights Reserved
Data Science

Sr. No. | Module Name | Hours
1 | SQL Databases | 24
2 | Python Programming Basics | 32
3 | Python Packages | 12
4 | Data science fundamentals | 32
5 | Machine Learning | 16
6 | Natural Language Processing | 12
7 | Data visualization and reporting | 20
8 | AWS data services | 20
9 | AWS Data Analytics Fundamentals | 12
  | Total | 180

Module 1 SQL Databases

Introduction
• Introduction to SQL and RDBMS
• Database architecture and instance of SQL server
• Types of SQL commands
• Installation of MySQL Workbench

Data retrieval from single table
• Select, Select distinct
• Where
• Clauses in where statement: AND/OR, IN, Between, Like, Is null, Is not null
• Aggregate functions: min, max, avg, count, sum
• Order by, limit, offset
• Group by, having, group_concat
• Date-time functions: curdate(), year(), day(), week()
• Cast and convert function
• If statement
• Case statement

Data retrieval from multiple tables
• Joins: inner join, left outer join, right outer join, full outer join
• Union, Union all
• Subqueries

Data definition language
• Database design steps: conceptual design, ER model, design schema
• Data types in SQL
• Constraints in SQL
• Create database
• Create table
• Insert data
• Update and Alter command
• Truncate, delete, drop command
• Index, Unique index

Advanced SQL concepts
• Common table expression (CTE)
• Rank functions
• View
• Stored procedure
• Trigger

Module 2 Python Introduction

Sr. No. | Contents

1 | An Overview of Python
    What is Python
    How Python is different from other languages
    Editors for Python

2 | Variables, Operators and Data Types
    Variables
    How to define and use a variable
    Naming conventions
    Operators: assignment, mathematical, conditional, logical
    Data types: numbers, string, list, tuple, dictionary
    Find the data type of a variable
    Change the data type of a variable

3 | Control Statements
    Conditional control: if, if-else, if-else ladder, nested if
    Looping: for loop, range function, while loop, nested loop
    Break, continue, else statement

4 | Functions
    Syntax of function definition
    Formal parameters
    Global versus local variables
    Passing parameters and returning values
    Understanding argument matching basics
    Importing functions from different modules
    Creating a customized module and importing it

5 | Lists
    Defining a list
    Accessing elements: indexing of elements
    List slicing
    Modifying an element
    Adding an element: append, insert
    Deleting an element: del, remove, pop
    Modifying a list: sort, reverse
    Merging two lists

6 | Tuples
    Defining a tuple
    How it is different from a list
    How to access elements
    Inbuilt functions: max, min, etc.
    Merging two tuples

7 | Dictionary
    Concept of dictionary
    Defining a dictionary
    Accessing elements
    Inbuilt functions: keys(), values(), items(), etc.

8 | File Handling
    Files, file operations
    Files and streams
    Creating a file
    Reading from a file
    Iterating through files
    Writing a file
    Serialization

9 | Object Oriented Programming with Python
    Learning Python classes
    Inheritance
    Operator overloading
    Class methods, polymorphism
    Constructors, destructors

10 | Exception Handling and Regular Expressions
    Try, except, finally
    Defining regular expressions
    Compiling regular expressions
    Using regular expressions
    Using the match operation to extract a value
    Extracting multiple items
    Replacing multiple items

11 | Working with Databases
    How to use a database
    Working with a database
    Using SQL to query a database
    Python and SQLite
    Creating an SQLite DB
    Pulling data from a DB
    SQLite database files

12 | Multithreading in Python
    Starting a new thread
    Threading module
    Synchronizing threads
    Multithreaded priority queue

13 | API Integration with Python
    What is an API
    Integration with an API using the request module
    Commands for GET, PUT, UPDATE, DELETE, POST requests

14 | Python Packages
    SSH, Telnet, Request libraries
    Creating a package / module in Python

Module 3 Python Packages

04 NumPy
    Introduction to NumPy
    Create NumPy arrays
    NumPy operations
    NumPy for basic vector arithmetic
    NumPy for basic matrix arithmetic
    Broadcasting with NumPy
    Solve equations with NumPy
    NumPy for statistical operations

02 Pandas
    Read data
    Read in CSV data using Pandas
    Read in Excel data using Pandas
    Reading in JSON data
    Read in HTML data

Scikit-learn - Predictive modeling pipelining
    What is scikit-learn?
    Components of scikit-learn
    Scikit-learn for data preprocessing
    Treating missing values
    Treating outliers
    Feature engineering
    Dimensionality reduction

Matplotlib
    Load data
    Line chart
    Multi line plot
    Fill plot
    Bar chart
    Pie chart
    Grid
    Save image

Module 4: Data Science Fundamentals

Topics
Data Pre-processing: Descriptive Data Summarization, Data Cleaning, Data Integration and Transformation (ETL tools like SSIS, Talend, etc.), Data Reduction, Data Discretization
Regression: Differentiating algorithmic and model based frameworks, Regression Ordinary Least Squares, Ridge Regression, Lasso Regression, Logistic Regression, K Nearest Neighbours
Classification: Basic issues regarding classification and prediction, Decision Tree, Bayesian classification, K Nearest Neighbour Algorithm, Associative classification, Statistical-Based Algorithms, Rule-Based Classification
Clustering: Similarity and Distance Measures, Hierarchical Algorithms, Partitioned Algorithms, Clustering with Categorical Attributes
Descriptive and Inferential Statistics: Descriptive Statistics, Inferential Statistics through hypothesis tests

Module 5: Machine Learning

Topics
Machine Learning Foundations: Design of a Learning system - Types of machine learning, Applications of machine learning
Supervised Learning: Regression algorithms, Classification algorithms
Unsupervised Learning: Clustering algorithms
Dimensionality Reduction: Introduction, Feature Selection and Feature Extraction, Principal Component Analysis, Decision Tree algorithms, Ensemble methods

Module 6: Data Visualization & Reporting

Data Visualization tools and techniques
Tableau: Data type, file type, tool type, Show Me menu, types of data source supported, how to connect different data sources, edit metadata, filter fields, filter data source, types of charts, filter data, data joining, data blending, extract data, adding filter data, apply filter on chart and data, number functions, string functions
Power BI: Components of Power BI, designing tables and reports, preparing dashboards

Module 7: Natural Language Processing

Topics
Introduction to NLP
NLP with probabilistic models: Autocorrect, Part of Speech Tagging and Hidden Markov Models, Autocomplete and Language Models, Word embeddings with neural networks
NLP with sequence models: Neural Networks for Sentiment Analysis, Recurrent Neural Networks for Language Modeling, LSTMs and Named Entity Recognition, Siamese Networks
NLP with attention models: Neural Machine Translation, Text Summarization, Question Answering, Chat bot
NLP with classification and vector spaces: Sentiment Analysis with Logistic Regression, Sentiment Analysis with Naive Bayes, Vector Space Models, Machine Translation and Document Search

Module 8: AWS Database service

Topics

AWS Well-Architected Framework
• AWS Well-Architected Framework overview
• AWS Well-Architected Framework and the six pillars
• AWS Well-Architected Tool

Data Types
• Understanding your data

AWS Database Services
• Database services offered by AWS

Amazon Relational Databases
• Relational database introduction
• Amazon Relational Database Service (Amazon RDS)
• Amazon Aurora

AWS Nonrelational Databases
• Nonrelational database introduction
• Amazon DynamoDB
• Amazon Keyspaces (for Apache Cassandra)
• Amazon DocumentDB
• Amazon MemoryDB for Redis
• Amazon Timestream
• Amazon Quantum Ledger Database (Amazon QLDB)
• Amazon ElastiCache
• Amazon Neptune

Data Access and Analysis
• Data access and analysis introduction
• Amazon Redshift
• Amazon Athena

Choosing the Right Database
• Choosing the right database activity

AWS Migration Tools
• Database migration

Database Architecture
• Architecture overview
• Server-based architecture
• Serverless architecture

Module 9: AWS Data Analytics Fundamentals

Topics

Introduction to data analysis solutions
- Data analytics and data analysis concepts
- Introduction to the challenges of data analytics

Volume - data storage
- Introduction to Amazon S3
- Introduction to data lakes
- Introduction to data storage methods

Velocity - data processing
- Introduction to data processing methods
- Introduction to batch data processing
- Introduction to stream data processing

Variety - data structure and types
- Introduction to source data storage
- Introduction to structured data stores
- Introduction to semistructured and unstructured data stores

Veracity - cleansing and transformation
- Understanding data integrity
- Understanding database consistency
- Introduction to the ETL process

Value - reporting and business intelligence
- Introduction to analyzing data
- Introduction to visualizing data
