TECH LAUNCH PROGRAM
DATA SCIENCE
Program Overview
✓ This intensive 90-day course provides a comprehensive journey through Data Science, incorporating Deep Learning,
SQL Databases, and Business Intelligence tools.
✓ The curriculum is divided into three phases, each building upon the previous knowledge.
Major Course Outcomes
Technical Skills Development:
✓ Proficiency in Python programming with focus on data science libraries
✓ Strong SQL database management capabilities
✓ Advanced data analysis using NumPy, Pandas, and statistical tools
✓ Deep learning and machine learning model development
✓ Expertise in BI tools (Tableau, Power BI)
Major Course Outcomes
Data Processing & Analysis
✓ Ability to clean, preprocess, and transform complex datasets
✓ Big data handling using tools like Spark
✓ Time series analysis and forecasting capabilities
Visualization & Reporting
✓ Creation of interactive dashboards
✓ Advanced data visualization techniques
✓ Enterprise-level BI implementation
✓ Custom visualization development
Major Course Outcomes
Machine Learning & AI
✓ Implementation of supervised and unsupervised learning algorithms
✓ Neural network architecture design and implementation
✓ CNN and RNN model development
✓ Transfer learning applications
Enterprise Integration
✓ Database optimization and performance tuning
✓ Security implementation and governance
✓ Server deployment and maintenance
✓ Enterprise-level system integration
Levels of Program
Phase 1 – Basic Level
– Python Basic For Data Science
– SQL Foundations
– Data Analysis using Python
– Business Intelligence Foundations
Phase 2 - Intermediate
– Advanced Data Analysis
– Advanced Business Intelligence
– Machine Learning Foundations
– Deep Learning Basics
Phase 1 – Basic Level
Python Basic For Data Science
Basic Operations & Functions
Installation and Setup
✓ Operators: Arithmetic, Comparison, Logical
✓ Install Python 3.x ✓ Input/Output: print(), input(), Formatted Output
✓ IDEs: VS Code / PyCharm Data Structures
✓ Using Jupyter Notebook ✓ Lists: Creation, Methods (append, extend), Slicing, Comprehensions
✓ Virtual environments (venv) ✓ Tuples: Immutable Lists, Named Tuples
✓ Dictionaries: Key-Value Operations, Methods, Comprehensions
✓ Package management with pip
✓ Sets: Unique Elements, Union/Intersection
Variables & Data Types
✓ Arrays: NumPy Basics
✓ Data Types: Int, Float, String, Boolean
Control Structures
✓ Type Conversion & Checking ✓ Conditionals: if, elif, else
✓ Variable Naming & Comments ✓ Loops: for, while, break, continue, Nested Loops
✓ String Operations: Formatting (f-strings), Methods (split, join), Slicing
✓ Numeric Operations: Arithmetic, Division (/, //), Modulo
Phase 1 – Basic Level
Python Basic For Data Science
Functions
✓ Definitions & Calls, Default/Keyword Arguments *args,
**kwargs
✓ Lambda Functions
✓ Scope & Lifetime of Variables Error Handling
Modules ✓ Try-Except, Exception Types
✓ Importing & Creating Modules ✓ Raising & Custom Exceptions
✓ Package Structure, Popular Modules ✓ Debugging Techniques
✓ Virtual Environments & pip Practice & Mini-Project
Object-Oriented Programming (OOP) ✓ Algorithms, Data Structures, OOP, Error Handling
✓ Classes & Objects: Definitions, Methods (Instance, Class, ✓ Build a Mini-Project combining concepts
Static)
✓ Inheritance: Single/Multiple, Overriding
✓ Encapsulation: Private/Protected Attributes, Property Decorators
✓ Polymorphism: Overloading, Duck Typing
Phase 1 – Basic Level
SQL Foundations
Database Concepts
✓ DBMS Overview: Relational Databases, Tables, Rows,
Columns
JOIN Operations
✓ Keys: Primary & Foreign Keys, Basics of Normalization
✓ Types: INNER, LEFT, RIGHT, FULL OUTER, CROSS,
SQL Installation Self JOIN
✓ Install MySQL/PostgreSQL ✓ Handling Multiple Tables
✓ GUI Tools: MySQL Workbench, pgAdmin ✓ Aggregation Functions
✓ Create and Set Up Sample Databases ✓ COUNT(), SUM(), AVG(), MAX(), MIN()
Basic Queries ✓ String Aggregations, Custom Aggregations
✓ SELECT Statements: Column Selection, Aliases GROUP BY & HAVING
✓ WHERE Clause: Comparison, Logical Operators, ✓ Single/Multiple Column Grouping
BETWEEN, IN, LIKE
✓ Aggregate Filtering
✓ ORDER BY: Ascending/Descending, Multiple Columns,
NULL Handling ✓ Complex Grouping Scenarios
Phase 1 – Basic Level
SQL Foundations
Subqueries
✓ Single & Multiple Row Subqueries
✓ Correlated Subqueries, EXISTS Operator
✓ CTEs (Common Table Expressions)
Window Functions
✓ Ranking: ROW_NUMBER(), RANK(), DENSE_RANK()
✓ LAG(), LEAD(), Partitioning, Moving Averages
Views & Stored Procedures
✓ Views & Materialized Views
✓ Stored Procedures & Functions
✓ Triggers Basics
Phase 1 – Basic Level
Data Analysis Tools
NumPy Array Operations Data Visualization
✓ Array Creation, Indexing & Slicing ✓ Matplotlib:
✓ Array Operations & Broadcasting ✓ Basic Plots: Line, Scatter, Bar, Histogram
✓ Plot Customization, Subplots, Saving
✓ Universal Functions
✓ Seaborn:
Pandas DataFrame
✓ Statistical Plots, Heatmaps
✓ Creation: From Files, Lists, Dicts
✓ Categorical & Regression Plots
✓ Operations: Column, Row, Indexing (loc, iloc) ✓ Style and Color
✓ Boolean Indexing & Selection Basic Charts
Data Cleaning ✓ Line, Bar, Pie, Box, Violin
✓ Missing Values Handling ✓ Correlation Matrices
✓ Removing Duplicates
✓ Data Type Conversion
✓ String & Date/Time Operations
✓ Data Validation
Phase 1 – Basic Level
Data Analysis Tools
Descriptive Statistics
✓ Central Tendency: Mean, Median, Mode
✓ Dispersion: Variance, Standard Deviation
✓ Distribution Shapes, Percentiles, Quartiles
Probability Basics
✓ Key Concepts, Random Variables
✓ Common Distributions
✓ Sampling Techniques
Statistical Testing
Hypothesis Testing:
✓ Null vs. Alternative
✓ Tests: t-tests, Chi-square
✓ Interpreting p-values
✓ Confidence Intervals
Phase 1 – Basic Level
Business Intelligence Foundations
✓ Power BI Desktop
✓ Interface Overview: Data, Model, and Report Views
Data Modeling
✓ Star Schema Design
✓ Relationships, Calculated Columns, Measures
DAX Basics
✓ Syntax and Common Functions
✓ Time Intelligence and Filter Context
Report Creation
✓ Visualizations: Charts, Tables, and Cards
✓ Filters, Interactions, and Formatting
✓ Publishing Reports
Phase 1 – Basic Level
Phase 1 Project & Assessments
Phase 1 Project
Database Design:
✓ Requirements Analysis, Schema Design, Table Assessment Criteria
Creation
✓ Code Quality, Problem-Solving, Documentation
✓ Data Population
✓ Presentation, Peer Review Feedback
Data Analysis:
Practical Assignments
✓ Exploratory & Statistical Analysis
✓ Daily Coding Exercises & Weekly Mini-Projects
✓ Data Preparation and Insights Generation
✓ SQL Query Challenges & Visualization Tasks
Dashboard Creation:
✓ Final Phase Project
✓ Planning, Visual Selection, Interactivity, Presentation
Learning Resources
✓ Documentation, Practice Datasets, Tutorials, Video
Courses, Forums
Phase 2 – Intermediate Level
Advanced Data Analysis
Complex Data Manipulation:
✓MultiIndex, Pivot Tables, Stack/Unstack
✓Advanced Grouping: Custom Aggregations, Rolling/Expanding Windows
✓Memory Optimization: Chunking, Optimized DataTypes
Time Series Analysis:
✓Basics: DateTime Indexing, Resampling, Rolling Stats
✓Seasonal Decomposition: Trend Analysis, Moving Averages
✓Time Zone Handling: Conversions, Localization
Data Preprocessing
✓ Cleaning: Outlier Handling, Feature Engineering
✓ Text Processing: String Operations, Regular Expressions
✓ Encoding: One-hot, Label, Target Encoding
✓ Scaling: StandardScaler, MinMaxScaler, RobustScaler
Phase 2 – Intermediate Level
Advanced Data Analysis
Database Optimization
✓ Query Optimization: Execution Plans, Query Profiling
✓ Design: Normalization, Denormalization, Partitioning
✓ Memory: Buffer Pool, Query Cache
Indexing
✓ Types: B-Tree, Hash, Full-Text
✓ Strategies: Composite, Covering, Index Hints
✓ Maintenance: Fragmentation, Rebuilding
Performance Tuning
✓ Query: JOIN/Subquery Optimization, WHERE Clause Tuning
✓ Server Config: Resource Allocation, Caching Strategies
✓ Monitoring: Performance Metrics, Slow Query Log
Phase 2 – Intermediate Level
Advanced Business Intelligence
Complex DAX
✓ Advanced Functions: Time Intelligence, Filter, Iterator Functions
✓ Context Manipulation: CALCULATE, CALCULATETABLE, Context Transition
✓ Calculations: Running Totals, YTD, Previous Period Comparison
Custom Visuals
✓ Development: Custom Visual Types, Interactivity, Formatting
✓ Python Integration: Script Visuals, Custom Visual Integration
✓ Advanced Visualizations: Matrix, Decomposition Tree, AI Visuals
Row-Level Security (RLS)
✓ Models: Static RLS, Dynamic RLS, Role Hierarchy
✓ Implementation: DAX Filters, Security Roles, Testing
Deployment
✓ Workspace Management, App Deployment, Gateway Configuration
Administration
✓ Capacity Management, User Management, Usage Monitoring
Sharing & Collaboration
✓ Report & Dashboard Sharing, App Distribution
Phase 2 – Intermediate Level
Machine Learning Foundations
Supervised Learning Dimensionality Reduction & PCA:
Linear Regression: ✓ PCA (Variance Explained, Component Selection)
✓ Simple & Multiple Regression (Feature Selection, Multicollinearity) ✓ Feature Selection (Filter, Wrapper, Embedded Methods)
✓ Regularization: Ridge, Lasso, Elastic Net Model Evaluation
Logistic Regression: Cross-Validation
✓ Binary & Multiclass Classification (One-vs-Rest, ROC Curve) ✓ Techniques: K-fold, Stratified K-fold, Leave-One-Out
✓ Model Evaluation: Precision-Recall, Confusion Matrix ✓ Time Series CV: Forward Chaining, Rolling Forecast
Decision Trees & Random Forests: ✓ Implementation: Validation Curves, Model Selection
✓ Tree Construction, Pruning, Splitting Criteria Metrics & Hyperparameter Tuning
✓ Random Forests: Ensemble Methods, Feature Importance Metrics:
Unsupervised Learning ✓ Classification: Accuracy, Precision, Recall, AUC-ROC
Clustering: ✓ Regression: MSE, RMSE, R-squared
✓ K-means (Elbow Method, Silhouette Analysis) Hyperparameter Tuning:
✓ Hierarchical (Agglomerative, Dendrograms), DBSCAN ✓ Methods: Grid Search, Random Search, Bayesian Optimization
Phase 2 – Intermediate Level
Deep Learning Basics
Deep Learning Frameworks
Neural Network Architecture
TensorFlow/Keras
✓ Components: Neurons, Layers, Weights, Biases
✓ TensorFlow Basics: Tensors, Variables, Operations
✓ Network Types: Feedforward, Convolutional, Recurrent
✓ Keras API: Sequential, Functional, Model Subclassing
Forward & Backward Propagation
✓ Model Building: Layers, Loss Functions, Optimizers
✓ Forward Propagation: Input Processing, Layer
Computation, Output Generation PyTorch Basics
✓ Backward Propagation: Chain Rule, Gradient Descent, ✓ Tensors, Autograd, Neural Network Modules
Learning Rate ✓ Model Development: DataLoader, Training Loops,
Activation Functions Evaluation
✓ Common: ReLU, Sigmoid, Tanh Simple Neural Networks
✓ Advanced: Leaky ReLU, ELU, SELU ✓ Implementation: Binary & Multiclass Classification,
Regression
✓ Training: Batch Processing, Epochs, Early Stopping
Phase 2 – Intermediate Level
Phase 2 Project Overview
ML Model Development Learning Resources
✓ Project Planning: Problem Definition, Data Collection, ✓ Technical Documentation, Research Papers, Online Courses
Model Selection
✓ Industry Case Studies, GitHub Repositories
✓ Implementation: Data Preprocessing, Model Training,
Evaluation Assessment Methods
BI Dashboard Integration ✓ Coding Assignments, Project Milestones, Peer Reviews
✓ Data Pipeline: ETL, Feature Engineering, Model ✓ Technical Presentations, Model Performance Metrics
Predictions Practical Applications
✓ Visualization: Interactive Dashboards, Real-Time Updates, ✓ Real-World Datasets, Industry Projects
Performance Metrics
✓ Kaggle Competitions, Portfolio Development
Database Integration
✓ Data Storage: Model Artifacts, Predictions, Metrics
✓ API Development: REST Endpoints, Real-Time Scoring,
Monitoring
Learning Resources & Tools Assessment Methods
✓ Enterprise tools documentation ✓ Code reviews
✓ Cloud platform documentation ✓ Architecture reviews
✓ Industry best practices
✓ Performance metrics
✓ Research papers
✓ Documentation quality
✓ Case studies
✓ Presentation skills
✓ Online courses
✓ Project completion
20
Best Practices
✓ Code quality standards
✓ Security guidelines
✓ Performance optimization
✓ Documentation standards
✓ Testing procedures
✓ Deployment protocols
21
Thank you