We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2
BIG DATA ANALYTICS
M.Tech II Semester : Computer Science & Engineering
Course code Category Hours/week Credits Maximum Marks 20CO201 PC L T P C CIA SEE TOTAL 4 0 0 4 40 60 100 Contact Classes:60 Tutorial Classes: 0 Practical Classes: 0 Total Classes:60 OBJECTIVES: The course should enable the students to: 1. Discuss the challenges traditional data mining algorithms face when analyzing Big Data. 2. Introduce the tools required to manage and analyze big data like Hadoop, NoSQL, Map, Reduce. 3. Teach the fundamental techniques and principles in achieving big data analytics with scalability and streaming capability using HIVE and PIG. 4. Introduce to the students several types of big data like social media, web graphs and data streams. 5. Enable students to have skills that will help them to solve complex real-world problems in for decision support. UNIT-I INTRODUCTION TO ANALYTICS Classes:12 Introduction to Analytics and R programming: Introduction to R, RStudio (GUI): R Windows Environment, introduction to various data types, Numeric, Character, date, data frame, array, matrix etc., Reading Datasets, Working with different file types .txt,.csv etc. Outliers, Combining Datasets, R Functions and loops. UNIT-II WORKING WITH R PROGRAMMING Classes:12 Manage your work to meet requirements: Understanding Learning objectives, Introduction to work & meeting requirements, Time Management, Work management & prioritization, Quality & Standards Adherence. Summarizing Data & Revisiting Probability (NOS 2101): Summary Statistics - Summarizing data with R, Probability, Expected, Random, Bivariate Random variables, Probability distribution. Central Limit Theorem etc. UNIT-III SQL USING R Classes:12 Work effectively with Colleagues (NOS 9002): Introduction to work effectively, Team Work, Professionalism, Effective Communication skills, etc. SQL using R: Introduction to NoSQL, Connecting R to NoSQL databases. Excel and R integration with R connector. UNIT-IV CORRELATION AND REGRESSION ANALYSIS Classes:12 Correlation and Regression Analysis: Regression Analysis, Assumptions of OLS Regression, Regression Modeling. Correlation, ANOVA, Forecasting, Heteroscedasticity, Autocorrelation, Introduction to Multiple Regression etc. UNIT-V UNDERSTAND THE VERTICALS Classes:12 Understand the Verticals - Engineering, Financial and others: Understanding systems viz. Engineering Design, Manufacturing, Smart Utilities, Production lines, Automotive, Technology etc. Understanding Business problems related to various businesses Text Books: 1. Student’s Handbook for Associate Analytics. 2. Time Series Analysis and Mining with R,Yanchang Zhao. Reference Books: 1. Introduction to Probability and Statistics Using R, ISBN: 978-0-557-24979-4, is a textbook written for an undergraduate course in probability and statistics. 2. An Introduction to R, by Venables and Smith and the R Development Core Team. This may be downloaded for free from the R Project website (http://www.r-project.org/, see Manuals). There are plenty of other free references available from the R Project website. 3. Montgomery, Douglas C., and George C. Runger, Applied statistics and probability for engineers. John Wiley & Sons, 2010 Web References: 1. http://anson.ucdavis.edu/~azari/sta137/AuNotes.pdf 2. https://www.edx.org/course/big-data-analytics-adelaidex-analyticsx 3. https://intellipaat.com/blog/big-data-tutorial-for-beginners/ 4. https://www.analyticsvidhya.com/blog/2015/.../big-data-analytics-youtube-ted-resourc... E-Text Books: 1. https://ndl.iitkgp.ac.in/ Outcomes: At the end of the course students able to 1. Identify the need for big data analytics for a domain. 2. Apply big data analytics for a given problem. 3. Suggest areas to apply big data to increase business outcome. 4. Use Hadoop, Map Reduce Framework handle massive data. 5. Introduce to the students several types of big data like social media, web graphs and data streams. 6. Enable students to have skills that will help them to solve complex real-world problems in for decision support.