0% found this document useful (0 votes)
45 views2 pages

Bda Imp

This document outlines topics for study modules 3 and 5 related to big data and machine learning. Module 3 focuses on NoSQL databases, distributed systems, MongoDB, and Cassandra. Module 5 covers regression analysis, ANOVA, KNN, association rule mining using the Apriori algorithm, text mining, web mining, page ranking, and naive Bayes classification. Key topics include database characteristics, CAP theorem, file formats, graph databases, and configuration commands.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
45 views2 pages

Bda Imp

This document outlines topics for study modules 3 and 5 related to big data and machine learning. Module 3 focuses on NoSQL databases, distributed systems, MongoDB, and Cassandra. Module 5 covers regression analysis, ANOVA, KNN, association rule mining using the Apriori algorithm, text mining, web mining, page ranking, and naive Bayes classification. Key topics include database characteristics, CAP theorem, file formats, graph databases, and configuration commands.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

BDA IMP

Mod – 3
1. NoSQL along with issues, DataStore characteristics and features of NoSQL
transaction
2. Base, CAP and ACID properties
3. Table 3.1 ( from ppt – NoSQL datastore, in TextBook – egs of widely NoSQL
datastore)
4. Characteristics of scheme less model (final exam)
5. Explain any 2 out of KeyValue, Document, Graph, Tabular
6. R and C, ORC and Parquet file format ( introduction, digram, issues and
limitations)
7. Graph DataStore ( explaination, digram, uses and limitations)
8. Characteristics of Big Data NoSQL solution
9. Shared Nothing
10. 4 Distribution Model
11. Ways of Handling Big Data Problem (Final exam)
12. MongoDB and Cassandra (Either one)
13. Mongo DB Commands ( if only commands then write queries, if with
example then write create, show, drop etc)
14. Characteristics and features of Cassandra
15. Consistency level configuration command ( Final exam)
16. Keyspaces ( Final exam)
17. CQL

Mod – 5
1. Regression Analysis
2. ANOVA
3. KNN
4. Frequency Itemset and Association rules
5. Apriori Algorithm and the problem
6. Text Mining ( 5 phases)
7. Web Mining ( along with Web usage mining)
8. Page Ranking ( VIMP)
9. Support Vending Machine and Naive Bayes (final)

You might also like