Uploaded by Suhani Modi


Question 1: Fill in the Blanks

1. Additional Vs related to Big Data analysis: Variability, Visualization, Viability

2. The object is the first-class citizen in the object-oriented paradigm.

3. In Big Data analysis, SPAS stands for: Scalability, Performance, Availability, Security

4. The two main components the Hadoop ecosystem initially relied on: HDFS, MapReduce

Question 2: True/False

1. True - The reduce phase runs after the map outputs have been sorted by key.

2. True - Pattern extraction and decision-making occur in the Intelligent phase.

3. True - Pig enables complex job creation in Hadoop.

4. True - NoSQL systems are suited for real-time applications and may
support SQL-like languages.

5. False - Network bandwidth depends on node proximity: nodes on the same rack have higher bandwidth between them than nodes on different racks.
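The sort-then-reduce flow in item 1 can be illustrated with a toy word count in plain Python; this is only a sketch of the map → sort → reduce pipeline, not real Hadoop code, and the input lines are invented for the example:

```python
from itertools import groupby
from operator import itemgetter

# Map phase: emit (word, 1) pairs from each input line.
lines = ["big data", "big deal"]
mapped = [(word, 1) for line in lines for word in line.split()]

# Shuffle/sort: Hadoop sorts map output by key before any reducer runs.
mapped.sort(key=itemgetter(0))

# Reduce phase: sorting has grouped equal keys, so each group is summed once.
counts = {key: sum(v for _, v in group)
          for key, group in groupby(mapped, key=itemgetter(0))}
```

Because the pairs are sorted first, each reducer sees all values for one key contiguously, which is exactly why the reduce step only starts after the sort.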

Question 3: MongoDB Insertion and Command Result


1. Insert command for new record:

db.city.insertOne({ _id: 102, city: "TORONTO", passengers: 800 })

2. Result of db.city.find({ name: { $regex: "(?i)t(?i)oronto" } }):

• It retrieves all documents whose name field matches “toronto” case-insensitively, including Toronto, toronto, and TORONTO.
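The effect of the inline (?i) flag can be sketched with Python's re module standing in for MongoDB's regex engine (in Python the flag must lead the pattern, so the cleaner form "(?i)toronto" is used here):

```python
import re

# "(?i)" at the start of the pattern turns on case-insensitive matching,
# so every capitalization variant of the name matches.
pattern = re.compile(r"(?i)toronto")

variants = ["Toronto", "toronto", "TORONTO", "Ottawa"]
matches = [v for v in variants if pattern.search(v)]
```

All three spellings of Toronto match while "Ottawa" does not, mirroring the behavior described for the find() query.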

Question 4: Difference between NFS and HDFS

• NFS (Network File System): A protocol for sharing files over a local network, typically from a single server; it lacks distributed fault tolerance.

• HDFS (Hadoop Distributed File System): Designed for distributed data storage; it includes redundancy (block replication) for fault tolerance and is optimized for large-scale data processing across clusters.

Question 5: Java Class Tester.java

1. Pig Command to Register the Jar:

REGISTER 'NumTester.jar';

2. Output of Code:

• It outputs True or False for each value of name, indicating whether that value is a prime number.
3. Functionality of Tester.java:

• It checks if an integer is a prime number and returns a boolean.
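The behavior described for Tester.java can be sketched in Python; since the original source is not shown, this is a hypothetical stand-in for the UDF's logic, not the actual class:

```python
def is_prime(n: int) -> bool:
    """Return True if n is prime, mirroring the behavior described for the UDF."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:  # trial division up to sqrt(n) is sufficient
        if n % i == 0:
            return False
        i += 1
    return True
```

Registered as a Pig UDF, a function with this logic would emit True/False for each input value, as described above.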

Question 6: Using HIVE

1. Create tables for documents:

CREATE TABLE doc1 (line STRING);
CREATE TABLE doc2 (line STRING);

2. Load documents into tables:

LOAD DATA INPATH 'hdfs_path/doc1.txt' INTO TABLE doc1;
LOAD DATA INPATH 'hdfs_path/doc2.txt' INTO TABLE doc2;

3. Create a table with the common words in doc1 and doc2. The tables store whole lines, not words, so each line must first be split into words (INTERSECT requires Hive 2.3+):

CREATE TABLE common_words AS
SELECT explode(split(line, ' ')) AS word FROM doc1
INTERSECT
SELECT explode(split(line, ' ')) AS word FROM doc2;

Question 7: HDFS Command Explanation

• The command hadoop fs -setrep 2 -R -w /user/hadoop/ sets the replication factor of the files under /user/hadoop/ to 2; -R applies the change recursively, and -w makes the command wait until replication has completed.

Question 8: Improving GenAI Models with Big Data

• Big Data analysis enables enhanced model training, data-diversity management, and scalable data pipelines for continuous model learning and improvement. Training on extensive, diverse datasets helps refine these models, improving context understanding and performance.

Question 9: Apriori Algorithm for Association Rules

1. Steps:

• Pass 1: Identify frequent items meeting the support threshold.

• Pass 2: Combine frequent items from Pass 1 into pairs and test their frequency.

• Subsequent passes: Continue growing the itemset size until no additional frequent itemsets meet the support threshold.

2. Association Rules:

• Rules such as {bread, milk} -> butter with 75% confidence are derived by computing confidence = support({bread, milk, butter}) / support({bread, milk}).
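The confidence calculation can be sketched in Python; the transaction set below is invented purely to reproduce the 75% figure from the example rule:

```python
# Hypothetical transactions chosen so that confidence({bread, milk} -> butter) = 0.75.
transactions = [
    {"bread", "milk", "butter"},
    {"bread", "milk", "butter"},
    {"bread", "milk", "butter"},
    {"bread", "milk"},
]

def support(itemset, txns):
    # Fraction of transactions that contain every item in the itemset.
    return sum(itemset <= t for t in txns) / len(txns)

# confidence(X -> Y) = support(X ∪ Y) / support(X)
confidence = (support({"bread", "milk", "butter"}, transactions)
              / support({"bread", "milk"}, transactions))
```

Here support({bread, milk, butter}) = 3/4 and support({bread, milk}) = 4/4, giving a confidence of 0.75.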
