0% found this document useful (0 votes)
22 views6 pages

BDA Module

The document outlines key concepts and questions related to Big Data Analytics, Data Science, and their applications in various industries. It covers definitions, advantages, challenges, tools, and processes involved in analyzing large datasets. Additionally, it discusses the importance of data visualization and real-time processing in decision-making and operational efficiency.

Uploaded by

22btrcn117
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views6 pages

BDA Module

The document outlines key concepts and questions related to Big Data Analytics, Data Science, and their applications in various industries. It covers definitions, advantages, challenges, tools, and processes involved in analyzing large datasets. Additionally, it discusses the importance of data visualization and real-time processing in decision-making and operational efficiency.

Uploaded by

22btrcn117
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 6

BDA Module-1

2-MARK QUESTIONS (Short Answer)

1. Define Big Data Analytics.


2. List any two advantages of Big Data Analytics in business.
3. What are the 4 Vs of Big Data?
4. Mention any two challenges in Big Data Analytics.
5. What is the difference between structured and unstructured data?
6. Define Data Science.
7. What are the primary applications of Data Science?
8. List two commonly used Big Data Analytics tools.
9. What is the role of Hadoop in Big Data Analytics?
10. Define Machine Learning in the context of Data Science.
11. What is Apache Spark, and why is it used in Big Data?
12. Mention any two differences between Traditional Data Processing and Big Data
Analytics.
13. What is meant by Data-Driven Decision Making (DDDM)?
14. What is ETL (Extract, Transform, Load) in Data Analytics?
15. Name any two programming languages used in Data Science.

5-MARK QUESTIONS (Short Explanatory)

1. Explain the importance of Big Data Analytics in business and industry.


2. Describe the four Vs of Big Data with examples.
3. Compare Traditional Data Processing and Big Data Analytics.
4. Explain the key opportunities of Big Data Analytics in various industries.
5. Discuss any three challenges faced in Big Data Analytics.
6. Explain the role of Data Science in modern businesses.
7. What are the different phases in the Data Science Process?
8. Explain the concept of Real-time Big Data Processing.
9. Differentiate between Batch Processing and Stream Processing in Big Data.
10. How does Hadoop work in handling Big Data?
11. Discuss any three Big Data Analytics tools and their functionalities.
12. Explain how predictive analytics is used in businesses.
13. What is data visualization, and why is it important in Big Data Analytics?
14. Describe the role of Artificial Intelligence (AI) in Data Science.
15. Explain how Big Data Analytics helps in customer relationship management (CRM).

10-MARK QUESTIONS (Detailed Answer)

1. Explain in detail the importance of Big Data Analytics in business and industry with
examples.
2. Discuss the challenges and opportunities of Big Data Analytics in different sectors.
3. Compare Traditional Data Processing and Big Data Analytics in terms of speed,
scalability, and efficiency.
4. Explain the four Vs of Big Data and their impact on data processing and analysis.
5. Discuss the technologies and tools used in Big Data Analytics with real-world
applications.
6. Explain the Data Science Process with its key stages and workflow.
7. How do Data Science and Big Data Analytics contribute to decision-making in
organizations?
8. Describe the role of Hadoop, Spark, and NoSQL Databases in Big Data Analytics.
9. Explain the relationship between Machine Learning and Data Science with examples.
10. Discuss how Big Data Analytics is revolutionizing the healthcare and finance
industries.

Key Opportunities of Big Data Analytics in Various Industries

11. Healthcare:
1. Helps in predicting diseases and personalizing treatments.
2. Tracks patient health through wearable devices.
12. Retail:
1. Understands customer preferences for better product recommendations.
2. Improves inventory management and pricing strategies.
13. Banking and Finance:
1. Detects fraud and manages risks.
2. Offers personalized financial services to customers.
14. Manufacturing:
1. Enhances production efficiency and reduces downtime.
2. Predicts equipment failures through data monitoring.
15. Transportation:
1. Optimizes routes and reduces fuel costs.
2. Improves traffic management and safety.
16. Education:
1. Analyzes student performance for personalized learning.
2. Helps institutions improve teaching methods.
17. Entertainment and Media:
1. Recommends content based on user preferences.
2. Analyzes audience behavior for better marketing.
18. Agriculture:
1. Monitors crop health and weather patterns.
2. Improves yield through data-driven decisions

Three Challenges Faced in Big Data Analytics (5 Marks)

1. Data Quality:
o Data may be incomplete, duplicate, or incorrect.
o Poor quality data leads to wrong analysis and decisions.
o Cleaning and organizing data takes time and effort.
2. Data Security and Privacy:
o Protecting sensitive data from hackers is difficult.
o Companies must follow privacy laws to keep data safe.
o Data breaches can harm a company’s reputation.
3. Storage and Management:
o Storing large amounts of data is expensive.
o Managing data from different sources is challenging.
o Ensuring easy access to stored data is difficult.
Different Phases in the Data Science Process (5 Marks)

4. Problem Definition:
o Understand the business problem and set clear goals.
o Know what the company wants to achieve with data.
5. Data Collection:
o Gather data from various sources (websites, databases, sensors).
o Collect both structured and unstructured data.
6. Data Cleaning and Preparation:
o Remove errors, duplicates, and missing values from data.
o Convert data into a usable format for analysis.
7. Data Analysis and Modeling:
o Analyze data to find patterns and insights.
o Build models using machine learning and statistical methods.
8. Model Evaluation:
o Test the model’s accuracy and performance.
o Make sure the model works well with new data.
9. Deployment:
o Implement the model in real-world business operations.
o Use it to make data-driven decisions.
10. Monitoring and Maintenance:
o Check the model regularly to keep it accurate.
o Update it when data or business needs change.

Concept of Real-time Big Data Processing (5 Marks)

 Real-time Big Data Processing means analyzing data instantly as it is generated.


 It helps businesses make quick decisions without delays.
 Data is processed in seconds or milliseconds to provide immediate insights.
 It is useful in situations where fast actions are needed, like fraud detection or traffic
updates.
 Tools like Apache Spark, Apache Storm, and Kafka are used for real-time
processing.

Examples:

 Banking: Detects fraud as soon as a suspicious transaction occurs.


 E-commerce: Shows product recommendations while customers browse.
 Transportation: Provides live traffic updates for route planning.
Hadoop is an open-source framework that helps store and process large amounts of data
across many computers. It works through two main components:

1. HDFS (Hadoop Distributed File System):


o Stores big data by breaking it into smaller chunks (blocks).
o These chunks are distributed across different computers (nodes).
o Data is replicated (copied) to avoid loss if a computer fails.

2. MapReduce:
o Processes data in two steps:
 Map: Breaks down tasks into smaller ones and processes data in
parallel.
 Reduce: Combines the results to get the final output.
o This method ensures fast and efficient processing.

Three Big Data Analytics Tools and Their Functionalities:

1. Hadoop:
o Function: Stores and processes large datasets across multiple computers.
o Key Features:
 HDFS: Breaks data into blocks and stores them across nodes with
copies for safety.
 MapReduce: Processes data in parallel for faster results.
 Scalable: Can easily add more computers to handle more data.
o Use Case: Analyzing social media data or website logs.
2. Spark:
o Function: Processes data faster than Hadoop, supporting real-time analytics.
o Key Features:
 In-memory processing: Data is processed in RAM, making it faster.
 Supports multiple languages: Works with Python, Java, and Scala.
 Real-time streaming: Handles live data streams like stock prices or
tweets.
o Use Case: Fraud detection and real-time customer analytics.
3. Tableau:
o Function: Visualizes data through charts, graphs, and dashboards.
o Key Features:
 Drag-and-drop interface: Easy for users to create visual reports.
 Connects to multiple data sources: Works with Excel, databases, and
cloud data.
 Interactive dashboards: Users can explore and filter data easily.
o Use Case: Business sales reports and marketing analysis.
 Customer Purchases:

 Predicts what customers will buy next.


 Example: Online stores suggest products based on past shopping.

 Risk Detection:

 Finds possible risks like fraud or missed payments.


 Example: Banks check if a loan might not be paid back.

 Stock Management:

 Predicts how many products will be needed.


 Example: Stores order the right amount of items to avoid waste.

 Better Marketing:

 Shows which people might like a sale or ad.


 Example: Companies send discounts to interested customers.

 Machine Repair Alerts:

 Tells when a machine might break.


 Example: Factories fix machines early to avoid problems
What is Data Visualization?

 Data Visualization is the process of showing data in graphs, charts, maps, or


images.
 It helps people see patterns and trends easily.

Why is Data Visualization Important in Big Data Analytics?

1. Easier to Understand Data:


o Complex data becomes simple and clear.
o Example: A sales chart shows which months had higher sales.
2. Quick Decision-Making:
o Visuals help spot problems or opportunities fast.
o Example: Companies can see sales drops and act quickly.
3. Better Communication:
o Makes it easier to explain data to others.
o Example: Managers use charts to present results in meetings.
4. Identifies Trends and Patterns:
o Helps find hidden trends in large data sets.
o Example: Social media graphs show popular topics.
5. Improves Engagement:
o People pay more attention to visuals than plain text.
o Example: Infographics are more interesting than long reports.

You might also like