2016 Emer Mid
2016 Emer Mid
1. **Flume** is used to ingest (transfer) data from RDBMS (Relational Database Management
System) to HDFS (Hadoop Distributed File System).
2. **Reactive** Artificial Intelligence systems do not store memories or past experiences for future
actions.
3. **Data stewards** hold the responsibility of ensuring the data are trustworthy, discoverable,
accessible, reusable and fit their purpose.
4. **Platform as a Service (PaaS)** is a sort of hardware architecture or software framework
(including application frameworks), that allows users to build intelligent applications.
### **Part IV: Answer the following questions accordingly (total 9 pts)**
1. **Clearly explain the relationship between Data science, Big Data and Hadoop? [2 pts]**
- **Data Science** is a multidisciplinary field that uses scientific methods, algorithms, and systems
to extract knowledge from data.
- **Big Data** refers to large volumes of data, often complex and from multiple sources, which
cannot be processed using traditional data processing tools.
- **Hadoop** is a framework used to store and process Big Data efficiently. Data Science utilizes Big
Data, and Hadoop is one of the main tools used in this process.
2. **What is the difference between Domain-Specific Expertise and Reasoning Machines? [2 pts]**
- **Domain-Specific Expertise** refers to AI systems designed to perform tasks in a specific area,
such as medical diagnosis or legal analysis.
- **Reasoning Machines**, on the other hand, can apply logical thinking and inference rules to
make decisions in a broader range of contexts.
3. **List and define characteristics of Big Data with appropriate example? [2.5 pts]**
- **Volume** – The amount of data is massive (e.g., social media data).
- **Velocity** – The speed at which new data is generated (e.g., real-time stock trading).
- **Variety** – Different types of data (e.g., videos, texts, audio).
- **Veracity** – Uncertainty or trustworthiness of data (e.g., user-generated content).
- **Value** – Useful insights extracted from data (e.g., customer behavior analysis).
---
If you'd like the answers formatted into a Word or PDF document, or need an answer key table, just
let me know!