2016 Emer Mid

The document includes fill-in-the-blank exercises and questions related to data science, big data, and artificial intelligence. Key concepts covered include the roles of Flume and Hadoop, characteristics of big data, and the distinction between domain-specific expertise and reasoning machines. Additionally, it addresses the goals of AI and justifies why computers are considered programmable devices.

Uploaded by

masreshame34

### **Part III: Fill in the blank space provided with correct words or phrases [each 1 pt]**

1. **Sqoop** is used to ingest (transfer) data from RDBMS (Relational Database Management
System) to HDFS (Hadoop Distributed File System). Flume, by contrast, collects streaming data such as logs.

2. **Reactive** Artificial Intelligence systems do not store memories or past experiences for future
actions.

3. **Data curators** hold the responsibility of ensuring the data are trustworthy, discoverable,
accessible, reusable and fit their purpose.

4. **An AI platform** is a sort of hardware architecture or software framework
(including application frameworks), that allows users to build intelligent applications.

### **Part IV: Answer the following questions accordingly (total 9 pts)**

1. **Clearly explain the relationship between Data science, Big Data and Hadoop? [2 pts]**
- **Data Science** is a multidisciplinary field that uses scientific methods, algorithms, and systems
to extract knowledge from data.
- **Big Data** refers to large volumes of data, often complex and from multiple sources, which
cannot be processed using traditional data processing tools.
- **Hadoop** is a framework used to store and process Big Data efficiently. Data Science utilizes Big
Data, and Hadoop is one of the main tools used in this process.

2. **What is the difference between Domain-Specific Expertise and Reasoning Machines? [2 pts]**
- **Domain-Specific Expertise** refers to AI systems designed to perform tasks in a specific area,
such as medical diagnosis or legal analysis.
- **Reasoning Machines**, on the other hand, can apply logical thinking and inference rules to
make decisions in a broader range of contexts.

3. **List and define characteristics of Big Data with appropriate example? [2.5 pts]**
- **Volume** – The amount of data is massive (e.g., social media data).
- **Velocity** – The speed at which new data is generated (e.g., real-time stock trading).
- **Variety** – Different types of data (e.g., videos, texts, audio).
- **Veracity** – Uncertainty or trustworthiness of data (e.g., user-generated content).
- **Value** – Useful insights extracted from data (e.g., customer behavior analysis).

4. **List at least three goals of Artificial Intelligence? [1.5 pts]**
- Mimic human reasoning and problem-solving.
- Automate repetitive or dangerous tasks.
- Enhance human capabilities and decision-making.

5. **Justify why a computer is referred to as a programmable device. [1 pt]**
- A computer is called a programmable device because it can be instructed through software or code
to perform a wide range of tasks and solve problems, depending on user needs.
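
To make the answer to question 1 more concrete, the sketch below imitates in plain Python the map, shuffle, and reduce phases of the MapReduce programming model that Hadoop uses to process data. It is only an illustration that runs on one machine and does not call any Hadoop API; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample records are assumptions made up for this example.

```python
from collections import defaultdict


def map_phase(records):
    """Emit a (word, 1) pair for every word in every input record."""
    for record in records:
        for word in record.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Group emitted values by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(records):
    """Chain the three phases into a single word-count job."""
    return reduce_phase(shuffle_phase(map_phase(records)))


if __name__ == "__main__":
    # Hypothetical input; a real job would read blocks of files stored in HDFS.
    sample = ["big data needs hadoop", "data science uses big data"]
    print(word_count(sample))
```

In a real cluster the map and reduce steps would run in parallel on many machines, which is why this model suits data too large for a single computer.
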

### **Part I: Write true if the statement is correct & false if the statement is incorrect [each 1 pt]**

1. **True** – Artificial superintelligence could potentially outperform humans in all domains.
2. **True** – Industry 3.0 introduced electronic and IT systems.
3. **True** – Narrow AI performs specific tasks.
4. **True** – Flume is used in Big Data pipelines with Hadoop.
5. **False** – Semi-structured data **does include** elements of structured data.
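
Statement 5 can be illustrated with a small Python sketch. JSON is a common example of semi-structured data: each record carries its own keys (a structured element), yet the records need not share one fixed schema. The sample records and the `field_names` helper are hypothetical, invented only for this illustration.

```python
import json

# Two hypothetical records: both are self-describing, but their fields differ.
raw = """
[
  {"name": "Abebe", "age": 25},
  {"name": "Sara", "email": "sara@example.com", "skills": ["AI", "IoT"]}
]
"""


def field_names(json_text):
    """Return the sorted key names of every record in a JSON array."""
    return [sorted(record.keys()) for record in json.loads(json_text)]


if __name__ == "__main__":
    for fields in field_names(raw):
        print(fields)
```

The keys act like column names in a table, but because every record may list different keys, the data cannot be loaded into a fixed relational schema without extra work.
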

### **Part II: Choose the best answer from the given alternatives [each 1.5 pts]**

1. **C. String** – "Abebe123" is a string data type.
2. **C. Development of industries** – Main feature of Tertiary Industry.
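3. **D. Artificial Intelligence is a specialization of Machine Learning** – This statement is **incorrect**, because Machine Learning is a subset of Artificial Intelligence, not the reverse. The correct statement is **C. Machine Learning enables the machine to learn from data**.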
4. **A. Conferencing equipment** – This is **not** a network device.
5. **B. NoSQL database** – Suitable for scalable big data management.
6. **C. IBM’s Deep Blue** – This is **odd** as it's outdated and not aligned with current AI levels.
7. **B. Unstructured data** – Text-heavy data is unstructured.
8. **B. Cluster computing** – Framework for distributed processing using simple programming.
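
The reasoning behind answer 1 can be checked with a short Python sketch: a value that mixes letters and digits, such as "Abebe123", cannot be read as a number, so it is treated as a string. The helper name `classify` and the three type labels are assumptions made for this example.

```python
def classify(text):
    """Report which basic data type a piece of text could represent."""
    try:
        int(text)  # succeeds only for whole numbers such as "123"
        return "integer"
    except ValueError:
        pass
    try:
        float(text)  # succeeds for decimal numbers such as "12.5"
        return "float"
    except ValueError:
        return "string"  # anything that is not numeric stays a string


if __name__ == "__main__":
    for value in ["Abebe123", "123", "12.5"]:
        print(value, "->", classify(value))
```
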
