Data Engineering Pre-Interview Quiz MCQ
Data Engineering Pre-Interview Quiz MCQ
Data Engineering Pre-Interview Quiz MCQ
1. What is the command for displaying list of files in current directory in Linux command line
a. dir
b. files
c. ls
d. lcd
2. What is the command for displaying the content of a file in Linux command line
a. view
b. cat
c. display
d. print
3. What is the command for changing current directory in Linux command line
a. chdir
b. cd
c. changedir
d. ccd
4. Which of the following is not a text editor on Linux command line
a. emacs
b. vim
c. ncat
d. nano
5. What directory path do log files normally are stored in Linux machine
a. /log
b. /tmp/log
c. /var/log
d. /sys/log
6. Which of the following commands can’t be used to setup networking
a. ifconfig
b. ip
c. netcat
d. nmcli
7. Which of the following command can be used to find a certain text inside a file
a. awk
b. find
c. grep
d. lookup
8. Which of the following command can be used to do string search and replace in basic
linux installation
a. sed
b. replace
c. pwd
d. zstd
9. Where do system configuration files normally resides in
a. /sys/config
b. /etc/
c. /conf/
d. /var/
10. Where do user’s files normally resides in
a. /home/<username>/
b. /users/<username>/
c. /data/<username>/
d. /sys/<username>/
15. Which one of the following components is normally not a key component of a Data
Warehouse?
a. Data sources
b. Data entry
c. Data mart
d. ETL
16. Which of the following is commonly the flow of data in a Data Warehouse?
a. Source -> ETL -> Data Mart -> ETL -> Data Warehouse
b. Source -> ETL -> Data Warehouse -> ETL -> Data Mart
c. Source -> ETL -> Data Analysis -> ETL -> Data Warehouse
d. Source -> ETL -> Data Archiving -> ETL -> Data Warehouse
17. If there is a requirement to replicate a table, which have rows that are regularly
modified, from data source into data warehouse in real-time, which of the following tool
can be used to extract the data from the source database ?
a. Batch ETL
b. Change Data Capture
c. Database Backup
d. Message Queue
18. Which of the following data layer will give the best UI performance for front-end Business
Intelligence / Visualization tool to visualize data?
a. Raw data in data warehouse
b. Pre-aggregated data in data mart
c. Star schema in data warehouse
d. Raw data in data source
a. YARN
b. HDFS
c. K8S
d. MR2
20. Which one of the following components provide SQL JDBC access to data residing in
Hadoop?
a. Hive
b. Pig
c. Zookeeper
d. HBase
35. Which of the following method is an invalid form of string formatting in Python 3.7?
a. “Hello {}”.format(name)
b. “Hello ${name}”
c. “Hello %s” % name
d. f“Hello {name}”
36. What would be the output of the following code?: map(lambda x: x % 2, range(1,11))
a. [1,2,3,4,5,6,7,8,9,10]
b. [2,4,6,8,10,12,14,16,18,20]
c. [1,3,5,7,9]
d. [1,0,1,0,1,0,1,0,1,0]
37. What would be the output of the following code?: filter(lambda x: x % 2, range(1,11))
a. [1,2,3,4,5,6,7,8,9,10]
b. [2,4,6,8,10,12,14,16,18,20]
c. [1,3,5,7,9]
d. [1,0,1,0,1,0,1,0,1,0]
38. What would be the output of the following code?: map(lambda x: x * 2, range(1,11))
a. [1,2,3,4,5,6,7,8,9,10]
b. [2,4,6,8,10,12,14,16,18,20]
c. [1,3,5,7,9]
d. [1,0,1,0,1,0,1,0,1,0]
39. Which of the following is an invalid native data type in Python 3.7?
a. list
b. array
c. tuple
d. object
40. Which of the following is an invalid import statement in Python 3.7?
a. from .module import function
b. from ..module import function
c. import function from module
d. from module import function as myfunction