Cs614 Grand Quiz Merge
Cs614 Grand Quiz Merge
2) The ---------------- saw the advent of disk storage, or DASD( direct Access Storage Device) :
• 1960s
• 1970s page 13
• 1950s
• 1990s
3) In context of data warehouse, normally it becomes difficult to extract data from different
sources because these sources are normally.
• Heterogeneous page 140
• Homogeneous
• Centralized
• Baseline
6) Flat files are one of the prevalent structures used in ------------------- data extraction:
• Online
• Offline page 134
• Incremental
• Full
7) Which of the following is NOT one of the advantages of changed data capture (CDC) technique?
• Flat files are not required
• Limited query interface is required for data extraction page 152
• No incremental on-line I/O required for log tape
• Extraction of changed data occurs immediately
12) De-normalization is more like a “controlled crash” with the aim to ------------ without loss of
information:
• Check
• Balance
• Decrease
• Enhance page 49
13) ----------------- is making all efforts to increase effectiveness and efficiency in meeting and
accepted customer expectation:
• Quality assurance
• Quality improvement page 183
• Quality maintenance
• Quality Establishment
14) ------------- is the application of intelligence and experience to get common goals.
• Wisdom page 11
• Education
• Power
• Information
15) In the data transformation, ---------- is the rearrangement and simplification of individual
• Aggregation
• Enrichment page 136
• Splitting joining
• Conversion
20) ------------ provides a combination of “relational databases access” and “cube” data structures
within a single framework:
• HOLAP page 78
• DOLAP
• MOLAP
• ROLAP
21) Data Warehouse provides the best support for analysis while OLAP carries out the -------------
task:
• Mandatory
• Whole
• Analysis page 69
• Prediction
22) ------------------ involves splitting a table by columns so that a group of columns is placed into the
new table and the remaining columns are placed in another new table:
• Vertical splitting page 56
• Horizontal splitting
• Adding redundant column
• None of the given option
24) If each cell of Relation R contains a single value ( no repeating values) then it is confirmed that :
• Relation R is in 1st Normal Form page 43
• Relation R is in 2nd Normal Form
• Relation R is in 3rd Normal Form
• Relation R is in 3rd Normal Form but not in 2nd Normal Form
27) Which people criticize Dimensional Modeling (DM) as being a data mart oriented approach?
• Those that consider ER models as Data marts
• Those that consider Business processes as Data marts page 110
• Those that consider Data marts as Data warehouse
• Those that consider dimensional model
• Those that consider dimensional modeling as de-normalization approach
31) -------------------- allows download of “cube” structures to a desktop platform without the need
for shared or cube server:
• MPLAP
• ROLAP
• DOLAP page 78
• HOLAP
33) -------------------- is usually deployed when expression can be used to group data together in such
a way that access can be targeted to a small set of partitions:
• Expression elimination
• Expression partitioning page 67
• Expression indexing
• None of the given option
34) Taken jointly, the extract programs or naturally evolving systems formed a spider web, also
known as
• Distributed Systems Architecture
• Legacy System Architecture page 14
• Online System Architecture
• Intranet System Architecture
35) The data has to be checked , cleaned and transformed into a --------------- format to allow easy
and fast access
• Unified page 20
• Predicated
• Qualified
• Proactive
36) Suppose in a system A, the values of “PhoneNo” attribute were stored in “countrycode-phone-
extension” format, however after transformation into data warehouse the separate columns
were used for “countrycode”,”phone” and “extension”. The above scenario is an example of :
• One-to-one scalar transformation
• One-to-many element transformation page 144+conceptual
• Many-to-one element transformation
• Many-to-many element transformation
41) Suppose the size of the attribute “Computerized National Card (CNIC) no. is changed in NADRA
database. This transformation refers to:
• Format revision page 153
• Field splitting
• Field decoding
• Calculation of derived value
42) The divide and conquer cube partitioning approach helps alleviate the ------------ limitations of
MOLAP implementation:
• Flexibility
• Maintainability
• Security
• Scalability page 85
44) ------------- can be used when some columns are rarely accessed rather than other columns or
when the table has wide rows or header or both:
• Horizontal splitting
• Pre-joining
• Vertical splitting page 56
• Derived attributes
46) The online high performance transaction processing was evolved in --------------:
• 1980
• 1975 page 12
• 1977
• 1965
47) Cube is a logical entity containing values of a certain aggregation level at an intersection of a
combination of -------------------- :
• Facts
• Dimension page 88
• Summary tables
• Primary and foreign key
50) The main reason(s )for the increase in cube size may be:
• Increase in the number of dimensions
• Increase in the cardinality of the dimensions
• Increase in the amount of detail data
• All of the given options page 87
51) Suppose the amount of data recorded in an organization is doubled in year. This increase in ------
-----:
• Linear
• Quadratic
• Exponential page 15
• Logarithmic
53) --------------- models the macro relationships among data elements with an overall deterministic
strategy:
• Dimensional model page 102
• Entity relationship model
• Object oriented model
• Structured model
55) ----------------- technique requires a separate column to specify the time and date when the last
modification was occurred:
• Checkmarks
• Timestamps page 150
• Just-in-Time
• Real Time extraction
56) Which of the de-normalization technique squeezes master table into detail?
• Pre-joining page 58
• Horizontal splitting
• Vertical splitting
• Adding redundant column
57) De-normalization can help:
• Minimize joins
• Minimize foreign keys
• Resolve aggregates
• All of the given options page 51
58) The domain of the “gender” field in some database may be (‘F’,’M’) or as (“Female”, “Male”) or
even as (1, 0). This is:
• Primary key problem
• Non primary key problem page 163
• Normalization problem
• All of the given option
63) Relational databases allow you to navigate the data in ------------- that is appropriate using the
primary , foreign key structure with in the data model:
• Only One Direction
• Any Direction page 19
• Two Direction
• None of these
65) De- normalization is the process of selectively transforming normalized relations into un-
normalized physical record specifications , with the aim to:
• Well structure the data
• Well model the data
• Reduce query processing time page 50
• None of the given option
67) Suppose in system A, the possible values of “Gender” attribute were “Male”& “Female”,
however in data warehouse ,the values stored were “M” for male and “F” for female. This above
scenario is an example of :
• One-to-one scalar transformation page 144
• One-to-many element transformation
• Many-to-one element transformation
• Many-to-many element transformation
76) Node of a B-Tree is stored in memory block and traversing a B-Tree involves --------------- page
faults:
• O(n log n)
• O(log n) page 22
• O(n)
• O(n2)
77) As dimensions get less detailed (e.g. , year vs. day) cubes get --------------------
• Smaller page 84
• Larger
• Partitioned
• Merged
78) Which of the following is not a technique of “ Changed Data Capture” in currently used Modren
Source System?
• Timestamps
• Partitioning
• Triggers
• Dimensional Modeling page 150
79) The trade-offs of de-normalization is/are:
• Storage
• Performance
• Ease-of-use
• All of the given options page 62
80) If actual data structure does not conform to documented formats then it is called:
• Syntactically dirty data page 160
• Semantically dirty data
• Coverage anomaly
• Extraction issue
81) “Header size is reduced, allowing more rows per back , thus reducing I/O” .The above statement
is TRUE with respect to:
• Vertical splitting page 56
• Horizontal splitting
• Adding redundant column
• None of the given options
82) ----------------- Breaks a table into multiple tables based upon common column values
• Horizontal splitting page 54
• Vertical splitting
• Adding redundant column
• None of the given option
87) The growth of master files and magnetic tapes exploded around the mid- ---------------
• 1950s
• 1960s page 12
• 1970s
• 1980s
88) If one or more records in a relational table do not satisfy one or more integrity constraint , then
the data:
• Is syntactically dirty
• Is semantically dirty page 160
• Has Coverage anomaly
• Has extraction issue
93) In case of multiple sources for the same data element , we need to prioritize the source systems
per element based, the process is called:
• Ranking page 143
• Prioritization
• Element selection
• Measurement selection
95) In ------------------ SQL generation in vastly simplified for front-end tools when the data is highly
structure:
• MOLAP
• Star Schema page 107
• Hybrid schema
• Object oriented schema
97) In Context of Change Data Capture (CDC) sometimes a ------------- object can be used to store
recently modified data:
• Buffer table
• Change table page 149
• Checkmark table
• Change control table
98) “Sometimes during data collection complete entities are missed”. This statement is an example
of :
• Missing tuple page 161
• Missing attribute
• Missing aggregates
• Semantically dirty data
106) For large record spaces and large number of records , the run time of the clustering
algorithms:
• Prohibitive page 164
• Static
• Exponential
• Numerical
107) ------------- can result in costly errors, such as , False frequency distributions and incorrect
aggregates due to double counting:
• Data duplication page 165
• Data reduction
• Data anomaly
• Data transformation
108) The degree to which values are present in the attributes that require them is known as -
---------------------:
• Completeness page 185
• Uniqueness
• Accessibility
• Consistency
109) Time complexity of Key Creation process in basic Sorted Neighborhood (BSN) Method is
----------------------:
• O(n log n)
• O(log n)
• O(n) page 171
• O(2n)
118) Considered the following Employee table and identify the column which causes that the
table is not in first normal form(1NF):
Employee(Emp_ID, Emp_Name ,Emp_skills, Emp_Designation)
• Emp_ID
• Emp_Name
• Emp_skills page 43(conceptual)
• Emp_Designation
120) --------------- segregate data into separate partitions so that queries do not need to
examine all data in a table when WHERE clause filters specify only a subset of the partitions.
• Pre-joining technique
• Collapsing table technique
• Horizontal splitting technique page 56
• Vertical splitting technique
CS614
• MOLP
• ROLAP
• DOLAP
• HOLAP
• Past
• Present
• Future
• History
5) The data in the data warehouse is___________
• Volatile
• Non-volatile
• Static
• Non-structure
• Quantity sold
• Total sale in Rs
• Discount in percentage
• Count of orders in a store
• MOLAP
• Star Scheme
• Hybrid schema
• Object oriented scheme
• Individual Transactions
• Daily aggregrates
• Monthly aggregates
• Normalized attributes
• Buffer table
• Changing table
• Checkmark table
• Change control table
• Missing tuple
• Missing attribute
• Missing aggregation
• Semantically dirty data
• Evolving data
• Keeping Static data
• Tracking past data
• Maintaining historic data
• Storage
• Performance
• Ease-of-use
• All of the given option
• Physical
• Logical
• Dimension
• Multivalued
• Many-to-many
• One-to-many
• One-to-one
• None of the given options
• Format revision
• Field spilitting
• Field decoding
• Calculation of derived value
• Cubes
• Data marts
• Data warehouse
• Aggregation
• DOLAP
• HOLAP
• ROLAP
• MOLAP
• One-to-many relation
• One-to-one relation or mant-to-many relation
• Unified
• Predicted
• Qualified
• Proactive
46). “More resources means proportionally less time for given amount of
data” that statement refers to
• Scale-Up
• Speed-Up
• Size-Up
• Over-Utilized system
47)The optimizer uses a hash join to join two tables if they are joined
using an equijoin and
• outer table has less number of rows
• inner table has less number of rows
• cardinality of table is equal
• large amount of data needs to be joined
48) “If resources increase in proposition to increase in data size, time is
constant”. The statement refers to
• Scale-up
• Speed-up
• Size-up
• Over-utilized system
49) If a product meets formally defined “requirement specifications”, yet
fails to be a quality product form the customer’s perspective, this means
the requirements were _________.
• Defective
• Unclear
• Unrefined
• Undefined
72)___________ is the degree of utility and value the data has to support
the enterprise processes that enable accomplishing enterprise objectives.
• Intrinsic Data Quality
• Realistic Data Quality
• Strong Data Quality
• Weak Data Quality
73) _________ is a system of activities that assures conformance of
product to pre-established requirements.
• Quality assurance
• Quality improvement
• Quality maintenance
• Quality establishment
74) In context of nested-loop join actual number of matching rows
returned as a result of the join would be _________ of the order of tables
• Dependent
• Independent
• Superset
• Subset
75) In context of bitmap index, the length of the bit vector is:
• The possible number of domain values in corresponding field
(column)
• The number of records in the base table
• The possible number of bitmap tables formed for corresponding
field (column)
• None of the given options
76)The _________ operator proves useful in more complex metrics
applicable to the dimensions of timeliness and accessibility.
• Max page
• Min
• Min and Max
• None of given
77) In nested-loop join case, if there are ‘M’ rows in outer table and ‘N’
rows in inner table, time complexity is
• (M log N)
• O (log MN)
• O (MN)
• O (M + N)