DMS Unit 5
DMS Unit 5
3.System Analyst :
System Analyst is a user who analyzes the requirements of parametric end
users. They check whether all the requirements of end users are satisfied.
4.Sophisticated Users :
Sophisticated users can be engineers, scientists, business analyst, who are
familiar with the database. They can develop their own data base
applications according to their requirement. They don’t write the program
code but they interact the data base by writing SQL queries directly
through the query processor.
5,Data Base Designers :
Data Base Designers are the users who design the
structure of data base which includes tables, indexes,
views, constraints, triggers, stored procedures. He/she
controls what data must be stored and how the data
items to be related.
6.Application Program :
Application Program are the back end programmers who
writes the code for the application programs.They are the
computer professionals. These programs could be written
in Programming languages such as Visual Basic, Developer,
C, FORTRAN, COBOL etc.
Types of privileges
Role:
User Roles. A role groups several privileges and roles, so that they can
be granted to and revoked from users simultaneously. A role must be
enabled for a user before it can be used by the user. Oracle provides
some predefined roles to help in database administration.
Transaction
A transaction is a unit of work that is performed
against a database. Transactions are units or
sequences of work accomplished in a logical order,
whether in a manual fashion by a user or automatically
by some sort of a database program.
A transaction is the propagation of one or more
changes to the database. For example, if you are
creating a record or updating a record or deleting a
record from the table, then you are performing a
transaction on that table. It is important to control
these transactions to ensure the data integrity and to
handle database errors.
Properties of Transactions
Transactions have the following four standard properties, usually
referred to by the acronym ACID.
Atomicity − ensures that all operations within the work unit are
completed successfully. Otherwise, the transaction is aborted at the
point of failure and all the previous operations are rolled back to
their former state.
The operating system saves the database files onto tape or some other media. This
is useful to restire the system to an earlier point whenever needed.
Logical Backup
In logical backup technique, the IMPORT/EXPORT utilities are used to create the
backup of the database. A logical backup backs-up the contents of the database. A
logical backup can be used to restore the database to the last backup. However,
unlike physical back, it should not be used to create an OS back up copy because
restoring using this approach would make it possible to correct the damaged
datafiles. Therefore in these situations physical backups should be preferred.
Database recovery
Time-Variant
Historical information is kept in a data warehouse. For example, one can retrieve files from
3 months, 6 months, 12 months, or even previous data from a data warehouse. These
variations with a transactions system, where often only the most current file is kept.
Non-Volatile
Non-Volatile defines that once entered into the warehouse, and data should not change.
S.N Data Warehouse Data Mart
0.
1. It is a centralised system. It is not a centralised system.
2. Data warehouse defines a top-down model. Data mart defines a bottom-up model.
Difference between Data Warehouse and Data Mart
3. Slightly denormalization is involved in data warehouses. Highly denormalization is involved in data mart.
5. Fact constellation schema is preferred in data warehouses. Star schema and snowflake schema are preferred
in data mart.
8. They mostly have longer life spans. It has a shorter life span.
9. Here we get the data in a detailed format. Here we get the summarised version of data.
10. The data warehouse is huge in size. It is smaller as compared to the data warehouse.
KDD PROCESS
Data mining is a subset of the KDD process. The KDD
process is a comprehensive approach to extracting
useful knowledge and insights from large datasets,
while data mining is a specific task within the KDD
process that involves finding patterns and
relationships in data using computational algorithms
Data Selection - The first step in the KDD process is identifying and selecting the
relevant data for analysis.
Data Preprocessing - After selecting the data, the next step is data preprocessing. This
step involves cleaning the data, removing outliers, and removing missing,
inconsistent, or irrelevant data.
Data Transformation - Once the data is preprocessed, the next step is to transform it
into a format that data mining techniques can analyze.
Data Mining - This is the heart of the KDD process and involves applying various data
mining techniques to the transformed data to discover hidden patterns, trends,
relationships, and insights.
Pattern Evaluation - After the data mining, the next step is to evaluate the discovered
patterns to determine their usefulness and relevance.
Knowledge Representation - This step involves representing the knowledge extracted
from the data in a way humans can easily understand and use. This can be done
through visualizations, reports, or other forms of communication that provide
meaningful insights into the data.
Deployment - The final step in the KDD process is to deploy the knowledge and
insights gained from the data mining process to practical applications.
Difference between RDBMS and MongoDB:
RDBMS MongoDB
It is row-based. It is document-based.
It is slower in comparison with MongoDB. It is almost 100 times faster than RDBMS.
It is column-based. It is field-based.
Limited query language and support Rich query language and flexible
Querying
for secondary indexes indexing
Data Strong consistency model for read Eventual consistency model for read and
Consistency and write operations write operations
Geospatial
No support for geospatial data Supports geospatial data
Data
Supports transactions across multiple
Transactions Supports multi-document transactions
tables
Free community version or paid
Pay-per-request or provisioned
Pricing enterprise version with multiple pricing
capacity
options