Data Basics

Chapter 1 discusses the fundamentals of data and information, defining data as a collection of facts that can be processed into meaningful information. It outlines data processing activities such as capturing, manipulating, and managing output results, as well as the differences between data and information. Additionally, it explains file types, file organization methods, and the components and objectives of database systems.

Uploaded by

Hafsa Himayat
Copyright © All Rights Reserved
IT Series: Computer Science (MS Access and C)

Chapter 1
DATA BASICS

Q. Define data and information. Explain with examples.

Data
Data is a collection of facts and figures related to an object. An object can be a person, event or anything about which data is collected. Data may consist of text, numbers, images, sound and videos. Data can be processed to produce useful information.

Data is very important for an organization. It can be used by the managers to perform effective and successful operations of management. It provides a view of past activities related to the rise and fall of an organization. It also enables the organization to make better decisions for future activities.

Example
The data of a student may consist of Roll Number, Student Name, Father Name and Marks of different subjects. The purpose of collecting this data is to maintain the record of the student during the study period.

Information
The processed data is called information. Information is meaningful, useful and organized. It is used for making decisions. Data is used as input for processing and information is the output of this processing.

Example
The data of the students can be processed to produce useful information such as total marks and grade of the student. It can also be processed to find the number of passed and failed students etc.

Q. Define data processing. What activities are involved in data processing? Discuss in detail.

The process of manipulating data to achieve the required objectives and results is called data processing. It is also known as operations on data. Software is used to process data and convert it into meaningful information. A series of actions or operations is performed on data to get the required output or result.

Activities in Data Processing
Different activities involved in data processing are as follows:
• Data capturing
• Data manipulation
• Managing output results

1. Data Capturing
The process of recording the data in some form is called data capturing. Data is captured before it can be processed. Data may be recorded on source documents. Data can also be given directly to the computer through input devices.

2. Data Manipulation
The process of applying different operations on data is called data manipulation. The following operations can be performed on data:

Classifying
The process of organizing data into classes or groups is called classifying. For example, the data in a college can be classified in two groups. The data of students may be in one group and the data of teachers may be in the second group.

Calculation
The process of applying arithmetic operations on data is called calculation. The common calculations are addition, subtraction, multiplication and division etc. For example, the total marks of a student are calculated to find the grade.

Sorting
The process of arranging data in a logical sequence is called sorting. The data can be sorted numerically or alphabetically. For example, the names of the students can be sorted according to the obtained marks.

Summarizing
The process of reducing a large amount of data to a more concise and usable form is called summarizing. For example, the data of the students in a class can be summarized to show the number of passed and failed students.

3. Managing Output Result
The following activities can be performed on data after the data has been captured and manipulated:

Storage
The process of retaining data for future use is called data storage. Different storage media are used to store the data such as hard disks and tapes etc. For example, the student data is stored on the hard disk.

Retrieval
The process of accessing or fetching the stored data is called data retrieval. The data can be retrieved as and when required. The retrieved data can be displayed in different forms such as reports, graphs and charts etc.
For example, the student data can be retrieved from the hard disk anytime to prepare the result card.

Communication
The process of transferring data from one location to another is called data communication. The data may be transferred to different locations for further processing. For example, the result can be sent to the students via email.

Reproduction
The process of copying or duplicating data is called reproduction. Data can be reproduced if different users need data at different locations.

Q. What is difference between data and information?

The difference between data and information is as follows:

Data | Information
Data is a collection of facts and figures. | Information is processed form of data.
Data is used as input in the computer. | Information is the output of computer.
Data is normally huge in its volume. | Information is normally short in its volume.
Data is difficult or even impossible to reproduce. For example, it is very hard to reproduce the data of census if it is lost. | Information is easier to reproduce. For example, the number of graduate citizens can be recalculated from the stored data.
Data is used rarely. | Information is used frequently.
Data does not depend on information. | Information depends on data.

Q. Define file, record and field in detail.

Field
A combination of one or more characters is called a field. It is the smallest unit of data that can be accessed by the user. The name of each field in a record is unique. The data type of a field indicates the type of data that can be stored in the field. Each field contains one specific piece of information. A field size defines the maximum number of characters that can be stored in a field.

The EmployeeID, Name, HireDate, JobTitle and Phone are examples of fields that represent an employee.

Figure 1.1: Fields and Record

Record
A collection of related fields used as a single unit is called a record. For example, an employee's record includes a set of fields that contains EmployeeID, Name, HireDate, JobTitle and Phone etc.

File
A collection of related records used as a single unit is called a file. A file is also known as a data set. Files are stored on different storage media such as hard disk, USB flash drive or optical disc etc.

The Employee file may contain the records of hundreds of employees. Each employee's record consists of the same fields but each field contains different data.

Figure 1.2: A File

Q. Describe the file types from usage point of view.

The types of files from usage point of view are as follows:

1. Master File
Master files are used to store the information that remains constant for a long period of time. For example, a college maintains a master file of all students. The file may contain fields such as Student Name, Father Name, Address, Phone and Email etc. The master files are the latest updated files. These files are updated when any change in their contents is required. These files are never empty once they are created.

2. Transaction File
A type of file that stores the input data before processing is called a transaction file. It may be a temporary file and may exist until the master file is updated. It may also be used to maintain a permanent record of the data about a transaction. For example, a transaction file can be used to store the fee deposited by the student.

3. Backup File
A type of file that is used to take backup of important data is called a backup file. It is a permanent file. It is used to store an additional copy of data. The data can be recovered from this file if the original file is lost or damaged. Backup files are mostly created by using specific software.

Q. Describe the file types from functional point of view.

The files are given proper names from functional point of view.
The name of a file consists of a file name and a file extension. The name and extension of a file are separated by a dot. The extension is normally assigned by the software in which the file is created.

File Types from Functional Point of View
The types of files from functional point of view are as follows:

1. Program File
A type of file that contains the software instructions is called a program file. A program file has an extension of .com or .exe. These files contain the instructions that can be directly executed by the computer.

2. Data File
A type of file that contains data is called a data file. Data files are created by the software being used. Different software store data in the data files using different formats. A data file is generally opened in the same software in which it is created. It can also be opened in different software that supports the format of that data file. Some examples of data files, created in different software, are as follows:
• Audio File
• Video File

Q. How do we organize files on storage media?

A technique for physically arranging the records of a file on secondary storage devices is called file organization. The files are organized on storage media in the following ways:

1. Sequential Files
The records in sequential file organization are stored in sequence. A sequence means the records are stored one after the other. The records can be retrieved only in the sequence in which they were stored. The primary storage medium for sequential files is magnetic tape.

The major disadvantage of sequential access is that it is very slow. It requires more processing time. If the last record is to be retrieved, all preceding records are read before reaching the last record.

2. Direct or Random Files
The records in direct file organization are not stored in a particular sequence.
A key value of a record is used to determine the location to store the record. Each record is accessed directly without going through the preceding records.

This file organization is suitable for storing data on disk. Direct file organization is faster than sequential file organization for finding a specific record. A problem that may occur in this type of file is called a synonym. The problem occurs if the same address is calculated to store two or more records.

3. Indexed Sequential Files
In indexed sequential file organization, records are stored in ascending or descending order. The order is based on a value called key. Additionally, indexed file organization maintains an index in a file.

An index consists of key values and the corresponding disk address for each record in the file. Index refers to the place on a disk where a record is stored. The index file is updated whenever a record is added to or deleted from the file.

The records in indexed file organization can be accessed in sequential access as well as random access or direct access. The records in this file type require more space on storage media. This method is slower than direct file organization as it requires an index search.

Q. What is file processing system? Discuss the problems or disadvantages faced in traditional file processing approach.

File Processing System
A file processing system is a set of programs used to manage data stored in the files. The programs in the file processing system are developed independently. Each program defines and manages its own data files. The records in one file may not be related to the records stored in other files. The following figure shows a typical file processing system that uses a different set of files for each department in an organization.
Figure 1.3: File processing system

Problems / Disadvantages in Traditional File Processing Approach
Different problems or disadvantages faced in traditional file processing approach are as follows:

1. Data Redundancy
Data redundancy means duplication of data in multiple files. The redundancy of data causes wastage of storage. In file processing system, the same data may be duplicated in several files.

Suppose two files are used in a college. The Students file contains the data such as RollNo, Name, Address, Phone and other details of the students. The Library file contains the same data of the students who borrow books from library along with the information about the book. The data of one student appears in two files. It causes wastage of storage and creates many problems.

2. Data Inconsistency
Data redundancy in file system results in data inconsistency problem. Inconsistency means that two files may contain different data of the same entity. For example, the address of a student must be updated in all files if any change occurs. It is possible that it is changed in Students file but not in Library file. The data becomes inconsistent in this situation.

3. Program Data Dependency
Program data dependency is a relationship between data in files and the program required to update and maintain the files. Application programs are developed according to a particular file format in file processing system. If the format of a file is changed, the application program also needs to be changed accordingly. For example, if there is a change in the length of postal code, it requires a change in the program. The changes may be costly to implement.

4. Lack of Flexibility
The file processing system is not very flexible. It can take a lot of time to collect the data from different files and write programs to produce the desired information.

5. Integrity Problems
Integrity means reliability and accuracy of data.
The stored data must satisfy certain types of consistency constraints. For example, RollNo and Marks of the students should be numeric values. It is very difficult to apply these constraints on files in file processing system.

6. Security Problems
File processing system does not provide adequate security on data. In some situations, it is required to provide different types of access to data for different users. For example, a data entry operator should only be allowed to enter data. The chairman of the organization should be able to access or delete the data completely. Such types of security options are not available in file processing system.

Q. What is database? Explain with the help of an example. Which facilities are provided by database system?

Database
A database is a collection of logically related data sets or files. Each file may contain a different type of information and is used for a specific purpose. The files may be organized in different ways to meet different processing and retrieval requirements of the users.

Example
A bank may have different files for the clients as follows:
• Saving Accounts
• Current Accounts
• Automobile Loan
• Personal Loan
• Clients Information etc.

The client database of the bank will consist of the records from all of the above files. The data of any client can be added, retrieved or updated easily using database programs.

Facilities of Database System
A database system normally provides the following facilities to the user:
• Adding new files to database
• Inserting new data in existing files
• Retrieving data from existing files
• Updating data in existing files
• Deleting data from existing files
• Removing existing files from database

Q. Define database system. What are four major components of database systems? Write in details.

Database System
A database system is a collection of data as well as programs required to manage that data.
A database system is a computerized record-keeping system. The main purpose of this system is to maintain data and provide it to the user when it is required.

Components of Database Systems
The four major components of database system are as follows:

1. Data
Data is the most important component of database system. Data is a collection of facts stored in the database. The basic purpose of a database system is to store, maintain and process data for the user.

2. Hardware
The physical components of a computer system are called hardware. The hardware is used to perform different tasks such as input, output, storage and processing. Some important hardware components are as follows:
• Secondary Storage
• I/O Devices
• Processors
• Main Memory

3. Software
Different types of software used in a database system are as follows:

Operating System
It manages all hardware components. It also enables all other software to run on the computer.

DBMS Software
It is the database management system that is used to create and manage a database system.

Application Programs
These are used to access and process the data stored in the database.

4. Personnel
The people related to the database system are called personnel. Different types of persons in a database system are as follows:

Database Administrator
The database administrator is a person who is responsible for managing the whole database system.

Application Programmer
The application programmer is a person who writes the application programs to access data from the database.

End Users
The end users are the persons who use application programs to perform different operations on the database. They include clerks, managers and directors etc.

Figure 1.4: Major Components of Database System

Q. Explain different objectives of the databases.

The main objectives of using databases are as follows:

1. Data Integration
The data in file system is stored in separate files. It is very difficult to access data stored in separate and independent files. An important objective of databases is to solve this problem. The data in a database may be located at different computers physically but it is connected through data communication links. In this way, data appears centralized logically.

2. Data Integrity
Data integrity means the reliability and accuracy of data. Integrity rules are designed to keep the data consistent and correct. These rules act like a check on the incoming data. Enforcing data integrity ensures the quality of data in the database. DBMS provides several methods to enforce integrity of the data in a database. For example, an integrity rule can make sure that the same Employee ID is not assigned to multiple employees.

3. Data Independence
Database approach provides the facility of data independence. It means that the data and the application programs are separate from each other. The user can change data storage structures and operations without changing the application programs. The user can also modify programs without reorganization of data.

Q. Define database model. Describe different types of database models.

A set of rules and standards that define how the database organizes data is called database model. It also defines how users view the organization of data.

Types of Database Models
There are three types of database models. These models are as follows:

1. Hierarchical Model
The hierarchical model arranges records in a hierarchy like an organizational chart. Each record type in this model is called a node or segment. A node represents a particular entity. The top-most node is called root. Each node is a subordinate of the node that is at the next higher level. A higher level node is called parent and lower level node is called child.
A parent node can have one or many child nodes. A child node can have only one parent node. This kind of structure is often called an inverted tree.

Figure 1.5: Hierarchical Model

2. Network Model
The network model is similar to the hierarchical model. The difference is that a child node can have more than one parent node. The child nodes are represented by arrows in network model. It requires a more complex diagram to represent a database. It also provides more flexibility than hierarchical model.

Figure 1.6: Network Model

3. Relational Model
Relational model is the most commonly used database model. It is more flexible than hierarchical and network database models. The relational model consists of simple relations. A relation is a term used for table. A relation represents a particular entity. It is used to store information about the entity. The relationships are based on the data of the entities. The relationship between entities is represented by the following diagram:

Figure 1.7: Relational Model

Q. What is database management system (DBMS)? Discuss its objectives.

Database Management System
A database management system (DBMS) is a collection of programs that are used to create, maintain and access a database in a convenient and efficient manner. DBMS uses database manager software to control the overall structure of a database. DBMS is normally used by different organizations for different purposes.

Figure 1.8: Database Management System

Objectives of Database Management System
Some important objectives of database management system are as follows:

1. Shareability
An ability to share data is a fundamental objective of database management system.
Shareability means that data is shared by different people at the same time. Data is stored at a central place. Different users can access this data from different locations. It reduces storage cost and provides data consistency.

2. Availability
Availability means that the users must be able to access data and DBMS easily. The data should be available when and where it is required.

3. Evolvability
Evolvability means that DBMS should provide the facility to change the database due to increase in user requirements or change in the technology. The change in database may occur in two ways. It may change by contents or by structure. For example, the structure of the database may be changed if the operation of an organization is expanded.

4. Database Integrity
Database integrity ensures that data entered in the database is accurate and consistent. Database is normally shared among different users. Some measures must be maintained to ensure database integrity.

Q. Discuss different features of database management system.

Some important features of database management system are as follows:

1. Data Dictionary
Data dictionary is a file that stores all data definitions for a database. It may also monitor the data that is entered. It ensures that data is according to the data definition rules. The rules include field names, field sizes and data types etc. Data dictionary is also used for data access authorization for database users. Data dictionary is also called repository.

2. Utilities
DBMS utilities are programs that are used to maintain the database. Some utilities are used for backup and recovery procedures of databases.

3. Query Language
A query language is used to perform different operations on the databases. The most popular query language is SQL. It stands for Structured Query Language. The statements of SQL are written in simple English-like sentences.
Some important commands of SQL are SELECT, DELETE, CREATE, ALTER, UPDATE and INSERT etc. SQL can be used for the following purposes:
• Create table structures
• Enter data in tables
• Retrieve data from tables
• Update data in tables
• Delete data from tables etc.

4. Report Generator
A report generator is a program that is used to produce reports. It retrieves data from the database and displays it in different formats. The user can use report generator to format page numbers, dates, titles and column headings etc. The users can produce useful and attractive reports by using report generator. Report generator is also called report writer.

5. Access Security
Access security refers to the protection of database from unauthorized access. The database management system provides several procedures to maintain data security. The security is maintained by allowing access to the database through the use of passwords. Different users of database system have different levels of access rights to the database.

6. Backup and Recovery
Database management system provides the facility for backup and recovery. Backup facility is used to store an additional copy of data. The data can be recovered from the backup file if the original file is lost or damaged.

Q. Briefly describe the advantages and disadvantages of DBMS.

Some important advantages of database management system are as follows:

1. Data Independence
Database approach provides the facility of data independence. It means that the data and the application programs are separate from each other. The user can change data storage structures and operations without changing the application programs. The user can also modify programs without reorganization of data.

2. Redundancy Control
Redundancy means duplication of data in multiple files. The data in a database appears only once. It is not duplicated.
The same data is used at all places it is required.

Suppose we want to manage the information of students who study in college and the students who live in hostel. The data of students is stored in college database. If a student also lives in hostel, his record will be stored only in college database. It will not be duplicated.

3. Consistency Constraints
Consistency constraints are the rules that must be followed to enter data in the database. If data does not fulfill these constraints, it cannot be entered in the database. Database management systems provide an easy way for applying different consistency constraints that ensure the consistency of the data in databases. For example, a constraint can be applied to ensure that the marks of a student are always between 0 and 100 etc.

4. Support Complex Data Relationships
Database management system allows the user to design complex data structures. It enables the user to logically view and access data in different ways.

5. Data Security Features
Data security is the protection of the database from unauthorized access. The database management system provides several procedures to maintain data security. The security is maintained by allowing access to the database through the use of usernames and passwords. Different users of database system have different levels of access rights to the database.

6. Database Backup & Recovery
Database management system provides the facility for backup and recovery. Backup facility is used to store an additional copy of data. The data can be recovered from this file if the original file is lost or damaged. A utility in DBMS is used to recover data. The utility rebuilds the data by using the backup copy of the data.

7. Advanced Capabilities
DBMS provides advanced access capabilities for online reporting. It uses Internet for advanced capabilities.
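
The consistency constraint mentioned in the advantages above (marks of a student always between 0 and 100) and the rule that the same identifier is not assigned twice can be sketched in a few lines. This is only an illustration under stated assumptions: it uses Python's built-in sqlite3 module as a stand-in for a DBMS, and the table name Students, the columns RollNo, Name and Marks, the sample rows and the helper try_insert are hypothetical names chosen for the example.

```python
import sqlite3

# In-memory database used only for this illustration (assumption: SQLite as the DBMS).
conn = sqlite3.connect(":memory:")

# Hypothetical table: RollNo must be unique, Marks must lie between 0 and 100.
conn.execute(
    """
    CREATE TABLE Students (
        RollNo INTEGER PRIMARY KEY,
        Name   TEXT NOT NULL,
        Marks  INTEGER CHECK (Marks BETWEEN 0 AND 100)
    )
    """
)


def try_insert(roll_no, name, marks):
    """Return True if the row is accepted, False if a constraint rejects it."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "INSERT INTO Students (RollNo, Name, Marks) VALUES (?, ?, ?)",
                (roll_no, name, marks),
            )
        return True
    except sqlite3.IntegrityError:
        return False


ok_valid = try_insert(1, "Student A", 85)          # satisfies both rules
ok_out_of_range = try_insert(2, "Student B", 140)  # marks outside 0..100
ok_duplicate = try_insert(1, "Student C", 70)      # RollNo 1 is already used

row_count = conn.execute("SELECT COUNT(*) FROM Students").fetchone()[0]
print(ok_valid, ok_out_of_range, ok_duplicate, row_count)
```

The rules are declared once, in the table definition, and the database rejects any row that breaks them. In a file processing system each application program would have to repeat the same checks itself.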
Disadvantages of Database Management System
Some important disadvantages of database management system are as follows:

1. High Cost of DBMS
A complete database management system is very large and complex software. It is expensive to purchase database management software.

2. Higher Hardware Cost
Database management system is complicated and heavy software. It requires a large amount of memory and higher processing power to run, and suitable hardware to work properly and efficiently.

3. Appointing Technical Staff
Technical staff such as a database administrator and application programmers are required to manage DBMS. The organization needs to pay good salaries to the technical staff.

4. Cost of Staff Training
DBMS is a complex system and it requires trained users to use it properly. The user training is required in all fields such as programming, application development and database administration. An organization may need to spend a large amount on staff training.

5. Problems in Wrong Database Environment
Problems may occur if the wrong type of database environment is selected. A database system may also need changes due to change in requirements. The change can be costly due to conversion and testing of the existing programs. It may require a lot of cost to implement the changes.

6. Need of Data Dictionary
Another disadvantage of DBMS is the need of data dictionary. Data dictionary stores data definitions or description of the structure of data used in database. It ensures that data is according to the data definition rules. The rules include field names, field sizes and data types etc. Data dictionary is also used for data access authorization for database users. Data dictionary is a very useful tool but it is also expensive. It requires installation costs as well as hardware requirements.

Short Questions

Q1. Define data.
Data is a collection of facts and figures related to an object.
Data may consist of text, numbers, images, sound and videos. Data can be processed to produce useful information.

Q2. How is data useful for an organization?
Data is very important for an organization. It can be used by the managers to perform effective and successful operations of management. It also enables the organization to make better decisions for future activities.

Q3. Define information.
The processed data is called information. Information is meaningful, useful and organized. It is used for making decisions. Data is used as input for processing and information is the output of this processing.

Q4. Give an example of data and its corresponding information.
The data of a student may consist of Roll Number, Student Name, Father Name and Marks of different subjects. The data of the students can be processed to produce useful information such as total marks and grade of the student.

Q5. Differentiate between data and information.
Data is a collection of facts and figures related to an object. The information is generated by processing the data. Data is used as input in the computer and information is the output of the computer.
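
The relationship between data and information described in Q4 and Q5 can be illustrated with a short sketch. It is a minimal example in Python, not a prescribed method. The student records, the pass percentage of 40, the maximum of 100 marks per subject and the function name process are all assumptions made for illustration. The comments mark where the manipulation steps from this chapter (calculation, classifying, sorting, summarizing) appear.

```python
# Raw data: hypothetical student records (roll number, name, marks per subject).
students = [
    {"roll_no": 1, "name": "Student A", "marks": [78, 65, 90]},
    {"roll_no": 2, "name": "Student B", "marks": [30, 25, 41]},
    {"roll_no": 3, "name": "Student C", "marks": [55, 60, 48]},
]

PASS_PERCENTAGE = 40          # assumed pass mark, not taken from the text
MAX_MARKS_PER_SUBJECT = 100   # assumed maximum marks per subject


def process(records):
    """Turn raw records (data) into results and a summary (information)."""
    results = []
    for record in records:
        total = sum(record["marks"])                            # calculation
        maximum = MAX_MARKS_PER_SUBJECT * len(record["marks"])
        percentage = 100 * total / maximum
        status = "pass" if percentage >= PASS_PERCENTAGE else "fail"  # classifying
        results.append({"name": record["name"], "total": total, "status": status})

    results.sort(key=lambda r: r["total"], reverse=True)        # sorting

    summary = {                                                 # summarizing
        "passed": sum(1 for r in results if r["status"] == "pass"),
        "failed": sum(1 for r in results if r["status"] == "fail"),
    }
    return results, summary


results, summary = process(students)
for r in results:
    print(r["name"], r["total"], r["status"])
print(summary)
```

The list students is the input (data). The sorted result list and the pass and fail counts are the output (information) that can be used for decisions.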
