Data Processing
Data Processing
I. Introduction Meaning and purpose of Data processing - Source documents data input data Manipulation - Output of information - data storage -Files and Records - file creation - File access - File manipulation and maintenance - File generation - sequential and Direct file organization Data processing Elements, data entry, data processing and computer applications. Computer application to functional areas- accounting, inventory control, marketing.
Syllabus
2.Meaning and purpose of windows - menus - Dialog Boxes - File Management under Windows, features of word processing under Windows - Microsoft Word - File Menu Using Letter wizard for producing business letters -Entering, selecting, inserting, viewing text - Normal view - Page view - Point view - Zooming the view - character and paragraph formating - Printing a document. 3. Business data analysis using excel spread sheet spreadsheet overview - formatting worksheet Data - Relative and absolute Referencing working with Formula working with Functions - Creating and using Macros - Data Management through worksheets - analysis through charts/graphs - Setting print Styles Printing worksheets and charts/Graphs. Mini Project
Data
Data The word data is derived from Latin language. It is plural of Datum (But Data is usually used as a singular term.) Datum (singular) Data (plural). Data is any collection of facts of figures. The data is the raw material to be processed by a computer. Example Names of students, marks obtained in the examination, designation of employees, addresses, quantity, rate, sales figures or anything that is input to the computer is data. Even pictures, photographs, drawings, charts and maps can be treated as data. Computer processes the data and produces the output or result
Types of Data
Mainly Data is divided into two types: 1. Numeric Data 2. Character Data 1. Numeric Data The data which is represented in the form of numbers is known as Numeric Data. This includes 0-9 digits, a decimal point (.), +, /, sign and the letters E or D. 2. Character Data Character data falls into two groups. i. String Data ii. Graphical Data String Data String data consists of the sequence of characters. Characters may be English alphabets, numbers or space. The space, which separates two words, is also a character. The string data is further divided into two types. a. Alphabetic Data b. Alphanumeric Data Graphical Data It is possible that pictures, charts and maps can be treated as data. The scanner is normally used to enter this type of data. The common use of this data is found in the National Identity Card.
Information
A collection of data which conveys some meaningful idea is information. It may provide answers to questions like who, which, when, why, what, and how. or The raw input is data and it has no significance when it exists in that form. When data is collated or organized into something meaningful, it gains significance. This meaningful organization is information or Observations and recordings are done to obtain data, while analysis is done to obtain information
Data Processing
Data processing: Any operation or set of operations performed upon data, whether or not by automatic means, such as collection, recording, organization, storage, adaptation or alteration to convert it into useful information.
It is the process through which collected data is transformed into a form that computer can understand. It is very important step because correct output result totally depends on the input data. In input step, following activities can be performed. i) Verification The collected data is verified to determine whether it is correct as required. For example, the collected data of all B.Sc. students that appeared in final examination of the university is verified. If errors occur in collected data, data is corrected or it is collected again. ii) Coding The verified data is coded or converted into machine readable form so that it can be processed through computer. iii) Storing The data is stored on the secondary storage into a file. The stored data on the storage media will be given to the program as input for processing.
Database Package
A computer program that allows you to easily store, organise and retrieve information. Enables you to:
define data to be stored define how it is to be stored and kept up to date define ways in which data may be retrieved and presented. enter or update data obtain answers to questions about data
Database
a complete set of data used for a particular purpose.
Field name
Form View
Field Record
Database
Database
Field name
List View
Record
Field
Record all the data about a single entity (person, thing) a set of related fields Field one particular piece of data about any entity Field name The heading given for a particular piece of data Key Field a field whose contents are unique to a particular record can be used to uniquely identify a particular record eg student number Data type type of data allowed in a given field eg. text, numeric, date etc Validation data being entered into a field may be restricted by type or specific values.
Some Terms
File Management
24
File Management
File management system consists of system utility programs that run as privileged applications Input to applications is by means of a file Output is saved in a file for long-term storage
25
26
File Operations
Create Delete Open Close Read Write
27
Record
Collection of related fields Treated as a unit
Example: employee record
28
Database
Collection of related data Relationships exist among elements
29
Typical Operations
Retrieve_All Retrieve_One Retrieve_Next Retrieve_Previous Insert_One Delete_One Update_One Retrieve_Few
30
31
32
33
35
36
Device Drivers
Lowest level Communicates directly with peripheral devices Responsible for starting I/O operations on a device Processes the completion of an I/O request
37
38
Logical I/O
Enables users and applications to access records Provides general-purpose record I/O capability Maintains basic data about file
40
Access Method
Reflect different file structures Different ways to access and process data
41
42
Ease of update
File on CD-ROM will not be updated, so this is not a concern
44
45
File Organization
The Pile
Data are collected in the order they arrive Purpose is to accumulate a mass of data and save it Records may have different fields No structure Record access is by exhaustive search
46
Pile
47
File Organization
The Sequential File
Fixed format used for records Records are the same length All fields the same (order and length) Field names and lengths are attributes of the file One field is the key filed
Uniquely identifies the record Records are stored in key sequence
48
File Organization
The Sequential File
New records are placed in a log file or transaction file Batch update is performed to merge the log file with the master file
49
Sequential File
50
File Organization
Indexed Sequential File
Index provides a lookup capability to quickly reach the vicinity of the desired record
Contains key field and a pointer to the main file Indexed is searched to find highest key value that is equal to or precedes the desired key value Search continues in the main file at the location indicated by the pointer
51
File Organization
Comparison of sequential and indexed sequential
Example: a file contains 1 million records On average 50,000 accesses are required to find a record in a sequential file If an index contains 1000 entries, it will take on average 500 accesses to find the key, followed by 500 accesses in the main file. Now on average it is 1000 accesses
52
File Organization
Indexed Sequential File
New records are added to an overflow file Record in main file that precedes it is updated to contain a pointer to the new record The overflow is merged with the main file during a batch update Multiple indexes for the same key field can be set up to increase efficiency
53
54
File Organization
Indexed File
Uses multiple indexes for different key fields May contain an exhaustive index that contains one entry for every record in the main file May contain a partial index
55
Indexed File
56
File Organization
The Direct or Hashed File
Directly access a block at a known address Key field required for each record
57
58
File Directories
Contains information about files
Attributes Location Ownership
Directory itself is a file owned by the operating system Provides mapping between file names and the files themselves
59
60
61
62
63
Each user directory is a simple list of files for that user Still provides no help in structuring collections of files
64
65
66
67
Can have several files with the same file name as long as they have unique path names
68
69
File Sharing
In multiuser system, allow files to be shared among users Two issues
Access rights Management of simultaneous access
70
Access Rights
None
User may not know of the existence of the file User is not allowed to read the user directory that includes the file
Knowledge
User can only determine that the file exists and who its owner is
71
Access Rights
Execution
The user can load and execute a program but cannot copy it
Reading
The user can read the file for any purpose, including copying and execution
Appending
The user can add data to the file but cannot modify or delete any of the files contents
72
Access Rights
Updating
The user can modify, deleted, and add to the files data. This includes creating the file, rewriting it, and removing all or part of the data
Changing protection
User can change access rights granted to other users
Deletion
User can delete the file
73
Access Rights
Owners
Has all rights previously listed May grant rights to others using the following classes of users
Specific user User groups All for public files
74
Simultaneous Access
User may lock entire file when it is to be updated User may lock the individual records during the update Mutual exclusion and deadlock are issues for shared access
75
Fixed Blocking
76
77
78
79
Preallocation
Need the maximum size for the file at the time of creation Difficult to reliably estimate the maximum potential size of the file Tend to overestimated file size so as not to run out of space
80
81
82
83
No external fragmentation Best for sequential files No accommodation of the principle of locality
84
85
86
87
88
89
90
Inodes
Index node Control structure that contains key information for a particular file
91
92
93
94
95
96
97
Inode object
Represents a specific file
Dentry object
Represents a specific directory entry
File object
Represents an open file associated with a process
98
99
Cluster
One or more contiguous sectors
Volume
Logical partition on a disk
100
101