Lecture 06 NMU FORENSICS Ok 11 11
Lecture 06 NMU FORENSICS Ok 11 11
CYBER FORENSICS
CSE344
In the realm of digital data management and security, file formats and
data hiding techniques are pivotal.
They not only determine how data is stored, accessed, and manipulated
but also play significant roles in cybersecurity and digital forensics.
Understanding these concepts is essential for professionals involved in
data management, cybersecurity, and forensic investigations.
This lecture delves deep into file formats, explores various data hiding
techniques, and examines their intricate relationship with digital
forensics.
FILE FORMAT
A file format refers to the specific structure and encoding used to store data in a
file.
A file format is a layout and organization of data within the file.
If a file is to be used by a program, it must be able to recognize and have access to
the data in the file.
For instance, a text document can be recognized by a program such as Microsoft
Word that is designed to run text files but not by a program that is designed to run
audio or video files.
A file format is indicated along with the file name in the form of a file extension.
A suffix at the end of a filename that indicates the file format, and it helps operating
systems and applications determine which program should open the file.
The extension contains three or four letters identifying the format and is separated
from the file name by a period.
Different file formats are tailored for various types of data and applications, such as
images, audio, text, or executable files.
The header provides necessary context for interpreting the data stored in the file.
For instance, an audio file format will have a header that specifies sample rates and
bit depths, which are crucial for playback. If the software can read the header, it can
accurately decode and render the audio.
TYPES OF FILE FORMAT
File formats can be categorized based on the type of data they store and their
usage:
❑ Text Files:
Description: Store data as readable characters using encoding standards like ASCII or UTF-8.
Characteristics: Human-readable, editable with simple text editors, lack complex formatting.
Examples: .txt, .csv, .json, .xml.
❑ Binary Files:
Description: Store data in binary (machine-readable) format.
Characteristics: Not human-readable, require specific applications for interpretation, more
efficient in terms of space.
Examples: Executables (.exe), dynamic link libraries (.dll), compiled programs (.bin).
❑ Image Files:
Description: Contain visual data encoded as pixels.
Characteristics: May use compression (lossy or lossless), support color depth and resolution
specifications.
Examples: .jpeg, .png, .bmp, .gif.
❑ Audio Files:
Description: Encode sound data, including music, speech, and other audio forms.
Characteristics: Can be compressed (lossy or lossless), support various sample rates and bit
depths.
Examples: .mp3, .wav, .flac, .aac.
TYPES OF FILE FORMAT
❑ Video Files:
Description: Store moving visual images accompanied by audio.
Characteristics: Often use advanced compression algorithms to reduce size, support various
resolutions and frame rates.
Examples: .mp4, .avi, .mkv, .mov.
❑ Document Files:
Description: Contain formatted text, images, tables, and other media.
Characteristics: Support rich formatting, embedded objects, and often include metadata.
Examples: .docx, .pdf, .pptx, .xlsx.
❑ Database Files:
Description: Hold structured data, often organized in tables with relationships.
Characteristics: Support indexing, querying, and transactional operations.
Examples: .db, .sql, .mdb, .sqlite.
❑ Compressed Files:
Description: Archive multiple files into a single file, often reducing overall size.
Characteristics: Use compression algorithms, support encryption and password protection.
Examples: .zip, .rar, .7z, .tar.gz.
COMPONENTS OF FILE FORMATS
Most file formats consist of several key components:
❑ Header:
Function: Contains metadata about the file, such as format version, size, type, and other
necessary information to interpret the file.
Example: A JPEG image starts with a specific byte sequence (e.g., FFD8) that identifies
it as a JPEG.
❑ Body:
Function: The main content of the file, holding the actual data, whether it's text, image
pixels, audio samples, etc.
Example: In a .txt file, the body consists of the textual content; in a .png file, it contains
the pixel data.
❑ Footer (Optional):
Function: Marks the end of the file or provides additional metadata. Not all file formats
include a footer.
Example: Some video formats include a footer to indicate the end of the stream or to hold
index information.
FILE HEADER
A file header is a specific section at the beginning of a file that contains
metadata about the file.
This metadata typically includes information about the file's structure,
type, and the data it contains.
Components of a File Header
Magic Numbers:
These act as file signatures
They are specific sequences of bytes at the beginning of a file that uniquely
identify the file format.
Unique identifiers (often in hexadecimal) that signify the file type. For