Data Flow Diagram
Data Flow Diagram
Also known as DFD, Data flow diagrams are used to graphically represent the flow of data in a
business information system. DFD describes the processes that are involved in a system to transfer data
from the input to the file storage and reports generation. As its name indicates its focus is on the flow of
information, where data comes from, where it goes and how it gets stored.
Data flow diagrams visually represent systems and processes that would be hard to describe in a
chunk of text. You can use these diagrams to map out an existing system and make it better or to plan
out a new system for implementation. Visualizing each element makes it easy to identify inefficiencies
and produce the best possible system.
Data flow diagrams can be divided into logical and physical. The logical data flow diagram
describes flow of data through a system to perform certain functionality of a business. The physical data
flow diagram describes the implementation of the logical data flow.
Logical data flow diagrams focus on what happens in a particular information flow: what information is
being transmitted, what entities are receiving that info, what general processes occur, etc. The
processes described in a logical DFD are business activities—a logical DFD doesn’t delve into the
technical aspects of a process or system. Non-technical employees should be able to understand these
diagrams.
Physical data flow diagrams focus on how things happen in an information flow. These diagrams specify
the software, hardware, files, and people involved in an information flow. A detailed physical data flow
diagram can facilitate the development of the code needed to implement a data system.
Design
Logical Physical
Feature
What the model How the How the system will be implemented
depicts business (or how the current system
operates. operates).
Type of data Show data stores Master files, transition files. Any
stores representing processes that operate at two
permanent data different times must be connected
collections. by a data store.
Ideally, systems are developed by analyzing the current system (the current logical
DFD) and then adding features that the new system should include (the proposed
logical DFD). Finally, the best methods for implementing the new system should be
developed (the physical DFD). This progression is shown in the figure illustrated below.
Data flow diagrams are also categorized by level. Starting with the
most basic, level 0, DFDs get increasingly complex as the level
increases. As you build your own data flow diagram, you will need to
decide which level your diagram will be.
Level 0 DFDs, also known as context diagrams, are the most basic
data flow diagrams. They provide a broad view that is easily digestible
but offers little detail. Level 0 data flow diagrams show a single
process node and its connections to external entities.
Level 1 DFDs are still a general overview, but they go into more detail
than a context diagram. In a level 1 data flow diagram, the single
process node from the context diagram is broken down into
subprocesses. As these processes are added, the diagram will need
additional data flows and data stores to link them together.
Level 2+ DFDs simply break processes down into more detailed
subprocesses. In theory, DFDs could go beyond level 3, but they rarely
do. Level 3 data flow diagrams are detailed enough that it doesn’t
usually make sense to break them down further.
Data Flow Diagrams Symbols
There are essentially two different types of notations for data flow diagrams (Yourdon & Coad
or Gane & Sarson) defining different visual representations for processes, data stores, data flow
and external entities.
Yourdon and Coad type data flow diagrams are usually used for system analysis and design,
while Gane and Sarson type DFDs are more common for visualizing information systems.
Visually, the biggest difference between the two ways of drawing data flow diagrams is how
processes look. In the Yourdon and Coad way, processes are depicted as circles, while in the
Gane and Sarson diagram the processes are squares with rounded corners.
Process Notations. A process transforms incoming data flow into outgoing data flow.
Datastore Notations. Datastores are repositories of data in the system. They are sometimes also
referred to as files.
Dataflow Notations. Dataflows are pipelines through which packets of information flow. Label
the arrows with the name of the data that moves through it.
External Entity Notations. External entities are objects outside the system, with which the
system communicates. External entities are sources and destinations of the system's inputs and
outputs.
Data flow diagram symbols and notation
To enhance the detail of your data flow diagram, follow the same
process as in step 3. The processes in your level 1 DFD can be broken
down into more specific subprocesses. Once again, ensure you add
any necessary data stores and flows—at this point you should have a
fairly detailed breakdown of your system. To progress beyond a level 2
data flow diagram, simply repeat this process. Stop once you’ve
reached a satisfactory level of detail.
When your diagram is completely drawn, walk through it. Pay close
attention to the flow of information: does it make sense? Are all
necessary data stores included? By looking at your final diagram, other
parties should be able to understand the way your system functions.
Before presenting your final diagram, check with co-workers to ensure
your diagram is comprehensible.