
SPECIAL SECTION: COMPUTATIONAL SCIENCE

SURVEYS

Three-dimensional computer graphics architecture

Tulika Mitra* and Tzi-cker Chiueh
Computer Science Department, State University of New York at Stony Brook, Stony Brook, NY 11794-4400, USA

Three-dimensional (3D) computer graphics hardware has emerged as an integral part of mainstream desktop PC systems. The aim of this paper is to describe the 3D graphics architecture at a level accessible to the general computational science community. We start with the generic 3D graphics rendering algorithm, the computational requirements of each of its steps, and the basic architectural features of 3D graphics processors. Then we survey the architectural features that have been implemented in or proposed for state-of-the-art graphics processors at the processor and system levels to enable faster and higher-quality 3D graphics rendering. Finally, we describe a taxonomy of parallel 3D rendering algorithms that accelerate the performance of 3D graphics using parallel processing.

1. Introduction

UNTIL recently, real-time three-dimensional (3D) computer graphics was available only in very high-end machines from Silicon Graphics Inc. In the last few years, however, the PC industry has seen an unprecedented growth of cost-effective 3D graphics accelerators. Because a significant amount of industrial research effort has been invested in powerful 3D graphics cards, it is predicted that the performance of these accelerators will surpass the performance of SGI machines by the year 2001 (ref. 1). 3D graphics applications place a stringent demand on the processing power and on the data transfer bandwidth of the memory subsystem and interconnecting buses. The growing importance of 3D graphics applications has motivated CPU vendors to add new instructions to the existing instruction set architecture, and to develop higher-bandwidth memory and system buses. In fact, the data-intensive nature of 3D applications has been one of the primary motivations behind the introduction of advanced Dynamic Random Access Memory (DRAM) architectures for host memory and for the local memory on graphics cards.

In this article, we start with the basic steps required to render a polygon-based 3D graphics model and their associated computation and bandwidth requirements. Then we examine the major design issues in generating photo-realistic images on desktop machines in real time, and the architectural innovations that attempt to address these problems. Finally, we present a taxonomy of parallel rendering algorithms, which use parallel processing hardware to render extremely complicated 3D graphics models.

2. 3D graphics pipeline

Polygon-based 3D graphics rendering is the process of converting the geometric description of a 3D model (or a virtual world) to a photo-realistic two-dimensional image (a 2D array of picture elements or pixels) that can be displayed on a computer monitor. Each pixel represents a colour value consisting of red, green, and blue (RGB) components. The sequence of steps involved in this conversion forms the 3D graphics pipeline, each stage of which can be implemented either in hardware or software.

The input to the 3D graphics pipeline is a virtual world created by application programmers. This world/scene consists of a mathematical representation of a set of objects, their positions relative to each other, an optional set of light sources, together with a viewpoint that provides a camera angle into the virtual world. Objects or primitives are typically represented by a set of triangles for ease of implementation. The description of the 3D model is passed to the 3D graphics engine through a standard Application Programmer Interface (API) such as OpenGL2 or Direct 3D3.

The 3D graphics pipeline itself consists of two distinct stages: geometric transformation and rasterization. The geometric transformation stage maps triangles from a 3D coordinate system (object space) to a 2D coordinate system (image space) by performing a series of transformations. The computation in this stage is mostly floating-point intensive, involving linear algebraic operations such as matrix multiplication and dot products. The rasterization stage converts transformed triangles into pixel values to be shown on the computer screen. This stage involves mostly integer arithmetic, such as simple additions and comparisons. An excellent reference to the 3D graphics pipeline can be found in Foley et al.4.

*For correspondence. (e-mail: [email protected])
838 CURRENT SCIENCE, VOL. 78, NO. 7, 10 APRIL 2000

2.1 Geometric transformation

At the input of the geometric transformation stage, each triangle consists of three vertex coordinates, vertex normals and other attributes such as colour. For ease of manipulation, vertices are represented in homogeneous
coordinates, which are quadruples of the form {x, y, z, w}, where in most cases w is 1. (The tuple {x/w, y/w, z/w} is the Cartesian coordinate of the homogeneous point.) The geometric transformation stage applies a sequence of operations on the vertices of the triangle. Figure 1 shows the geometric transformation part of a typical 3D graphics pipeline, which consists of the following stages:

2.1.1 Model and viewing transformation: Modelling transformation positions primitives with respect to each other, and the viewing transformation orients the resulting set of primitives to the user viewpoint. These two transformations can be combined into a single multiplication of the homogeneous vertex coordinate by a 4 × 4 matrix, which is implemented as 16 floating point multiplications and 12 floating point additions. Lighting calculation, in addition, requires the transformation of the vertex normal by a 3 × 3 inverse transformation matrix, which costs 9 floating point multiplications and 6 floating point additions.

2.1.2 Lighting: This stage evaluates the colour of the vertices given the direction of light, the vertex position, the surface-normal vector and the material characteristics of an object's surface. We will consider here only the most popular shading model, called Gouraud shading, which interpolates the colour of the three vertices across the surface. Evaluating the colour of a vertex requires a variable amount of computation depending on the number of light sources and the material properties. We assume the simplest case of a single light at infinite distance, and a material with only ambient and diffuse coefficients. This lighting model calculates the following equation for each R, G, B component:

Cdiffuse × Clight × (N ⋅ L) + Areflection × Alight,

where Clight and Cdiffuse are the light source intensity and diffuse reflection coefficient; Alight and Areflection are the ambient light intensity and ambient light coefficient; and (N ⋅ L) is the dot product of the surface-normal vector and the light direction vector. (N ⋅ L) is calculated only once. However, the rest of the equation must be calculated independently for the R, G and B components of each vertex. This requires a total of (3 + 3 × 3 = 12) multiplications and (2 + 3 × 1 = 5) additions per vertex.

2.1.3 Projection transformation: This transformation projects objects onto the screen. There are two types of projections: (1) orthographic projection, which keeps the original size of 3D objects and hence is useful for architectural and computer-aided design; (2) perspective projection, which produces more realistic images by making distant objects appear smaller. Each of these transformations again involves a 4 × 4 matrix multiplication. However, as most entries in these matrices are zero, a careful implementation requires only 6 multiplications and 3 additions.

2.1.4 Clipping: The application programmer defines a 3D viewing frustum such that only the primitives within the frustum are projected onto the screen. This step removes the objects that are outside the viewable area. The algorithm requires one floating point comparison per view-boundary plane, and thus 6 comparisons per vertex. If a triangle is partially clipped, then the algorithm must calculate the position of the new vertices at the intersection of the triangle edge and the view-boundary plane. The number of such operations performed depends on the actual number of triangles that cross the view-boundary planes, which varies from one viewpoint to another. Hence, we will not take this cost into account in our computation requirement calculation.

2.1.5 Perspective division: If a perspective transformation is applied to a homogeneous vertex, then the w value no longer remains equal to 1. This stage divides x, y and z by w to convert the vertex to Cartesian coordinates.

2.1.6 Viewport mapping: This step performs the final scaling and translation to map the vertices from the projected coordinate system to the actual viewport on the computer screen. Each vertex component is scaled by an independent scale factor and offset by an independent offset, i.e. 3 floating point multiplications and 3 floating point additions.

The total computation requirement to perform geometry transformation per vertex is then 46 multiplications, 29 additions, 3 divisions, and 6 comparisons. Modern processors can execute floating point addition, subtraction, comparison, and multiplication operations quite fast using pipelined execution units. The floating point division operation, however, is not usually pipelined, and can take as long as 50 floating point additions' worth of time. The total floating point operation requirement for a single vertex transformation is then around 130.

Figure 1. Geometry transformation stage of a 3D graphics pipeline.
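The per-vertex arithmetic accounted for above can be sketched in a few lines of Python. This is an illustrative sketch, not code from the paper; the function and parameter names are our own, and the viewport convention (mapping the [-1, 1] range onto the screen) is an assumption.

```python
# Sketch of the per-vertex work in the geometric transformation stage:
# model/view transform, diffuse + ambient lighting, perspective division
# and viewport mapping. Names and conventions are illustrative only.

def mat4_mul_vec4(m, v):
    # One 4 x 4 matrix-vector product: 16 multiplications, 12 additions,
    # as counted in section 2.1.1.
    return [sum(m[r][c] * v[c] for c in range(4)) for r in range(4)]

def transform_vertex(mv, vertex, normal, light_dir,
                     c_diff, c_light, a_refl, a_light, width, height):
    # Model and viewing transformation on a homogeneous vertex {x, y, z, w}
    x, y, z, w = mat4_mul_vec4(mv, list(vertex) + [1.0])
    # Lighting: Cdiffuse * Clight * (N . L) + Areflection * Alight,
    # with (N . L) computed once and the rest done per R, G, B channel
    n_dot_l = max(0.0, sum(n * l for n, l in zip(normal, light_dir)))
    colour = [c_diff[i] * c_light[i] * n_dot_l + a_refl[i] * a_light[i]
              for i in range(3)]
    # Perspective division: {x/w, y/w, z/w} (3 divisions)
    sx, sy, sz = x / w, y / w, z / w
    # Viewport mapping: independent scale and offset per component
    return (sx * 0.5 * width + 0.5 * width,
            sy * 0.5 * height + 0.5 * height, sz), colour
```

With an identity model-view matrix and a 640 × 480 viewport, a vertex at the origin lands at the viewport centre (320, 240).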
Today, even a modest scene requires around 1 million vertex transformations per second to achieve a rate of 30 frames per second. This translates to 130 MFlops (million floating point operations) per second. Today's PCs have sufficient floating point computation power and therefore typically perform the geometric transformation stage in the main CPU.

2.2 Rasterization

The rasterization stage comprises two steps. The scan conversion step decomposes a triangle into a set of pixels, and calculates the attributes of each pixel, such as colour, depth, alpha, and texture coordinates. The pixel processing step performs texture mapping, depth test and alpha blending for individual pixels. Figure 2 shows the rasterization stage of the graphics pipeline.

Figure 2. Rasterization stage of a 3D graphics pipeline.

There are two distinct mechanisms that are quite popular for the scan conversion step: the linear interpolation algorithm and the linear edge function algorithm. In linear interpolation-based algorithms4, the triangle set-up step first computes the slopes, with respect to the X-axis, of all the attributes along each edge of the triangle. Next, the edge processing step iterates along the edges and computes the two end points of a horizontal pixel segment, called a span. Finally, the span processing step iterates along each span and computes the attributes of each pixel on the span through linear interpolation (Figure 3).

Figure 3. Scan conversion of a triangle using linear interpolation algorithm.

In linear edge function-based algorithms5, each edge of the triangle is defined by a linear edge function. The triangle is scan converted by evaluating, at each pixel's centre, the function for all edges, and processing only those pixels that are inside all the edges. The attributes are also computed from the linear functions. Typically, the traversal of a triangle proceeds down from a starting point, and moves outward from the centre line6. The centre line shifts to the left or right until the traversal steps outside of the triangle (Figure 4 a). To achieve parallelism, the triangle may be traversed one pixelstamp at a time, rather than pixel by pixel6. A pixelstamp is an array of pixels of dimension X × Y. Evaluation of the edge functions for all the pixels within a pixelstamp can start in parallel, and only the qualified pixels are sent to the pixel processing stage. Triangle traversal visits all pixelstamps that are completely or partially inside the triangle (Figure 4 b).

The rasterization stage also includes texture mapping, which is a crucial and widely used technique that wraps a 2D texture image on the surface of a 3D object to emulate the visual effects of complex 3D geometric details, such as a wooden surface, a tiled wall, etc. Each vertex of a texture-mapped triangle comes with a texture coordinate that defines the part of the texture map to be applied (refer to Figure 5). These texture coordinates are interpolated across the triangle surface via scan conversion. The most popular texture mapping implementation is based on mip-mapping7 (Figure 6), which pre-calculates multiple reduced-resolution versions of a texture image. Each resolution level corresponds to a particular depth. Coarser (finer) resolution levels are used for farther (closer) objects. For a 3D object at a given depth, the mip-mapping algorithm chooses a pair of adjacent resolution levels of the texture image, and performs weighted filtering of 8 texels (texture pixels) from these two resolution levels. This tri-linear filtering eliminates visual discontinuities when different mip-map levels are applied on the same object.

Before a pixel is written to the frame buffer, the rendering engine needs to check whether that triangle is actually visible at that pixel, i.e. that no other triangle overlaps that pixel making it invisible. This is known as hidden surface removal for opaque objects. The number of overlapping triangles for a pixel is called the depth complexity of the pixel. The majority of graphics accelerators achieve hidden surface removal using a depth/Z buffer, which is an array with the same dimension as the frame buffer. After a triangle is scan-converted into a set of


Figure 4. Scan conversion of a triangle using linear edge function.

Figure 5. Texture mapping of a triangle. X, Y represent the coordinates of the triangle in image space. U, V represent the coordinates of the triangle in texture space and are known as the texture coordinates.

Figure 6. Mip-map and tri-linear filtering. Each level in the mip-map represents a reduced resolution texture image from the previous mip-map. Tri-linear filtering performs weighted filtering of 8 texels, 4 from the lower resolution level and 4 from the higher resolution level.

pixels, each pixel goes through the depth test. This test compares the depth value of the current pixel against the depth value of the pixel at the corresponding X–Y coordinate of the frame buffer. If the new value is smaller, the current pixel is closer to the viewpoint than the old pixel, and therefore the depth and colour values of the current pixel replace the old values. Otherwise, the new pixel values are discarded. For transparent objects, the colours of the old and new pixels are composited according to their transparency, or alpha value. This composition is known as alpha blending and requires another buffer, called the alpha buffer, for storing the alpha values.

The rasterization stage is quite compute and memory intensive. Let us consider a frame buffer with resolution 1280 × 1024 and an average depth complexity for a scene of about 3. Assuming 32-bit pixels and 30 frames/sec, the frame buffer read bandwidth requirement will be 1280 × 1024 × 3 × 30 × 4 = 472 MB/sec. Similarly, the rendering engine would require a Z-buffer read bandwidth of 472 MB/sec assuming a 32-bit Z-buffer. If a pixel passes the depth test, both the colour and depth information have to be written back. Assuming 50% of the pixels pass the depth test, the bandwidth requirements are 236 MB/sec for frame buffer write and 225 MB/sec for Z-buffer write. Finally, each pixel requires 8 texels to perform tri-linear filtering. Even assuming an aggressive texture cache that stores the recently accessed texels, around 2.5 texel accesses per pixel are required, which translates to 1280 × 1024 × 3 × 30 × 2.5 × 2 = 590 MB/sec of texture memory bandwidth (assuming each texel is 16-bit).

Figures 7 and 8 show the triangle and pixel processing requirement per frame for the Viewperf benchmarks8, and Figure 9 shows the texture bandwidth requirement for some sample applications. A more detailed analysis of the compute and bandwidth requirements of the rendering stages for different real-world applications can be found in refs 9–11.

Theoretically, the entire 3D graphics pipeline can be implemented in software. The geometry transformation stage is extremely floating point intensive, which was beyond the capability of general purpose processors even a few years ago. Today, however, with processors having a peak performance of around 400 MFlops/sec, the host CPU is capable of handling the load. The pixel-related rasterization operations, on the other hand, require tremendous memory bandwidth to process around 100 million pixels/sec.
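The bandwidth arithmetic above can be reproduced directly from the stated assumptions (1280 × 1024 resolution, depth complexity 3, 30 frames/sec, 32-bit colour and Z, 16-bit texels, 2.5 texel accesses per pixel). A sketch; the variable names are ours, not the paper's.

```python
# Reproduces the rasterization bandwidth estimates given in the text.
W, H, DEPTH_COMPLEXITY, FPS = 1280, 1024, 3, 30
PIXEL_BYTES = 4          # 32-bit colour; each Z-buffer entry is also 32-bit
TEXEL_BYTES = 2          # 16-bit texels
TEXELS_PER_PIXEL = 2.5   # effective accesses with an aggressive texture cache

pixels_per_sec = W * H * DEPTH_COMPLEXITY * FPS

fb_read_mb  = pixels_per_sec * PIXEL_BYTES / 1e6       # ~472 MB/sec
z_read_mb   = fb_read_mb                               # same formula for Z reads
fb_write_mb = fb_read_mb * 0.5   # if 50% of the pixels pass the depth test
tex_read_mb = pixels_per_sec * TEXELS_PER_PIXEL * TEXEL_BYTES / 1e6  # ~590 MB/sec
```

Rounding to whole MB/sec recovers the 472, 236 and 590 MB/sec figures quoted above.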



It is imperative that a separate hardware accelerator be dedicated to rasterization. Hence, two distinct classes of graphics architectures have been implemented: (1) combined geometry processor and rasterizer, the prime examples being RealityEngine12 and InfiniteReality13 from SGI; (2) host CPU-based geometry processing and a dedicated hardware accelerator for rasterization. Almost all of today's low-end 3D graphics accelerators belong to the second class. In this case, the transformed geometry (vertex position, colour, and texture coordinates), as well as the texture images, are transferred over a high-speed system bus such as PCI (Peripheral Component Interconnect) to the rasterization hardware accelerator. A major design issue for rasterization-only graphics accelerators is how to use the system bus bandwidth efficiently.

3. Architectural innovations

To scale up the performance of the generic 3D graphics architecture described in the previous section, the following architectural issues need to be resolved:

• Although in theory state-of-the-art processors seem to have sufficient raw floating-point computation power to support geometric transformation at interactive frame rates, in practice the CPUs lag behind the rasterization performance of the 3D graphics cards. Therefore higher floating-point performance is essential to achieve faster frame rates with better rendering quality.

• The data transfer bandwidth between the CPU, which performs geometric transformation, and the 3D graphics card, which performs rasterization, plays a crucial role in the extent to which the entire 3D graphics pipeline can be sped up. The heavy use of texture maps in modern 3D applications further exacerbates the bandwidth problem.

• The memory access performance of the scan conversion process has a dominating impact on the overall rasterization performance. Improving the rasterization algorithm's data access locality is pivotal to the graphics card's performance.

The following subsections describe architectural techniques that have been proposed and implemented to address these issues.

Figure 7. Total number of triangles processed by the rasterization engine at different frames or viewangles for various 3D applications.

Figure 8. Total number of pixels processed by the rasterization engine at different frames or viewangles for various 3D applications.

Figure 9. Total texture memory bandwidth in MBytes for different frames.

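The first two issues can be quantified from figures already used in this paper: roughly 130 floating point operations per transformed vertex (section 2.1) and roughly 32 bytes of geometry data per vertex (section 3.2). A hedged sketch; the function and variable names are ours.

```python
# Geometry compute load and vertex traffic toward the graphics card for a
# given vertex rate, using ~130 flops/vertex (section 2.1) and
# ~32 bytes/vertex (section 3.2). Illustrative only.
def geometry_load(vertices_per_sec, flops_per_vertex=130, bytes_per_vertex=32):
    mflops = vertices_per_sec * flops_per_vertex / 1e6
    bus_mb_per_sec = vertices_per_sec * bytes_per_vertex / 1e6
    return mflops, bus_mb_per_sec
```

For a 1 million vertex/sec workload this gives 130 MFlops of geometry work on the CPU and 32 MB/sec of vertex traffic over the system bus, matching the figures quoted in sections 2.1 and 3.2.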



3.1 Streaming SIMD extensions to instruction set

Many current microprocessors have added Single Instruction Multiple Data (SIMD) type instructions to accelerate integer processing for media applications such as audio, video and image processing. These include Intel's Multimedia Extensions (MMX), HP's Multimedia Architectural Extensions (MAX-2), Sun Microsystems' Visual Instruction Set (VIS), etc. However, the geometry processing stage of the 3D graphics pipeline is based on floating point data types. To exploit the parallelism in the geometry processing stage, Intel, AMD and others have recently added floating point SIMD instructions14,15 to the instruction set. The main idea behind these extensions is that geometry processing requires 32-bit floating point data types, whereas the floating point paths (registers and ALUs) are 64 bits wide in most modern processors. Because vertex processing is inherently parallelizable, SIMD instructions allow two vertex-processing operations to be performed simultaneously using a single floating-point instruction, with each vertex using half of the 64-bit data path. Yang, Sano and Lebeck16 showed that SIMD instructions can improve geometry transformation performance by 20 to 30%.

3.2 Accelerated graphics port

Figure 10 shows a high-level view of the components of a PC desktop system. It consists of the processor, main memory, the north bridge, PCI-based devices and various interconnecting buses. The north bridge contains the memory controller and provides connections among the different system components. The main processor fetches the 3D model from main memory, performs geometry transformation, and writes the result back to main memory. The graphics accelerator sitting on the PCI bus uses DMA (Direct Memory Access) to retrieve that data from the main memory and then performs rasterization. One major bottleneck of PC-based systems is the transfer bandwidth over the PCI bus, which connects the system memory to the local memory of the graphics accelerator1. The CPU needs to transfer geometry data, graphics commands as well as texture data to the graphics accelerator. Typically, the geometry information associated with a vertex is about 32 bytes1, including the vertex coordinates, colour, and texture coordinates, i.e. 32 MB/sec for 1 million vertices. This information crosses the processor bus twice (once for reading and once for writing in the geometry transformation stage), the PCI bus once (transferring data to the graphics card), and the memory bus three times (in all the above cases). In addition, a large amount of texture data needs to be transferred over the PCI bus as well. The peak bandwidth of a 32-bit, 33-MHz PCI bus is 132 MB/sec, which is still not quite sufficient. To solve this problem, Intel introduced a new bus specification called Accelerated Graphics Port (AGP)17. AGP connects the graphics accelerator exclusively to the main memory subsystem (refer to Figure 11). AGP has four main advantages over PCI:

1. Reduction of load on PCI: The primary advantage of AGP is that it eliminates the graphics-related bandwidth requirement from the PCI bus by transferring data from the main memory to the graphics card over a dedicated bus.

2. Higher peak bandwidth: AGP 2X (32-bit data path at 66 MHz) transfers data on both edges of the clock, thereby achieving a peak bandwidth of 528 MB/sec. AGP 4X has a bandwidth of 1 GB/sec.

3. Higher sustainable bandwidth: AGP supports pipelining of requests, i.e. overlapping the access time of request n with the issue of requests n + 1, n + 2 and so on. It also supports sidebanding, which provides extra address lines to issue new requests while the main data/address lines are transferring the data corresponding to previous requests. These two features make it more likely for AGP to achieve a sustained bandwidth much closer to its peak bandwidth.

4. Direct memory execute: The amount of local memory present in the graphics accelerator is limited. However, to obtain more realistic images, applications use more and more high-resolution textures, all of which cannot fit into the local memory. Hence, the graphics driver needs to perform texture memory management that keeps track of the textures present in the local memory and downloads the required textures before they are used. This can introduce significant latency, as the rendering engine waits for the complete mip-map of the texture image to be downloaded over the PCI/AGP bus.

Figure 10. High-level view of the components of a PCI-based graphics subsystem.
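The peak-bandwidth figures quoted for PCI and AGP follow from bus width, clock rate and transfers per clock. A sketch of the arithmetic; the function name is ours.

```python
# Peak bus bandwidth in MB/sec:
# (bus width in bytes) x (clock in MHz) x (transfers per clock).
def peak_mb_per_sec(bus_bits, clock_mhz, transfers_per_clock=1):
    return bus_bits // 8 * clock_mhz * transfers_per_clock

pci    = peak_mb_per_sec(32, 33)     # 132 MB/sec, shared with other PCI devices
agp_2x = peak_mb_per_sec(32, 66, 2)  # 528 MB/sec: both edges of a 66 MHz clock
agp_4x = peak_mb_per_sec(32, 66, 4)  # 1056 MB/sec, i.e. about 1 GB/sec
```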



AGP provided a new feature called direct memory execute (DIME) that allows the graphics accelerator to directly address main system memory over the AGP bus. A translation table in the AGP controller, similar to the virtual-to-physical address translation table in the CPU, allows non-consecutive memory pages to appear as a single contiguous address space to the accelerator. This way the graphics accelerator can cache the heavily used textures in the local memory, and access the comparatively little-used ones directly from the system memory.

Figure 11. High-level view of the components of an AGP-based graphics subsystem.

3.3 Bucket rendering

Traditional rendering requires random access to the entire frame buffer, and it is not very cost-effective to provide a large high-bandwidth frame buffer. An interesting architectural idea that addresses this problem is bucket rendering. Bucket rendering is a technique in which the screen space is partitioned into tiles (also called chunks), and all the primitives of the scene are sorted into buckets, where each bucket contains the primitives that intersect the corresponding tile. This architecture renders the scene one tile/bucket at a time, thereby reusing the Z-buffer, the alpha-buffer as well as the other necessary buffers for storing the results of intermediate rendering. At the end, all the tiles are collected together to form the final image. Bucket rendering has been implemented in Pixel-Planes 5 (ref. 18), PixelFlow19, Talisman20 and the commercially available PowerVR from NEC/VideoLogic. The main advantages of bucket rendering are the following:

• Since only one tile's worth of rasterization buffer is required, as opposed to a full-screen buffer, it is possible to use more bits per buffer entry to support more advanced rendering techniques such as oversampling or anti-aliasing, which rasterizes each pixel at a higher resolution and then down-samples the result to the required resolution.

• The tiled architecture matches very well the emerging embedded DRAM process, which can provide small on-chip memory and high memory access bandwidth.

The main disadvantages of this architecture are:

1. It requires an additional pipeline stage to sort triangles into buckets, thus increasing the total rendering latency.

2. Redundant work is performed because large primitives may overlap with multiple tiles.

3.4 Composited image layers: Talisman

Microsoft introduced the Talisman architecture in 1996; it comprised several independent ideas. However, the key distinguishing feature of Talisman is composited image layer rendering20, which exploits frame-to-frame coherence for the first time. In traditional architectures, all the primitives are rendered in each frame even though there is a great deal of coherence between consecutive frames. Instead, Talisman renders each primitive on a separate image surface. All the image surfaces are then composited together to form the final image. In the next frame, the image for a primitive is transformed in the screen space, given the transformation matrices in the object space. If the error introduced by the image-space transformation is below a threshold, the transformed image can be used as the final result of rendering. This architecture relies on the fact that image-space transformation is much less expensive than object-space transformation, and that image layer composition can be performed more efficiently. The main disadvantages of this architecture are its complexity and gate count, and its incompatibility with traditional APIs like OpenGL. As a result, no commercial attempt has been made so far to implement the Talisman architecture.

4. Parallel architecture

The 3D graphics pipeline is computation intensive, but is quite amenable to parallel implementation both in the object space as well as in the image space. Exploiting the graphics pipeline's parallelism can significantly reduce the total polygon rendering time. A considerable amount of research effort has been invested so far to design and implement various efficient parallel polygon rendering engines. In this section, we briefly describe different classes


Because the fundamental issue in 3D rendering is sorting the geometric primitives with respect to a given viewpoint, the parallelization strategies for polygon rendering can be classified as sort-first, sort-middle and sort-last, depending on where the sorting operation is performed19. The three architectures are illustrated in Figure 12.

Figure 12. Sort-first, sort-middle, and sort-last parallel rendering architectures. The main difference between the architectures is where the distribution/sorting of primitives takes place. G represents the geometric transformation engine and R represents the rasterization engine.

In the sort-first strategy21, the image space is partitioned into regions and each processor is responsible for all the rendering calculations (both geometry and rasterization) in the region to which it is assigned. The screen-space bounding box of each 3D primitive is calculated by performing just enough transformations. Every 3D primitive is then distributed to the processors responsible for the image regions with which its bounding box overlaps; one primitive can be sent to multiple processors. From this point on, the set of 3D primitives in each processor goes through geometric transformation and rasterization completely independently of primitives in other processors. Finally, the image regions from the processors are simply combined to form the final rendered image. The sort-first architecture has received the least attention so far because of the load-imbalance problem in the transformation and rasterization stages. However, as Mueller21 pointed out, the sort-first architecture can easily exploit frame-to-frame coherence, and he proposed a new adaptive algorithm to achieve better load balancing.

In the sort-middle strategy22, the image space is again partitioned and each processor is responsible for one image region. 3D primitives are first transformed and then distributed to different processors based on the transformed X and Y coordinates of the primitives. Again, a primitive is sent to multiple processors if it crosses image-region boundaries. After distribution, each processor performs rasterization on the transformed primitives independently of one another to produce a sub-image for the associated image region. The sub-images are then combined to form the final projection image. Sort-middle seems to be the most natural architecture and has been implemented both in hardware and in software; both InfiniteReality13 and RealityEngine Graphics12 use the sort-middle strategy. The main disadvantages of the sort-middle architecture are the load imbalance in the rasterization stage and the communication cost due to the redistribution of primitives after transformation. Crockett and Orloff22 proposed a static scan-line-based scheme for image-space partitioning. Whitman23 suggested adaptive load-balancing schemes for the sort-middle architecture, while Ellsworth24 took advantage of frame-to-frame coherence to achieve better load balancing.

The sort-last strategy partitions the 3D input model at the beginning of the rendering pipeline without taking into account the viewpoint or object coordinates, performs geometric transformation and rasterization on each partition independently to produce a partial image, and finally composites the partial images according to the depth value of each image pixel. Because of its simplicity, the sort-last approach has been implemented in several systems, including PixelFlow19 from the University of North Carolina, which uses a high-speed combining network to composite sub-images. The performance of the sort-last strategy depends critically on the composition stage, and various methods have been proposed to perform it. The simplest method is to send the sub-images to a single compositing processor19. Other proposed schemes are binary-tree composition25, binary-swap composition26,27 and parallel pipeline composition28. Mitra and Chiueh29 showed that all previously proposed sub-image compositing methods can be unified in a single framework.

In general, in sort-last, a processor sends all the pixels of the relevant image space to another processor; this is known as the sort-last-full technique30. Cox and Hanrahan31 pointed out that it is sufficient to send only the 'active' pixels of the image space, which is termed sort-last-sparse. The trade-off between the two methods is the communication overhead versus the extra processing required to encode the 'active' pixels.

Until recently, all parallel rendering engines were implemented either as dedicated ASICs, such as RealityEngine and InfiniteReality, or on massively parallel message-passing or distributed shared-memory machines such as the Intel Paragon. Currently, however, advances in processor and graphics-accelerator technology, as well as the emergence of gigabit local network technology such as Myrinet, have made it possible to implement high-performance 3D graphics engines on a cluster of workstations, each of which is equipped with a low-cost 3D graphics card32,33. The basic parallelization strategies remain the same for these architectures; however, the loosely coupled network topology may require different kinds of load-balancing and composition algorithms.
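The sort-last composition step, merging partial images by per-pixel depth comparison, can be sketched as follows. This is a serial reduction over full sub-images (the sort-last-full case); the (depth, color) pixel representation and all names are illustrative assumptions, not the interface of any of the cited systems.

```python
# Sketch of sort-last depth compositing: each processor produces a full-screen
# partial image of (depth, color) pairs; the composite keeps, at every pixel,
# the sample with the smallest depth (closest to the viewer).

INF = float("inf")
BACKGROUND = (INF, None)  # empty pixel: infinitely far, no color

def composite(partial_images):
    """Serially composite a list of partial images (each a list of (z, color))."""
    width = len(partial_images[0])
    result = [BACKGROUND] * width
    for image in partial_images:
        # per pixel, keep whichever sample is nearer (smaller z)
        result = [min(a, b, key=lambda p: p[0]) for a, b in zip(result, image)]
    return result

# Two processors rendered different primitives into 4-pixel partial images.
img_a = [(2.0, "red"), (INF, None), (5.0, "red"), (1.0, "red")]
img_b = [(3.0, "blue"), (4.0, "blue"), (INF, None), (0.5, "blue")]
print(composite([img_a, img_b]))
# → [(2.0, 'red'), (4.0, 'blue'), (5.0, 'red'), (0.5, 'blue')]
```

Binary-tree, binary-swap and parallel-pipeline composition differ only in how this pairwise depth-merge is scheduled across processors, which determines the communication pattern and load balance of the composition stage.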
5. Conclusion

A unique characteristic of 3D graphics applications is that there is no end to the addition of new features to the standard graphics pipeline. Unlike microprocessors, 3D graphics requires both advances in performance, i.e. more triangles and more pixels per second, and new and improved techniques that deliver more realistic images and cinematic effects. Engineering and scientific 3D applications, such as Computer Aided Design (CAD) and Computational Fluid Dynamics (CFD), as well as entertainment applications, such as computer games and animated movies, all require higher-quality rendered images at a faster rate, placing an increasing demand on the triangle and pixel rates. Therefore, we expect that 3D graphics architecture will remain a challenging field in the foreseeable future, with abundant room for further algorithmic and architectural innovation.
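The scale of the triangle- and pixel-rate demand can be made concrete with back-of-envelope arithmetic. The frame rate, model complexity, resolution and depth complexity below are assumed figures for illustration, not numbers from the article.

```python
# Back-of-envelope rendering-rate arithmetic: sustaining an interactive frame
# rate over a complex model multiplies into very large per-second rates.
frames_per_second = 30
triangles_per_frame = 1_000_000   # assumed model complexity
pixels = 1280 * 1024              # assumed display resolution
depth_complexity = 3              # assumed average overdraw per pixel

triangle_rate = frames_per_second * triangles_per_frame
pixel_rate = frames_per_second * pixels * depth_complexity
print(triangle_rate)  # → 30000000 triangles/s
print(pixel_rate)     # → 117964800 pixel writes/s
```

Even these modest assumptions demand tens of millions of transformed triangles and over a hundred million rasterized pixels per second, which is why the parallel architectures of Section 4 matter.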
1. Kirk, D., in Proc. of the 13th ACM SIGGRAPH/Eurographics Workshop on Graphics Hardware (keynote address), http://www.merl.com/hwws98/presentations/kirk/index.html, 1998, pp. 11–13.
2. Neider, J., Davis, T. and Woo, M., OpenGL Programming Guide, Addison-Wesley, 1993.
3. Microsoft Corporation, http://www.microsoft.com/directx/overview/d3d/default.asp, 1996.
4. Foley, J. D., van Dam, A., Feiner, S. K., Hughes, J. F. and Phillips, R. L., Computer Graphics: Principles and Practice, Addison-Wesley, 1990, 2nd edn.
5. Fuchs, H. et al., in Proc. of the 12th Annual ACM Conference on Computer Graphics (SIGGRAPH), 1985.
6. Pineda, J., in Proc. of the 15th Annual ACM Conference on Computer Graphics (SIGGRAPH), 1988, pp. 17–20.
7. Williams, L., in Proc. of the 10th Annual ACM Conference on Computer Graphics (SIGGRAPH), 1983.
8. OpenGL Performance Characterization Group, http://www.spec.org/gpc/opc.static/opcview.htm.
9. Dunwoody, J. C. and Linton, M. A., in Proc. of the ACM Symposium on Interactive 3D Graphics, 1990, pp. 155–163.
10. Chiueh, T. and Lin, W., in Proc. of the 12th ACM SIGGRAPH/Eurographics Workshop on Graphics Hardware, 1997, pp. 17–24.
11. Mitra, T. and Chiueh, T., in Proc. of the 32nd Annual ACM/IEEE International Symposium on Microarchitecture (MICRO), 1999, pp. 62–71.
12. Akeley, K., in Proc. of the 20th Annual ACM Conference on Computer Graphics (SIGGRAPH), 1993, pp. 109–116.
13. Montrym, J. S., Baum, D. R., Dignam, D. L. and Migdal, C. J., in Proc. of the 24th Annual ACM Conference on Computer Graphics (SIGGRAPH), 1997, pp. 293–302.
14. Intel Corporation, http://developer.intel.com/design/PentiumIII/manuals/, 1999.
15. Advanced Micro Devices, Inc., http://www.amd.com/products/cpg/3dnow/inside.html.
16. Yang, C., Sano, B. and Lebeck, A. R., in Proc. of the 31st Annual ACM/IEEE International Symposium on Microarchitecture, 1998, pp. 14–24.
17. Intel Corporation, http://www.intel.com/technology/agp/agp_index.htm, 1998.
18. Fuchs, H. et al., in Proc. of the 16th Annual ACM Conference on Computer Graphics (SIGGRAPH), 1989, pp. 79–88.
19. Molnar, S., Eyles, J. and Poulton, J., in Proc. of the 19th Annual ACM Conference on Computer Graphics (SIGGRAPH), 1992, pp. 231–240.
20. Torborg, J. and Kajiya, J. T., in Proc. of the 23rd Annual ACM Conference on Computer Graphics (SIGGRAPH), 1996, pp. 353–363.
21. Mueller, C., in Proc. of the ACM Symposium on Interactive 3D Graphics, 1995, pp. 75–84.
22. Crockett, T. W. and Orloff, T., IEEE Parallel Distrib. Technol.: Syst. Appl., 1994, 2, 17–28.
23. Whitman, S., IEEE Comput. Graphics Appl., 1994, 14, 41–48.
24. Ellsworth, D., IEEE Comput. Graphics Appl., 1994, 14, 33–40.
25. Shaw, C., Green, M. and Schaeffer, J., Advances in Computer Graphics Hardware III, 1991.
26. Ma, K., Painter, J. S., Hansen, C. D. and Krogh, M. F., IEEE Comput. Graphics Appl., 1994, 14, 59–68.
27. Karia, R. J., in Proc. of the IEEE Scalable High Performance Computing Conference, 1994, pp. 252–258.
28. Lee, T., Raghavendra, C. S. and Nicholas, J. B., IEEE Trans. Vis. Comput. Graphics, 1996, 2, 202–217.
29. Mitra, T. and Chiueh, T., in Proc. of the 6th IEEE International Conference on Parallel and Distributed Systems, 1998.
30. Molnar, S., Cox, M., Ellsworth, D. and Fuchs, H., IEEE Comput. Graphics Appl., 1994, 14, 23–32.
31. Cox, M. and Hanrahan, P., IEEE Parallel Distrib. Technol.: Syst. Appl., 1994, 2.
32. Samanta, R. et al., in Proc. of the 14th ACM SIGGRAPH/Eurographics Workshop on Graphics Hardware, 1999, pp. 107–116.
33. Experimental Computer Systems Lab, Department of Computer Science, SUNY at Stony Brook, http://www.ecsl.cs.sunysb.edu/sunder.html.
