0% found this document useful (0 votes)

82 views13 pages

GPU-Driven Real-Time Mesh Contour Vectorization

This document summarizes a research paper about GPU-driven real-time mesh contour vectorization. The paper proposes a method that includes preprocessing mesh data on the CPU, extracting contour edges on the GPU using Bresenham's algorithm, parallelizing Potrace to trace contour boundaries into edge loops, segmenting the loops into stroke curves on the GPU, and rendering the strokes with styles. This achieves real-time performance for high-resolution rendering of dense 3D meshes while supporting advanced stylization effects.

Uploaded by

王毅

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

82 views13 pages

GPU-Driven Real-Time Mesh Contour Vectorization

Uploaded by

王毅

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Eurographics Symposium on Rendering (2022)

A. Ghosh and L.-Y. Wei (Editors)

GPU-Driven Real-Time Mesh Contour Vectorization

Wangziwei Jiang, Guiqing Li∗ , Yongwei Nie, Chuhua Xian

South China University of Technology, Institute of Computer Science and Engineering, China

Figure 1: Real-time vectorization and stylization of a rose (158k triangles) under 2560 × 1440 resolution: From left to right are respectively
input 3D mesh, vectorized stroke curves rendered with different colors, and two different stylizations based on extracted stroke curves
Abstract
Rendering contours of 3D meshes has a wide range of applications. Previous CPU-based contour rendering algorithms support
advanced stylized effects but cannot achieve realtime performance. On the other hand, real-time algorithms based on GPU
have to sacrifice some advanced stylization effects due to the difficulty of linking contour elements into stroke curves. This
paper proposes a GPU-based mesh contour rendering method which includes the following steps: (1) before rendering, a
preprocessing step analyzes the adjacency and geometric information from the 3d mesh model; (2) at runtime, an extraction
stage firstly selects contour edges from the 3D mesh model, then the parallelized Bresenham algorithm rasterizes the contour
edges into a set of oriented contour pixels; (3) next, Potrace is parallelized to extract (pixel) edge loops from the contour pixels;
(4) subsequently, a novel segmentation procedure is designed to partition the edge loops into strokes; (5) finally, these strokes
are then converted into 2D strip meshes in order to support rendering with controllable styles. Except the preprocessing step,
all other procedures are implemented in parallel on a GPU. This enables our framework to achieve real-time performance for
high-resolution rendering of dense mesh models.
CCS Concepts
• Computing methodologies → Non-photorealistic rendering; Image processing;

1. Introduction the contour of a given 3D model into 2D or 3D stroke curves

[GTDS10], then render those curves in different styles, e.g.,
Contour of 3D meshes, which reveals the essential shape of objects,
changing contour width [GVH07], line-drawing density simplifi-
plays a crucial role in painting and other arts. In particular, styl-
cation [GDS04], stroke texturing [BCGF10], and stroke abstrac-
ized contours are essential for artistic expression. That is why con-
tion [BJC∗ 12]. Though CPU-based approaches are able to produce
tour rendering which extracts and stylizes contours of 3D meshes
stroke curves from the 3d contour, it is difficult for them to achieve
has long been a fundamental topic in non-photorealistic rendering
real-time performance.
(NPR).
CPU-based contour rendering methods usually first convert Most contour rendering methods for real-time applications are

© 2022 The Author(s)

https://www.eg.org https://diglib.eg.org
DOI: 10.2312/sr.20221159
94 Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization

fully implemented on GPU in order to avoid frequent commu- 2.1. Image-based contour rendering
nication between CPU and GPU. GPU-based methods can be
Image-based approaches directly apply image filters to extract fea-
roughly divided into two categories: contour-edge-based rendering
ture pixels from a rendered image. Some CPU-based algorithms
[MH04,CF09], and image-filtering-based rendering [ST90,ND04].
further exploit image vectorization algorithms to convert feature
The former directly extract contour edges from the mesh and then
pixels into continuous planar curves. For example, CPU-based im-
renders each edge as a line segment or a rectangle while the lat-
age vectorization algorithms [Sel03] can be used for tracing feature
ter first renders the geometry (for example, depth and normals)
curves. Xiong et al. [XFZ16] used GPU to accelerate the vector-
into textures, then finds feature pixels via image processing filters.
ization process, however their method relies on CPU to finish the
Unfortunately, to our knowledge, existing full GPU-based methods
sequential contouring.
cannot link contour pixels / edges together to form stroke curves,
which however is the key for contour stylization. Recently, we have GPU-based approaches usually make use of a fragment shader to
also witnessed deep neural networks being utilized to produce styl- apply edge detection filter on G-buffers. A G-buffer generally con-
ized line drawings [LNHK20, LFHK21], which mainly focus on sists of three components: scene color, depth and normal images. A
learning styles rather than curve generation. pixel in the G-buffer is considered as a line feature if its gradient is
higher than a specified threshold [CS16]. Mesh contour can be ap-
Chaining contour elements (image pixels or mesh edges) into a
proximately extracted as a subset of these line features. As only a
curve usually starts from a contour element and then continuously
set of scattered pixels are generated, this kind of methods can only
links the current element to its adjacent neighbor, until arriving at a
support limited control over the stylization [ND04, Har07]. For ex-
singular point where the chain’s visibility changes [BH19]. When
ample, it is challenging to achieve thick lines since the detected
linking 2D pixels, this process can be considered as a particular
pixels are usually highly noisy under a low gradient threshold. It is
genre of image vectorization. It is difficult to parallelize the linking
often inaccurate too. For example, contours may be missed when
procedure due to its sequential nature and the irregular topology of
the depth varies slowly around the contour area.
contour edges or pixels.
To address the issue, we propose a GPU-based system to gener- “Inverted hull”, a special GPU-based method by Raskar and Co-
ate stroke curves from 3D mesh models. It first works on the CPU hen [RC99], is very popular in industries due to its simplicity and
to prepare the adjacency information between vertices, edges and efficiency. The given mesh model is rendered twice to reveal its
faces, and also geometric attributes including vertex positions and outline. The first pass renders front faces into a depth buffer while
face normals. A GPU scheme is then designed to quickly locate the second pass renders slightly enlarged back faces in black color,
contour edges between front and back faces of the mesh. After that, so that contour appears as black borders.
the parallelized Bresenham algorithm [Wri90] is adopted to raster- Bénard et al. [BJC∗ 12] proposed a method to track feature
ize these contour edges. To trace the boundaries of these rasterized curves in the image space with temporal coherence. Their algo-
contours efficiently, we parallelize the Potrace algorithm [Sel03] rithm is mainly based on CPU, except for the stages of line pixel
on the GPU, where the pixel-edge chaining step is parallelized by filtering and final rendering that are done on a GPU. Each curve is
the technique of parallel list ranking [Wyl79]. Finally, based on the represented as a polyline initialized using a CPU-based image vec-
orientation of mesh contours and the traced boundaries, we devise torization algorithm. In each frame, they avoided the cost of vector-
a simple heuristic that is also parallelized to extract stroke polylines ization (essentially reconstruction) by tracking and deforming a set
from image boundaries. of curves. Although being able to achieve excellent temporal coher-
In summary, our contributions include: ence for meshes of moderate complexity, the approach suffers from
a performance bottleneck due to multiple readbacks from GPU to
• We parallelize the Potrace algorithm [Sel03] that is previously CPU. As having little knowledge about the underlying 3D scene,
designed for CPU-based image vectorization, overcoming the its curve topology sometimes deviates from scene occlusions and
sequential nature of boundary tracing by using the technique of details.
parallel list ranking [Wyl79].
• We also propose a heuristic-rule-based parallel algorithm to ex-
tract stroke curves from the traced boundaries. 2.2. Mesh-edge-based contour rendering
• Our method is fully in parallel. By exploiting the sparsity of con-
Instead of extracting contour strokes from rendered images, some
tour edges and pixels, we further improve the performance of our
methods directly compute and render contour edges from the 3D
method, achieving real-time performance.
model. An edge is considered on the contour when one of its two
adjacent faces is forward and the other one backward with respect
2. Related work to the current viewpoint.
An large amount of literature has been contributed to contour ex- The earliest GPU-based methods [CM02, Goo03] treat each
traction and stylization, which can be roughly classified into three mesh edge as a degenerated quad and select contour edges in a
categories: image-based contour rendering, mesh-edge-based con- vertex shader. Each quad contains four vertices (two are the end-
tour rendering, and the hybrid methods. This work focuses on real- points of the mesh edge and the other two are its opposite vertices
time approaches that can be implemented on a GPU. We refer the on its two adjacent faces) in order to determine whether the corre-
readers to the survey by Bénard and Hertzmann [BH19] for more sponding edge is a contour one. A fragment shader is then devised
details. to scan-convert the contour edges. Noticing that it may lead to gaps

© 2022 The Author(s)

Eurographics Proceedings © 2022 The Eurographics Association.
Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization 95

between adjacent edges when vertex normals fail to reflect the con- a specialized rasterization scheme to collect contour pixels only.
tour curvature well, McGuire and Hughes [MH04] drew caps at Our scheme includes three sequential stages: recognition of contour
the ends of each contour edge. There are also efforts using GPU edges, rasterization of the recognized edges and visibility decision
to extract mesh edges for other purposes. For example, Peciva et on the rasterized fragments (pixels).
al. [PSM∗ 13] and wachter et al. [WKS07] used GPU to efficiently
compute shadow volumes.
4.1. Computation of contour edges
Cole and Finkelstein [CF10] noted that early GPU-based meth-
An edge of a mesh is considered contour if and only if one of its
ods suffer from visibility issues. They utilized geometry shader and
two adjacent triangles is a front face and the other one is a back face
advanced fragment shader techniques to achieve accurate visibil-
with respect to the viewpoint. Given local information of all edges
ity determination for contour edges. Each edge is projected onto
collected on the CPU, our GPU-based procedure first computes the
the screen and sliced into small 2D segments, then the visibility of
orientation of all faces, and then collects the contour edges while
each segment is estimated via comparing its depth against the scene
getting rid of non-contour ones.
depth buffer, and finally each contour edge is individually rendered
as textured quads. Pre-processing. Recognizing an contour edge needs to know the
local geometry near the edge, therefore we collect all related infor-
2.3. Hybrid approaches mation in CPU. This includes the following five buffers:

Hybrid method combines both the geometric information of con- • edge-vertex buffer Bev : store the index of 2 vertices for each
tour edges and texture information of the rasterized pixels to gen- edge;
erate contour stroke curves during the whole process. Both contour • edge-face buffer Be f : store the index of 2 adjacent faces of each
edges and pixels have their own advantages and disadvantages for edge;
rendering. The former may lead to small and frequent zig-zag arti- • vertex buffer Bvc : record vertex coordinates;
facts when rendered as strokes [NM00]. On the contrary, the latter • face-vertex buffer B f v : save vertex index for each face;
has simpler topology and natural appearance but usually loses ac- • face-normal buffer B f n : record face normals.
curate 3D information. Considering that concave edges, whose internal dihedral angles are
A typical hybrid approach by Isenberg et al. [IHS02] extracts greater than π, cannot be a part of a visible contour [BH19], we
3D curves from contour edges with the help of a image-precision discard all this kind of edges in Bev and Be f to save resources. To
line visibility algorithm adapted for contours. The algorithm is es- our experience, about 40% of the total mesh edges can be removed
sentially a software depth test in which contour edges are scan- (see Figure 12).
converted into pixel-sized fragments and each fragment compares Orientation of triangles with respect to viewpoint. We dis-
its depth against its 3 × 3 neighbors in the z-buffer. patch a GPU kernel to calculate the orientation of each face with
Our approach analyzes the strokes in the image space. We also respect to the viewpoint based on buffers B f v and B f n and store it
record geometric information such as vertex positions, face nor- in the face orientation buffer B f o in which a back face is labelled
mals, primitive adjacency, and projected direction of contour edges with ’1’ while a front face is labelled with ’0’.
for use. Therefore, our method can also be viewed as a GPU-based Detection of contour edges. With B f o as input, a GPU kernel is
hybrid algorithm. created to recognize contour edges from Be f . An edge is a contour
edge if its two adjacent faces have different orientations, namely
3. Overview one with label ’1’ and the other with label ’0’. Next, we use paral-
lel stream-compaction [BOA09] to select contour edges while dis-
Our method takes a triangular mesh as input and generates vector- carding the rest. This yields a new buffer, the contour edge buffer
ized contour curves. Specifically, it consists of five stages as shown Bce . The subsequent GPU threads will only process edges in Bce in-
in Figure 2. From left to right: (1) Preprocessing is conducted on stead of those in Bev . According to McGuire [McG04, MH04], the
CPU to collect the adjacency information and geometric attributes number of contour edges is close to N 0.8
f where N f is the number
from the given mesh models; (2) Rasterization is responsible for of mesh faces.
recognizing the contour edges by checking the orientation of faces
sharing the edge and then rasterizing the edges into pixels via a
parallelized Bresenham algorithm; (3) The vectorization state par- 4.2. Fragment generation
allelizes Potrace algorithm to trace the loops of the pixel bound- A parallelized Bresenham algorithm [Wri90] is designed to scan-
aries; (4) The stroke generation stage employs a simple heuristic convert the contour edges into fragments. Each fragment is a pixel-
to extract the strokes from the pixel edge loops; (5) Finally, the sized primitive with geometric attributes and a pointer to its contour
stylization stage yields the rendering result of contour edges with a edge. The algorithm consists of two passes: a counting pass and an
specific style. allocation pass.
Fragment counting pass. With Bce and Bvc as input, this pass
4. Contour rasterization
counts how many fragments are covered by each contour edge. If
Conventional hardware rasterization only yields a whole image in- the absolute slope of the projection of the contour edge is less than
stead of generating the desired contour pixels. Hence, we develop 1, the number of pixels equals to the length of its projection along

© 2022 The Author(s)

Eurographics Proceedings © 2022 The Eurographics Association.
96 Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization

Figure 2: Our approach consists of five stages. From left to right are respectively preprocessing, rasterization, vectorization, stroke genera-
tion, and stylization rendering. In the middle three pictures, white pixels stand for background regions.

x-axis. Otherwise the number is the length of its projection along employ the parallel stream compaction algorithm [SHG∗ ] to obtain
y-axis. Fragment count is stored as a sub-buffer Bc f within Bce . We a contour pixel buffer Bcp with pixel coordinates.
call these fragments contour fragments.
Fragment generation pass. This pass allocates a fragment at-
tribute buffer B f a for the fragments according to the total fragment
number. We record pixel coordinates, projection of its associated
edge vector (edge vertices oriented by its adjacent front face), depth
and normal for each fragment in B f a . For each contour edge, its
fragments are sequentially stored in B f a . To achieve such an al-
location scheme, we need the mapping between contour edges in
Bce and contour fragments in B f a . We apply an exclusive add-scan
upon the fragment count buffer Bc f to build the mapping Bce_ f
for each edge to its starting fragment index. The fragment-to-edge
mapping B f _ce is initialized with negative ones. We use Bce_ f to Figure 3: Creation of contour pixels: A soft test generates a buffer
build the mapping at starting fragments in B f _ce . Then we broadcast of contour pixels such as (x, y) and a set of pv-frags for each pixel,
the mapping to other fragments via a segmented max-scan [Ble90] ’e’, ’d’ and ’f’ on (x, y); A hard test selects the front-most one for
upon B f _ce , with each starting fragment seen as the segment head. each contour pixel, e. g. ’e’ among ’e’, ’d’, and ’f’ on (x, y).
B f _ce and Bce_ f enables each fragment (resp. contour) to access
attributes from the corresponding contour (resp. fragments). Fi- Hardware depth test pass. This pass picks the front-most pv-
nally, we apply the parallel Bresenham algorithm [Wri90] to com- frag for each contour pixel and copies the fragment attributes into
pute the coordinate for each fragment. Note that the depth and nor- the corresponding pixel in Bcp . We treat each pv-frag as a 1-pixel-
mal should be interpolated from vertex attributes in a perspective- size point whose depth is the fragment depth and whose color is
correct manner. computed by packing bits of the fragment attributes from B f a . The
pv-frag points are then rendered into a texture with hardware z-test.
4.3. Contour pixel generation At last, each contour pixel samples the texture at its coordinate and
decodes the sampled color to the corresponding fragment attributes.
We need to extract visible fragments from B f a . Accurate contour In Figure 3, pv-frag ’e’ is finally selected in this test.
visibility has long been a challenging problem [CF10,BHK14]. We
address the issue by a two-pass procedure on GPU. A soft depth test Figure 4 presents an example of the visibility test: visible (resp.
picks up pixels covered by visible contour fragments, referred to as hidden) fragments are marked as green (resp. red) on the left col-
contour pixels. A hardware z-test pass then selects the front-most umn; the right column illustrates contour pixels colored with en-
fragment for each contour pixel. coded geometrical attributes. Rasterized contour-pixels only oc-
cupy a tiny portion of the screen, making it possible to achieve
Soft depth-test pass. A scene depth texture is rendered in ad-
realtime image vectorization.
vance. For each contour fragment, we compare its depth from B f a
against depth samples from its 3 × 3 neighborhood in the depth tex-
ture. A fragment passes the test if it is in front of no fewer than two
neighbors and is called a pseudo-visible-fragment (abbrev. as pv-
frag). This relaxed depth test allows multiple pv-frags to cluster in
the same contour pixel as shown in Figure 3 in which ’e’, ’d’ and
’f’ among 6 fragments pass the test to be a pv-frag within the same
screen pixel.
To generate the contour pixels, we use a texture with all pixels
assigned to 0. Each pv-frag atomically reads its pixel value from Figure 4: An example showing results before (left: fragments) and
the texture, and then mark the pixel value as 1. We record the co- after (right: the contour pixels) generating contour pixels.
ordinate of a pixel in the pv-frag first visiting the pixel and then

© 2022 The Author(s)

Eurographics Proceedings © 2022 The Eurographics Association.
Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization 97

5. Contour chaining in which each element knows the indices of its previous and next
pixel-edge neighbors.
So far, we have obtained Bcp , the buffer of contour pixels with
geometric attributes, in which contour pixels generally form long
and thin strands in the corresponding image. A chaining process
should be conducted to link the contour pixels into a set of long
curves [GTDS10].
Our chaining process is inspired by Potrace [Sel03], which is Figure 6: Four pixel configurations. Given the left pixel-edge (red
designed for vectorizing the boundary of a binary image, where the arrow) of a contour pixel (black one), its next pixel-edge should be
boundary consists of a sequence of boundary pixel edges. A pixel the blue one.
has four pixel edges by viewing it as a square and a boundary pixel
edge is one shared by a foreground pixel and a background pixel as
shown in Figure 5. Each boundary is an oriented pixel-edge loop 5.2. Edge loop flattening
and encloses a connected region. These loops act as a superset of
our final stroke curves. We propose a parallel solution to replace the highly sequential pro-
cess of Potrace to extract all pixel-edge loops from B pel and flatten
them onto a linear array as shown in Figure 7. Our solution consists
of two passes: loop breaking and list ranking. The first pass selects
a head element to break edge-loops while the later pass ranks pixel-
edges in each edge-loop with respect to the head element.
In our setting, each edge-loop is a circular linked list and each
pixel-edge is a list node randomly scattered in B pel . It is quite suit-
able for Wyllie’s parallel list ranking algorithm [Wyl79] to deter-
mine the rank of each pixel-edge in the pixel-edge loop. With ranks
calculated, organizing the pixel-edges into linear arrays becomes
trivial.
Loop breaking. In this step, we determine the head pixel-edge
for each pixel-edge-loop. We specify the pixel-edge with the largest
Morton code [Mor66] as the head of the loop, where the Morton
Figure 5: Edge-loops: black and white squares are foreground code, unique for each pixel-edge, encodes its direction and related
(contour) and background pixels, respectively; edges shared by pixel coordinates. After obtaining the Morton codes of all pixel-
white and black squares are boundary pixel edges (red, blue and edges, we employ Wyllie’s algorithm with its operator set as inte-
yellow one with arrow indicating their direction); three colored ger maximum to pick up the head pixel-edge with maximal Morton
polygons are pixel-edge loops. code. The tail node of an edge-loop will be chosen as the predeces-
sor of the starting one. Two traced edge-loops are shown in the top
of Figure 7 in which the red arrows stand for the head.
5.1. Generation of pixel edges and creation of their linkage List ranking. This pass ranks the above linked lists with head
and tail nodes via Wyllie algorithm [Wyl79]. After ranking, we use
We follow the ‘path decomposition’ scheme of Potrace to generate
the rank of each node (pixel-edge) as its array index and serialize all
oriented pixel-edges for each contour pixel and build their linkage
pixel-edge loops into an array, i.e., B pel , such that the pixel-edges
according to different contour pixel configurations.
belonging to the same loop occupy a continuous segment as shown
Each pixel-edge is clockwise oriented around its contour pixel, in the bottom of Figure 7.
therefore for the left pixel-edge of a contour pixel, we need to con-
sider the 2 × 2 block where the contour pixel is at the bottom-right
5.3. Operations on edge-loop pool
corner as shown in Figure 6. In this case, there are four possible
configurations for the next pixel-edge. Other three cases, namely The pixel-edge loop buffer B pel forms the basis of our following
top, right and bottom pixel-edges of a contour pixel can be dealt screen-space algorithms. We call it an edge-loop pool. We develop
with in a similar manner. Furthermore, it also requires to find the two special operations: spatial filtering and segmentation. Classical
previous pixel-edge of the current one for each of the above four parallel computing primitives like the segmented scan [SHG∗ ] can
cases, which is needed in loop breaking process. be applied to the edge-loop pool by treating each edge-loop as a
segment.
The whole task only involves 3 × 3 neighborhood of a contour
pixel in the bitmap and it is trivial to parallelize. GPU threads only Spatial filtering. Spatial filtering can be considered as a 1D con-
work on Bcp , i.e., the buffer of contour-pixels (foreground pixels), volution on each edge-loop: each edge navigates around its edge-
in order to improve performance. A binary bitmap with contour loop and collects data from the neighboring pixel-edges. In our im-
pixels as the foreground is required to support neighboring pixel plementation, GPU threads linearly map to all pixel edges. Each
queries. It finally outputs a pixel-edge loop buffer denoted by B pel thread caches data into the thread group shared memory. In most

© 2022 The Author(s)

Eurographics Proceedings © 2022 The Eurographics Association.
98 Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization

(a) The case of foreground and background contours meet at a junction.

Figure 7: Pixel-edge loops: two pixel-edge loops with a red arrow

as the head node (top) and their pixel-edge loop array (bottom).

cases, neighboring data can be found and fetched efficiently from (b) The case of a self-occlusion model generates a cusp and a junction.
this cache. However, there are a few cache misses: (1) Pixel edge
is mapped to the start or end of a thread group; (2) Pixel edge is at Figure 8: Orientation-based stroke extraction.
the start (or end) of an edge-loop, and its predecessor (or successor)
is not mapped to the same thread group. This can only happen to
the first or last edge-loop mapped to the thread group. We detect contour edge share the same vertex order as its adjacent front face.
both scenarios and load missed data to the group shared memory. The contour of a smooth mesh will form counter-clockwise curves
Since the topology of edge-loop is fixed each frame, we can prepare on the screen. After rasterization, the hidden contour is discarded
missed data for each thread group and reuse it the whole frame. while the visible contour becomes thin and long strands of contour-
pixels. As the camera projection preserves the orientation of a tri-
Segmentation. Given a key for each edge, segmentation splits
angle face, the strands of contour-pixels share a counter-clockwise
each edge-loop into segments; pixel-edges inside a segment share
orientation (see Figure 8).
the same key, and two adjacent segments have different keys. Seg-
mentation requires each pixel-edge to evaluate where its segment During the rasterization stage (Section 4), we numerate vertices
starts and ends, which can be implemented via two segmented vc0 and vc1 of each contour edge according to its winding order in
scans, one for the starting index and another one for the ending the adjacent front face, project them to the screen positions vs0 and
index. vs1 respectively, and finally copy the edge direction vs1 − vs0 to its
rasterized contour fragments.
6. Stroke extraction
6.2. Orientation of pixel-edges
Edge-loops excessively cover contour features and neither start nor
end at visibility changes. To resolve this issue, we select desired All pixel-edges are originally clockwise oriented around their
pixel-edges from loops, which we call stroke segments. Each stroke contour-pixel square. To estimate an accurate orientation of the
segment starts or ends as its underlying mesh contour became vis- pixel-edge, we fit a curve to the local shape on its edge-loop. Our
ible or hidden, and each contour feature is covered by exactly one fitting algorithm takes the framework by Lewiner et al. [LGJLC05].
stroke segment (Figure 8). An “inner” edge-loop without visibility We dispatch two kernels to realize the local curve fitting. The
change will be extracted as a stroke (see the inner loop in Figure 7). first kernel samples the midpoint of each pixel edge, and then ap-
We match the orientation of each pixel-edge with its surround- plies Laplacian operator to smooth the midpoints by using the spa-
ing contour-pixels along the edge-loop; pixel-edges with coher- tial filtering discussed in Subsection 5.3. The second kernel applies
ent orientation will be selected as stroke segments. According to the spatial filtering again to collect for each pixel edge e0 the mid-
Bénard and Hertzmann [BH19], there are two kinds of visibility points W = {m−n , . . . m0 , . . . mn } of its neighborhood along the
change among contour-pixels: cusp and junction (see Figure 8b). corresponding edge loop (n = 8 in our experiments). In addition,
Our heuristic can resolve both cases owing to the oriented nature of we compute the arc-length parametrization of W as follows

edge-loops and mesh contours.  0 k = 0;
sk = sk+1 + ∥mk − mk+1 ∥ k = −1, −2 · · · , −n; (1)
sk−1 + ∥mk − mk−1 ∥ k = 1, 2, · · · , n;

6.1. Orientation of contour pixels
A quadratic parametric curve us then use to fit We
Seen from the viewpoint, vertices at a front face (resp. back face)
have a counter-clockwise (resp. clockwise) winding order. Let each r(s) = as + bs2 , (2)

© 2022 The Author(s)

Eurographics Proceedings © 2022 The Eurographics Association.
Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization 99

where r(s) = (x(s), y(s)), a = (ax , ay ) and b = (bx , by ). This leads Conventional stroke rendering algorithms [DiV13] can be eas-
to the following optimization ily applied to these stroke polylines to achieve stylized results. In
n order to collaborate with texture mapping, we extend each vertex
argmin{a,b} ∑k=−n ||wk (mk − r(sk )||2 , (3)
of a polyline along its the nomral direction [HLW93] to obtain a
−
(pk+e −pe )2 strip mesh as illustrated in Figure 10, and then create the texture
where wk = e σ2 are Gaussian weights. The orientation of coordinates for each of the mesh vertex on the given texture.
(a)
pixel edge e is then computed as te = ∥(a)∥ .

6.3. Stroke generation based on inside-outside test

Combining orientations of contour-pixels estimated in Subsection
4.2 and orientations of pixel-edges obtained in Subsection 6.2, we
can extract strokes from edge-loops. Concretely, for each pixel edge
e0 , we again collect its neighbor en , · · · , e−1 , e0 , e1 , · · · , en on the
same edge loop like having done in Subsection 6.2 and then find
their corresponding contour pixels pn , · · · , p−1 , p0 , p1 , · · · , pn . If Figure 10: Stroke parameterization for texturing. Extending points
more than half of the inner products between the orientations of pi on the path along its normal direction ni to two sides yields a
e0 and pk is greater than τ (an adjustable threshold set to 0.6 in strip planar mesh which is then mapped to the texture space.
default), then e0 is labeled as inside the surface contour. Otherwise,
it is assigned an outside label.
A Visibility change event happens if adjacent pixel-edges on an 7. Experimental results
edge-loop switching between inside and outside (visible and oc- This section first describes our implementation details and then
cluded). The segmentation operator described in Section 5.3 is then shows the advantages of the proposed framework via a variety of
used to trace (inside) stroke segments and discard (outside) redun- experiments. We will elaborately evaluate the time performance of
dant segments as shown in Figure 9. our approach and discuss its main influencing factors, and then
compare our approach with three popular contour rendering sys-
tems (Freestyle, Line Art, Pencil+4) and Active Strokes [BJC∗ 12]
both in time cost and rendering quality. Our system is developed
as a render pipeline in Unity Engine. All runtime procedures are
implemented on the GPU by using HLSL shaders.
Our approach and the first three systems run on a PC with Intel
i7-7700HQ 2.8 GHz of 8 GB RAM and NVIDIA GTX 1070 while
Active Strokes [BJC∗ 12] works on a PC with Intel Core i7-4790K
4GHz of 32 GB RAM and NVIDIA GTX 980Ti due technique rea-
sons. Nonetheless, both hardware configurations are fairly close.
All results are generated under 1920 × 1080 resolution.

7.1. Implementation details

Figure 9: Inside-outside test: the case of junction (top row) and Mesh data is preprocessed and stored in GPU buffers persistently
the case of a cusp and a junction (bottom row). Each row shows while runtime data such as information for contour edges and pixel-
contour-pixels, pixel-edges, orientation match, extracted strokes edges is generated from scratch in each frame. For example, the
(colored) and redundant segments (gray) rrom left to right. edge-loop pool (Section 5.3) contains many sub-arrays sequentially
storing pixel-edges of edge-loops. Each pixel-edge records its index
The above heuristic may lead to noisy results due to bad mesh in the sub-array and the length of the sub-array. The order between
quality or image sampling. We leverage the spatial filtering (Sec- sub-arrays is determined by the allocation process [Har10].
tion 5.3) to smooth the inside-outside values to make our algorithm In the chaining stage, Wyllie algorithm requires log(n) iterations
more robust. Very short visible strokes are also given up to improve to resolve linked lists with a list length n. In practice, we found
the visual appearance. that 18 iterations are enough for screen resolution of 2048 × 2048.
Note that each stroke is actually a sequence of pixel edges which One or more stream compactions can be inserted inbetween these
should be converted into polylines for rendering. Let {e1 , e2 ... en } iterations to discard short lists that have already finished ranking.
be such a stroke without loss of generality. We simply compute the
vertices of its polyline {p0 , p1 ... pn } by setting pi to the midpoint
7.2. Performance evaluation
of ei . We can smooth the stroke polylines if necessary. In addition,
we can compute tangent ti of pi as the orientation of ei and normal We observe the performance by separating our system into two
ni orthogonal to f ti for latter use. stages: one is the rasterization process (Section 4) and the other

© 2022 The Author(s)

Eurographics Proceedings © 2022 The Eurographics Association.
100 Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization

is the vectorization process consisting of pixel-edge chaining (Sec-

tion 5) and stroke extraction (Section 6). The model is placed to
cover the screen as much as possible in all experiments.
Time cost distribution. Figure 11 depicts the performance un-
der a 1920 × 1080 resolution, commonly used in real-time render-
ing applications. Generally, The performance of the first stage is
mainly governed by shape complexity and mesh size while perfor-
mance of the latter is primarily determined by screen resolution.
We also known from the figure that the time cost mainly comes
from the vectorization stage and is merely affected by the mesh
complexity. Figure 11 shows that our system achieves highly real-
time performance considering the conventional budget of realtime
rendering applications (16ms per frame).

(a)

Figure 11: GPU runtime performance per stage, under 1920 ×

1080 resolution
(b)

Sparsity analysis of contour primitives. Our approach greatly Figure 12: Sparsity of primitives: (a) Ratios between convex edges
benefits in performance from the sparsity of contour edges and con- and the total edges, and the contour edges and the total edges ;
tour pixels which determines the GPU workload. To verify this, we (b) Ratio between contour pixels and screen pixels covered by the
rotate the mesh model and record the average of three ratios - ratio mesh bounding box.
between the amount of convex edges and the total edges, contour
edges and the total edges, contour-pixels and the pixels covered
by the mesh’s screen bounding box. Figure 12) (a) illustrates that
about 60% convex edges remain after preprocessing while only 4% Active Stroke is a prototype based on an image-space line ren-
of total edges are contour edges. Similarly, near 4% of the render- dering method [BJC∗ 12], which is different from the other three
ing pixels are contour ones. tools in two critical points: (1) it only generates curves in the first
frame and then maintains a set of curves throughout the subsequent
frames while the other three methods generate stroke curves for
7.3. Comparison with previous work each frame; (2) it generates curves from feature samples in the
depth buffer with image-space filters (not from actual 3D mesh con-
As no existing GPU-based approaches support contour vectoriza-
tours) while the other three methods generate curves from the 3D
tion, all approaches to be compared are CPU-based: Freestyle, Line
contour of the mesh model.
Art, Pencil+4 and Active Strokes are CPU-based. Both Freestyle
and Line Art are line drawing modules of Blender, among which Runtime Performance. For methods generating curves in each
Freestyle is based on a series of work by Grabli and Turquin et frame (all except Active Strokes), we profile the time of contour ex-
al. [GTDS10, GTDS04]. Pencil+4 is a closed-source line drawing traction and stroke vectorization, which are the main focus of our
renderer, with implementation across multiple softwares. Consid- method. For Active Strokes, we record the total cost of feature pixel
ering that our system is developed in Unity Engine, we choose the extraction (extract samples and image readback) and curve pro-
Unity version of Pencil+4 for comparison. cessing (advection, relaxation, and topology adjustments). Table 1

Eurographics Proceedings © 2022 The Eurographics Association.
Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization 101
∗
Table 1: Comparison of timings among five approaches: ours, Freestyle, Line Art, Pencil+4 and Active Strokes [BJC 12].

Mesh Model (tris) Suzanne (0.1k) Bunny (5k) Cow (6k) Teapot (6k) Fandisk (13k) Rocker. (20k)
Ours 0.76 0.78 0.7 0.75 0.63 0.86
Pencil+4 7.93 8.43 9.32 8.21 9.22 14.84
Freestyle 27.10 43.32 43.75 44.79 53.34 67.13
Line Art 11.30 18.80 19.65 20.29 28.71 44.69
Active Strokes 54.40 54.68 53.79 46.70 52.26 54.40
Mesh Model (tris) Horse (97k) Buddha (99k) Arm. (100k) David (100k) Dragon (249k) Lucy (300k)
Ours 0.87 0.98 0.97 1.02 1.42 0.96
Pencil+4 68.10 97.67 111.61 67.00 208.00 183.60
Freestyle 242.37 331.10 370.56 383.69 828.47 845.61
Line Art 165.90 189.49 233.44 166.54 445.14 551.52
Active Strokes 51.45 97.09 124.56 91.49 107.43 73.73

demonstrates that our approach achieves tens to hundreds folds of 7.4. Stylized rendering of complex models
acceleration over other methods. This is because the serial nature
At the end, we present some stylized results of complex models.
of CPU makes it challenging to process massive geometric data. In
Figure 15 depicts three types of stroke patterns for Lucy model
addition, the iterative processes such as contour vectorization also
while Figure 16) illustrates the final rendering results and stylized
consume considerable time on CPU (linear time complexity). In
strokes for two complex monstrous models.
contrast, the time complexity of our vectorization algorithm is only
O(logn).
Stroke chaining quality. Generally, our method produces stroke 8. Limitations, conclusions and future work
curves with quality comparable to or even better than the CPU- Our method suffers from some disadvantages in some special situ-
based approaches regardless of the complexity of mesh shapes. In ations. First, it may ignore subtle contour visibility changes due to
our experiments, hybrid methods (all except Active Strokes) were lack of accurate 3D contour information to guide the stroke extrac-
tuned to ensure a coherent configuration: only mesh contour is ex- tion and therefore wrongly connect different strokes together (see
tracted, and contour elements are chained to stroke curves starting Region A in Figure 14). Second, a stroke may be falsely broken if
and ending at visibility events. Since the stroke topology of Active being occluded by other primitive or objects as shown in the top
Strokes is mainly determined by the curve tracking process, instead left of Figure 13a. This case worsens for contour features highly
of rendering a static scene, we animate the scene to bring motion to clustering on the screen such as thin objects as shown in the bottom
the curves and take a screen capture. left of Figure 13a. It usually does not happen for geometry-based
All experimental results are presented in Figure 14 in which algorithms such as Pencil+4, e.g. seeing the second column in Fig-
strokes are drawn with different colors. As shown in areas C and ure 13b. Third, a minor limitation is that the extracted strokes have
D in the figures, contour by other methods is inappropriately bro- a pixel offset from the actual screen-space contour. This artefact
ken into fragmented curves, and the curve topology fails to re- will not be perceived generally and can be amended by moving the
flect the occlusion relationship. In contrast, continuous curves by stroke pixels towards their associated contour-pixels.
our method match the occlusion relationship better. Nevertheless, Regardless of the aforementioned drawbacks, our method
area A depicts that our image-based line extraction process can achieves acceleration of hundreds of times against CPU-based
not recognize those endpoints with subtle visibility change on the methods and is enough to make up for these disadvantages in
screen, while other hybrid methods (all except Active Strokes) pro- real-time applications. As future work is to extend our frame-
duce more accurate line distribution. This defect is even more vis- work to generate temporally coherent stylized contour anima-
ible for Active Strokes, which links all pixels as a single curve. tions. It is also interesting to integrate the proposed framework
For dense meshes shown in the third ("David") and fourth column into a more complete and powerful GPU contour stylization
("Lucy"), our method yields a line topology similar to that by Pen- pipeline [BH19]. A reference implementation of the proposed
cil+4 and Active Strokes and much better than those by Line Art method is available at https://github.com/JiangWZW/
and Freestyle which are highly fragmented as shown in area G to Realtime-GPU-Contour-Curves-from-3D-Mesh.
K.
Our algorithm achieves a balance between Pencil+4 and Active
9. Acknowledgements
Strokes: (1) It leads to more coherent and continuous curves com-
pared with Pencil+4 which links curves directly on meshes because We thank anonymous reviewers, especially the primary reviewer,
contour-pixels on image often have cleaner topology and smoother for the valuable and careful comments. We thank Pierre Bénard for
geometric attributes than the contour edges on the mesh; (2) It kindly providing the experiment data of Active Strokes [BJC∗ 12].
catches more details and better reflects the occlusion relationship We also thank Wengrui Ma and Yiming Wu for helpful discussions
than Active Strokes. on Freestyle and Line Art.

Eurographics Proceedings © 2022 The Eurographics Association.
102 Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization

This research is sponsored in part by the National Nat- [CF09] C OLE F., F INKELSTEIN A.: Fast high-quality line visibility. In
ural Science Foundation of China (61972160, 62072191), in Proceedings of the 2009 symposium on Interactive 3D graphics and
part by the Natural Science Foundation of Guangdong Province games (Boston, Massachusetts, 2009), Proceedings of the 2009 sympo-
sium on Interactive 3D graphics and games, Association for Computing
(2019A1515012301 , 2019A1515010860). Guiqing Li is the cor- Machinery, p. 115–120. 2
responding author.
[CF10] C OLE F., F INKELSTEIN A.: Two fast methods for high-quality
line visibility. IEEE Transactions on Visualization and Computer Graph-
ics 16, 5 (2010), 707–717. doi:10.1109/TVCG.2009.102. 3, 4
[CM02] C ARD D., M ITCHELL J. L.: Non-photorealistic rendering with
pixel and vertex shaders. Direct3D ShaderX: vertex and pixel shader tips
and tricks (2002), 319–333. 2
[CS16] C ARDONA L., S AITO S.: Temporally coherent and artistically
intended stylization of feature lines extracted from 3d models. Computer
Graphics Forum 35, 7 (2016), 137–146. doi:https://doi.org/
10.1111/cgf.13011. 2
[DiV13] D I V ERDI S.: A brush stroke synthesis toolbox. In Image and
(a) Video-Based Artistic Stylisation, Image and Video-Based Artistic Styli-
sation. 2013, pp. 23–44. 7
[GDS04] G RABLI S., D URAND F., S ILLION F.: Density measure for
line-drawing simplification, 2004 6-8 Oct. 2004 2004. 1
[Goo03] G OOCH B.: Silhouette extraction. Course Notes for Theory
and Practice of Non-Photorealistic Graphics: Algorithms, Methods, and
Production Systems 6 (2003), 1–10. 2
[GTDS04] G RABLI S., T URQUIN E., D URAND F., S ILLION F. X.: Pro-
grammable style for npr line drawing. In Proceedings of the Fifteenth Eu-
rographics conference on Rendering Techniques (Norrköping, Sweden,
(b)
2004), Proceedings of the Fifteenth Eurographics conference on Render-
ing Techniques, Eurographics Association, p. 33–44. 8
Figure 13: Limitations: (a) Our method wrongly partitions the rect-
[GTDS10] G RABLI S., T URQUIN E., D URAND F., S ILLION F. X.: Pro-
angle into two strokes due to occlusion by a stick (top left) while grammable rendering of line drawing from 3d scenes. ACM Transactions
Pencil+4 preserves the integrity well (top right); (b) Our method on Graphics 29, 2 (2010), 1–20. 1, 5, 8
falsely clusters contour features of thin objects (the red rectangle
[GVH07] G OODWIN T., VOLLICK I., H ERTZMANN A.: Isophote dis-
regions, bottom left) and Pencil+4 again yields more reasonable tance: a shading approach to artistic stroke thickness. In Proceedings
results (bottom right). of the 5th international symposium on Non-photorealistic animation and
rendering (San Diego, California, 2007), Proceedings of the 5th interna-
tional symposium on Non-photorealistic animation and rendering, Asso-
ciation for Computing Machinery, p. 53–62. 1
References [Har07] H ARVILL A.: Effective toon-style rendering control using scalar
fields., 2007. 2
[BCGF10] B ÉNARD P., C OLE F., G OLOVINSKIY A., F INKELSTEIN A.:
Self-similar texture for coherent line stylization. In Proceedings of the [Har10] H ARRIS M.: State of the Art in GPU Data-Parallel Algorithm
8th International Symposium on Non-Photorealistic Animation and Ren- Primitives. Tech. rep., Nvidia, 2010. 7
dering (New York, NY, USA, 2010), NPAR ’10, Association for Com- [HLW93] H SU S. C., L EE I. H. H., W ISEMAN N. E.: Skele-
puting Machinery, p. 91–97. URL: https://doi.org/10.1145/ tal strokes. In Proceedings of the 6th Annual ACM Sym-
1809939.1809950, doi:10.1145/1809939.1809950. 1 posium on User Interface Software and Technology (New York,
[BH19] B ÉNARD P., H ERTZMANN A.: Line drawings from 3d models: NY, USA, 1993), UIST ’93, Association for Computing Machin-
A tutorial. Foundations and Trends® in Computer Graphics and Vision ery, p. 197–206. URL: https://doi.org/10.1145/168642.
11, 1-2 (2019), 1–159. doi:10.1561/0600000075. 168662, doi:10.1145/168642.168662. 7

[BHK14] B ÉNARD P., H ERTZMANN A., K ASS M.: Computing smooth [IHS02] I SENBERG T., H ALPER N., S TROTHOTTE T.: Styliz-
surface contours with accurate topology. ACM Transactions on Graphics ing silhouettes at interactive rates: From silhouette edges to sil-
33, 2 (2014), 1–21. doi:10.1145/2558307. houette strokes. Comput. Graph. Forum 21, 3 (2002), 249–258.
URL: https://doi.org/10.1111/1467-8659.00584, doi:
[BJC∗ 12] B ÉNARD P., J INGWAN L., C OLE F., F INKELSTEIN A., 10.1111/1467-8659.00584. 3
T HOLLOT J.: Active strokes: Coherent line stylization for animated
[LFHK21] L IU D., F ISHER M., H ERTZMANN A., K ALOGERAKIS E.:
3d models. In NPAR 2012 - 10th International Symposium on Non-
Neural strokes: Stylized line drawing of 3d shapes, October 2021. 2
photorealistic Animation and Rendering (Annecy, France, 2012), NPAR
2012 - 10th International Symposium on Non-photorealistic Animation [LGJLC05] L EWINER T., G OMES J R J. D., L OPES H., C RAIZER M.:
and Rendering, ACM, pp. 37–46. Curvature and torsion estimators based on parametric curve fitting. Com-
puters & Graphics 29, 5 (2005), 641–655. 6
[Ble90] B LELLOCH G.: Pre x sums and their applications. Tech. rep.,
Citeseer, 1990. 4 [LNHK20] L IU D., NABAIL M., H ERTZMANN A., K ALOGERAKIS E.:
Neural contours: Learning to draw lines from 3d shapes, June 2020. 2
[BOA09] B ILLETER M., O LSSON O., A SSARSSON U.: Efficient stream
compaction on wide simd many-core architectures. In Proceedings of the [McG04] M C G UIRE M.: Observations on silhouette sizes. Journal
Conference on High Performance Graphics 2009 (New York, NY, USA, of Graphics Tools 9, 1 (2004), 1–12. doi:10.1080/10867651.
2009), HPG ’09, Association for Computing Machinery, p. 159–166. 3 2004.10487594. 3

Eurographics Proceedings © 2022 The Eurographics Association.
Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization 103

(a)

(b)

(c)

(d)

Eurographics Proceedings © 2022 The Eurographics Association.
104 Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization

(e) Active Strokes

Figure 14: (continued) Comparison of contour stroke quality among five approaches: four examples are presented for each approaches and
from top to bottom are respectively our approach, Pencil+4, Line Art, Freestyle and Active Strokes. Dotted rectangles on the model are
zoom-in regions whose larger versions are placed around the models.

Figure 15: Stylization with texture mapping: three types of stroke patterns are depicted for the Lucy model.

[MH04] M C G UIRE M., H UGHES J. F.: Hardware-determined feature [RC99] R ASKAR R., C OHEN M.: Image precision silhouette edges. In
edges. In Proceedings of the 3rd international symposium on Non- Proceedings of the 1999 symposium on Interactive 3D graphics (Atlanta,
photorealistic animation and rendering (2004), Proceedings of the 3rd Georgia, USA, 1999), Proceedings of the 1999 symposium on Interactive
international symposium on Non-photorealistic animation and rendering, 3D graphics, Association for Computing Machinery, p. 135–140. 2
pp. 35–47. 2, 3
[Sel03] S ELINGER P.: Potrace: a polygon-based tracing algorithm. Po-
[Mor66] M ORTON G. M.: A computer oriented geodetic data base and a trace (online), http://potrace. sourceforge. net/potrace. pdf (2009-07-01)
new technique in file sequencing. 5 (2003). 2, 5
[ND04] N IENHAUS M., D ÖLLNER J.: Sketchy drawings. In Proceedings [SHG∗ ] S ENGUPTA S., H ARRIS M., G ARLAND M., ET AL .: Efficient
of the 3rd international conference on Computer graphics, virtual real- parallel scan algorithms for gpus. 4, 5
ity, visualisation and interaction in Africa (Stellenbosch, South Africa,
2004), Proceedings of the 3rd international conference on Computer [ST90] S AITO T., TAKAHASHI T.: Comprehensible rendering of 3-d
graphics, virtual reality, visualisation and interaction in Africa, Associa- shapes. SIGGRAPH Comput. Graph. 24, 4 (1990), 197–206. doi:
tion for Computing Machinery, p. 73–81. 10.1145/97880.97901. 2
[NM00] N ORTHRUP J., M ARKOSIAN L.: Artistic silhouettes: A hybrid [WKS07] W ÄCHTER C., K ELLER A., S TICH M.: Efficient and ro-
approach. In Proceedings of the 1st international symposium on Non- bust shadow volumes using hierarchical occlusion culling and geometry
photorealistic animation and rendering (2000), pp. 31–37. 3 shaders, 2007. 3
[PSM∗ 13] P ECIVA J., S TARKA T., M ILET T., KOBRTEK J., Z EMCIK [Wri90] W RIGHT W. E.: Parallelization of bresenham’s line and circle
P.: Robust silhouette shadow volumes on contemporary hardware. In algorithms. IEEE Computer Graphics and Applications 10, 5 (1990),
GraphiCon’2013 (2013), pp. 56–59. 3 60–67. 2, 3, 4

Eurographics Proceedings © 2022 The Eurographics Association.
Wangziwei Jiang, Guiqing Li, Yongwei Nie, Chuhua Xian / GPU-Driven Real-Time Mesh Contour Vectorization 105

Figure 16: Stylization with toon shading for two monsters: each row shows the final render (left) and stylized strokes (right).

[Wyl79] W YLLIE J. C.: The Complexity of Parallel Computations. PhD

thesis, Cornell University, 1979. 2, 5
[XFZ16] X IONG X., F ENG J., Z HOU B.: Real-time image vectorization
on gpu. In VISIGRAPP (1: GRAPP) (2016), pp. 143–150. 2

Digitisation Principles and Phased Array Imaging
No ratings yet
Digitisation Principles and Phased Array Imaging
58 pages
Procedural Surface: Exploring Texture Generation and Analysis in Computer Vision
From Everand
Procedural Surface: Exploring Texture Generation and Analysis in Computer Vision
Fouad Sabry
No ratings yet
Efficient GPU Path Rendering Using Scanline Rasterization
No ratings yet
Efficient GPU Path Rendering Using Scanline Rasterization
12 pages
Scanline Rendering: Exploring Visual Realism Through Scanline Rendering Techniques
From Everand
Scanline Rendering: Exploring Visual Realism Through Scanline Rendering Techniques
Fouad Sabry
No ratings yet
E0 271 Assignment 2
No ratings yet
E0 271 Assignment 2
5 pages
Imm4884 PDF
No ratings yet
Imm4884 PDF
1 page
CIS 665 GPU Project Final Report - Dongsoo Han
0% (1)
CIS 665 GPU Project Final Report - Dongsoo Han
11 pages
Efficient GPU Screen-Space Ray Tracing
No ratings yet
Efficient GPU Screen-Space Ray Tracing
13 pages
Ray Tracing On GPU
No ratings yet
Ray Tracing On GPU
44 pages
Multi View Three Dimensional Reconstruction: Advanced Techniques for Spatial Perception in Computer Vision
From Everand
Multi View Three Dimensional Reconstruction: Advanced Techniques for Spatial Perception in Computer Vision
Fouad Sabry
No ratings yet
Batch and Cull in Opengl
No ratings yet
Batch and Cull in Opengl
25 pages
Volume Rendering: Exploring Visual Realism in Computer Vision
From Everand
Volume Rendering: Exploring Visual Realism in Computer Vision
Fouad Sabry
No ratings yet
Polygon Computer Graphics: Exploring the Intersection of Polygon Computer Graphics and Computer Vision
From Everand
Polygon Computer Graphics: Exploring the Intersection of Polygon Computer Graphics and Computer Vision
Fouad Sabry
No ratings yet
Raster Graphics Editor: Transforming Visual Realities: Mastering Raster Graphics Editors in Computer Vision
From Everand
Raster Graphics Editor: Transforming Visual Realities: Mastering Raster Graphics Editors in Computer Vision
Fouad Sabry
No ratings yet
Rougier 2013 Poly Lines
No ratings yet
Rougier 2013 Poly Lines
17 pages
Rendering Computer Graphics: Exploring Visual Realism: Insights into Computer Graphics
From Everand
Rendering Computer Graphics: Exploring Visual Realism: Insights into Computer Graphics
Fouad Sabry
No ratings yet
Ray Tracing On GPU: University of Applied Sciences Basel (FHBB) Diploma Thesis
No ratings yet
Ray Tracing On GPU: University of Applied Sciences Basel (FHBB) Diploma Thesis
44 pages
Oberberger Max Advanced+Graphics+Summit
No ratings yet
Oberberger Max Advanced+Graphics+Summit
199 pages
Smooth GPU Tessellation
No ratings yet
Smooth GPU Tessellation
9 pages
Are We Done With Ray Tracing
No ratings yet
Are We Done With Ray Tracing
91 pages
Vector Graphics Editor: Empowering Visual Creation with Advanced Algorithms
From Everand
Vector Graphics Editor: Empowering Visual Creation with Advanced Algorithms
Fouad Sabry
No ratings yet
Texture Mapping: Exploring Dimensionality in Computer Vision
From Everand
Texture Mapping: Exploring Dimensionality in Computer Vision
Fouad Sabry
No ratings yet
Modern GPU Architecture
No ratings yet
Modern GPU Architecture
93 pages
Survey of Nvidia RTX Technolog
No ratings yet
Survey of Nvidia RTX Technolog
9 pages
Image Segmentation: Unlocking Insights through Pixel Precision
From Everand
Image Segmentation: Unlocking Insights through Pixel Precision
Fouad Sabry
No ratings yet
Image Based Modeling and Rendering: Exploring Visual Realism: Techniques in Computer Vision
From Everand
Image Based Modeling and Rendering: Exploring Visual Realism: Techniques in Computer Vision
Fouad Sabry
No ratings yet
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
From Everand
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
Fouad Sabry
No ratings yet
Computer Graphics 2 - Object Representations: Tom Thorne
No ratings yet
Computer Graphics 2 - Object Representations: Tom Thorne
43 pages
Ray Tracing Graphics: Exploring Photorealistic Rendering in Computer Vision
From Everand
Ray Tracing Graphics: Exploring Photorealistic Rendering in Computer Vision
Fouad Sabry
No ratings yet
GPU-Accelerated Minimum Distance and Clearance Queries PDF
No ratings yet
GPU-Accelerated Minimum Distance and Clearance Queries PDF
14 pages
Engel W Ed Gpu Pro 4 Advanced Rendering Techniques
100% (1)
Engel W Ed Gpu Pro 4 Advanced Rendering Techniques
370 pages
Karis Nanite SIGGRAPH Advances 2021 Final
No ratings yet
Karis Nanite SIGGRAPH Advances 2021 Final
155 pages
Easy Going Vector Graphics As Textures On The GPU
No ratings yet
Easy Going Vector Graphics As Textures On The GPU
4 pages
Weiskopf 2004
No ratings yet
Weiskopf 2004
9 pages
CialloAbstract
No ratings yet
CialloAbstract
2 pages
Vertex Computer Graphics: Exploring the Intersection of Vertex Computer Graphics and Computer Vision
From Everand
Vertex Computer Graphics: Exploring the Intersection of Vertex Computer Graphics and Computer Vision
Fouad Sabry
No ratings yet
A Realistic 2D Drawing System: Online Submission ID: 513
100% (1)
A Realistic 2D Drawing System: Online Submission ID: 513
8 pages
Shader: Exploring Visual Realms with Shader: A Journey into Computer Vision
From Everand
Shader: Exploring Visual Realms with Shader: A Journey into Computer Vision
Fouad Sabry
No ratings yet
Hermite Spline Tubes VIS2020
No ratings yet
Hermite Spline Tubes VIS2020
5 pages
GPU-Based Nonlinear Ray Tracing: EUROGRAPHICS 2004 / M.-P. Cani and M. Slater (Guest Editors)
No ratings yet
GPU-Based Nonlinear Ray Tracing: EUROGRAPHICS 2004 / M.-P. Cani and M. Slater (Guest Editors)
9 pages
Motion Estimation: Advancements and Applications in Computer Vision
From Everand
Motion Estimation: Advancements and Applications in Computer Vision
Fouad Sabry
No ratings yet
Key-Frame Animation & Related Techniques
No ratings yet
Key-Frame Animation & Related Techniques
12 pages
Assignment III
No ratings yet
Assignment III
6 pages
07 Pipeline
No ratings yet
07 Pipeline
62 pages
Krueger vg2010
No ratings yet
Krueger vg2010
4 pages
CUDA Cuts Fast Graph Cuts On The GPU
No ratings yet
CUDA Cuts Fast Graph Cuts On The GPU
8 pages
(CG - Ver1) 2 - Modeling-1
No ratings yet
(CG - Ver1) 2 - Modeling-1
43 pages
S5403 Nelson Inoue
No ratings yet
S5403 Nelson Inoue
36 pages
Tutorial9 WebGPU Encoding and Compression
No ratings yet
Tutorial9 WebGPU Encoding and Compression
15 pages
Computer Stereo Vision: Exploring Depth Perception in Computer Vision
From Everand
Computer Stereo Vision: Exploring Depth Perception in Computer Vision
Fouad Sabry
No ratings yet
Raster Graphics: Understanding the Foundations of Raster Graphics in Computer Vision
From Everand
Raster Graphics: Understanding the Foundations of Raster Graphics in Computer Vision
Fouad Sabry
No ratings yet
ECE 408 - Final PresentationPDF
No ratings yet
ECE 408 - Final PresentationPDF
27 pages
Bump Mapping: Exploring Depth in Computer Vision
From Everand
Bump Mapping: Exploring Depth in Computer Vision
Fouad Sabry
No ratings yet
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
From Everand
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
Fouad Sabry
No ratings yet
Persistent Grid Mapping
No ratings yet
Persistent Grid Mapping
10 pages
Rendering Pipeline: Viewing: Geometry Processing Rendering Pixel Processing
No ratings yet
Rendering Pipeline: Viewing: Geometry Processing Rendering Pixel Processing
16 pages
Computer Game Development / Design: Edited by Wolfgang Engel
No ratings yet
Computer Game Development / Design: Edited by Wolfgang Engel
574 pages
Graphics Pipeline & Rasterization MIT
No ratings yet
Graphics Pipeline & Rasterization MIT
98 pages
Raytracing Dynamic Scenes On The GPU Using Grids: Sashidhar Guntury and P.J. Narayanan
No ratings yet
Raytracing Dynamic Scenes On The GPU Using Grids: Sashidhar Guntury and P.J. Narayanan
12 pages
GPU Pro 1
No ratings yet
GPU Pro 1
711 pages
Chapter 5 - Representing 3D Objects
No ratings yet
Chapter 5 - Representing 3D Objects
29 pages
AdmitCard PDF
No ratings yet
AdmitCard PDF
1 page
Abaqus - Nonlinear Analysis of Reinforced Concrete Beam Experimentation2015
No ratings yet
Abaqus - Nonlinear Analysis of Reinforced Concrete Beam Experimentation2015
5 pages
I Killed An Academy Player Chapter 26 - Asura Sca
No ratings yet
I Killed An Academy Player Chapter 26 - Asura Sca
11 pages
College Events Management System Project
No ratings yet
College Events Management System Project
11 pages
ProSoft ROCKWELL CATALOG 2019 EN-dig
No ratings yet
ProSoft ROCKWELL CATALOG 2019 EN-dig
16 pages
Multidimensional Model Programming
100% (2)
Multidimensional Model Programming
317 pages
Sbi
No ratings yet
Sbi
1 page
Aug Realityr Paper
No ratings yet
Aug Realityr Paper
7 pages
DP Transmitter Yokogawa EJX 110A Fieldbus System
100% (1)
DP Transmitter Yokogawa EJX 110A Fieldbus System
162 pages
IT 6th 2020-24
No ratings yet
IT 6th 2020-24
21 pages
A Context-Aware IoT-Based Smart Wearable Health Monitoring System
No ratings yet
A Context-Aware IoT-Based Smart Wearable Health Monitoring System
6 pages
PrimaVera PM P3 PARefMan
No ratings yet
PrimaVera PM P3 PARefMan
156 pages
IT - 12th - Lab Manuals - Subject Specific
No ratings yet
IT - 12th - Lab Manuals - Subject Specific
20 pages
A New Approach To Voltage Sag Detection Based On Wavelet Transform
No ratings yet
A New Approach To Voltage Sag Detection Based On Wavelet Transform
8 pages
Ucc2893 PDF
No ratings yet
Ucc2893 PDF
41 pages
Chemistry Today June 2024
No ratings yet
Chemistry Today June 2024
101 pages
Project Management Lean & Six Sigma in Construction: Waste
No ratings yet
Project Management Lean & Six Sigma in Construction: Waste
3 pages
Nyxia Uprising The Nyxia Triad Scott Reintgen Download
No ratings yet
Nyxia Uprising The Nyxia Triad Scott Reintgen Download
28 pages
CSC203
No ratings yet
CSC203
5 pages
تحديد الأجور في المؤسسات العمومية الإدارية دراسة حالة مديرية التربية لولاية تلمسان
No ratings yet
تحديد الأجور في المؤسسات العمومية الإدارية دراسة حالة مديرية التربية لولاية تلمسان
15 pages
Machine Learning Notes 1
No ratings yet
Machine Learning Notes 1
120 pages
Cloud Computing - II Unit - IV
No ratings yet
Cloud Computing - II Unit - IV
12 pages
Can & MCP2515
No ratings yet
Can & MCP2515
31 pages
Sas-24-I Paper I Punjabi Iiii
No ratings yet
Sas-24-I Paper I Punjabi Iiii
16 pages
Corrected Thesis JoaoMaria67923
No ratings yet
Corrected Thesis JoaoMaria67923
118 pages
Lalit GJ
No ratings yet
Lalit GJ
15 pages
Dodgysquare (I Made This One 2)
No ratings yet
Dodgysquare (I Made This One 2)
2 pages
The Changing Role of Information System in Organisations
No ratings yet
The Changing Role of Information System in Organisations
9 pages
Asr1001 Datasheet PDF
No ratings yet
Asr1001 Datasheet PDF
6 pages

GPU-Driven Real-Time Mesh Contour Vectorization

Uploaded by

GPU-Driven Real-Time Mesh Contour Vectorization

Uploaded by

Eurographics Symposium on Rendering (2022)

A. Ghosh and L.-Y. Wei (Editors)

GPU-Driven Real-Time Mesh Contour Vectorization

Wangziwei Jiang, Guiqing Li∗ , Yongwei Nie, Chuhua Xian

1. Introduction the contour of a given 3D model into 2D or 3D stroke curves

© 2022 The Author(s)

© 2022 The Author(s)

© 2022 The Author(s)

© 2022 The Author(s)

© 2022 The Author(s)

(a) The case of foreground and background contours meet at a junction.

Figure 7: Pixel-edge loops: two pixel-edge loops with a red arrow

© 2022 The Author(s)

6.3. Stroke generation based on inside-outside test

7.1. Implementation details

© 2022 The Author(s)

is the vectorization process consisting of pixel-edge chaining (Sec-

Figure 11: GPU runtime performance per stage, under 1920 ×

© 2022 The Author(s)

© 2022 The Author(s)

© 2022 The Author(s)

© 2022 The Author(s)

(e) Active Strokes

© 2022 The Author(s)

[Wyl79] W YLLIE J. C.: The Complexity of Parallel Computations. PhD

© 2022 The Author(s)

You might also like