
Understanding AMD’s FidelityFX

Overview
From: https://gpuopen.com/fidelityfx-superresolution/

FidelityFX Super Resolution is a spatial upscaler: it works by taking the current anti-aliased
frame and upscaling it to display resolution without relying on other data such as frame history
or motion vectors.
At the heart of FSR is a cutting-edge algorithm that detects and recreates high-resolution edges
from the source image. Those high-resolution edges are a critical element required for turning
the current frame into a “super-resolution” image.
FSR provides consistent upscaling quality regardless of whether the frame is in motion, which
can provide quality advantages compared to other types of upscalers.
FSR is composed of two main passes:
● An upscaling pass called EASU (Edge-Adaptive Spatial Upsampling) that also performs
edge reconstruction. In this pass, the input frame is analyzed and the main part of the
algorithm detects gradient reversals – essentially looking at how neighboring gradients
differ – from a set of input pixels. The intensity of the gradient reversals defines the
weights to apply to the reconstructed pixels at display resolution.
● A sharpening pass called RCAS (Robust Contrast-Adaptive Sharpening) extracts pixel
detail in the upscaled image.
FSR also comes with helper functions for color space conversions, dithering, and tone mapping
to assist with integrating it into common rendering pipelines used with today’s games.

FidelityFX Super Resolution looks for gradient reversals in the source image to reconstruct
high-definition edges at upscaled resolution.

Notes on the code:

● All code is at https://github.com/GPUOpen-Effects/FidelityFX-FSR


● The interesting parts live in two files: ffx-fsr/ffx_a.h, which holds all the portability
definitions, and ffx-fsr/ffx_fsr1.h, which holds all the real code.

EASU Explanation
In the following we explain everything we know/understand/gathered about this technique.

EASU preprocessing
● Image should be well antialiased (TAA, MSAA, etc.)
● Image should be in perceptual space
○ This means we should do a conversion. The ideal conversion is described
elsewhere, for instance at these blogs (in particular, the second one):
https://lettier.github.io/3d-game-shaders-for-beginners/gamma-correction.html
https://chilliant.blogspot.com/2012/08/srgb-approximations-for-hlsl.html
However, in the Unity presentation at SIGGRAPH, they say they used sqrt to go
from RGB to sRGB, and squaring to go the other way round. Not as accurate,
but probably faster (see the sketch after this list).
● Input image must be normalized to [0,1]
○ Negative inputs cause RCAS to output NaN!
● Image should be generated using negative MIP bias to increase texture detail
● Image should be noise-free
○ Add noise/grain AFTER upscaling with FidelityFX
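
A minimal sketch of the cheap sqrt-based conversion mentioned above (our naming; the exact transfer function is the engine's choice, and this is not exact sRGB):

#include <math.h>

// Cheap linear <-> perceptual approximation (sqrt in, square out).
// Not as accurate as the real sRGB curve, but fast.
float linear_to_perceptual(float c) { return sqrtf(c); }
float perceptual_to_linear(float c) { return c * c; }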

EASU Algorithm
● Uses a fixed 12-tap kernel window, selecting the nearest 12 taps in a circular pattern.
EASU requires an analysis of those 12 taps before it can figure out the filter kernel.
The reason 12 taps were chosen, instead of 16, is that with 12 taps you only need
36 registers for the 32-bit version.

❖ 12 Taps = Good Upper Limit


❖ Single-pass algorithm (radial/elliptical filtering)
❖ 12 taps * 3 channels = 36 VGPRs (FP32)
❖ To avoid reading the 12 taps twice, the algorithm has to keep all of them in
registers for its full duration. Therefore, if you wanted to do anything higher,
you'd run out of temporary registers for logic. The goal is to stay at or under
64 registers, as that is a good upper limit on AMD's hardware to be able to
hide latency.
❖ 64 VGPRs (good upper limit) - 36 = 28 VGPRs for logic
❖ The algorithm needs all 12 taps for analysis then filtering

● Does analysis on each ‘+’ pattern that surrounds the inner 2x2 quad in luma (r+2g+b).

So if we look at the 12-tap kernel, there are 4 taps in the center, and for each one of
those, it needs to compute the analysis for direction and length. The analysis works in
luma, and by luma I mean an approximation: red plus two green plus blue. So it is not a
complicated luma; it is more of a “get all the channels included so we don’t miss
anything” approximation (see the sketch below).
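
A trivial restatement of that approximation (the shipping code, shown later, computes half this value to save an operation):

// r+2g+b pseudo-luma: cheap and channel-inclusive, not perceptually weighted.
float easu_luma(float r, float g, float b) { return r + 2.0f * g + b; }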

❖ Example of “Pass Merging”


❖ Analysis could be done as a separate pass
❖ But that would require an extra round trip through memory, using even more bandwidth
❖ Instead the ALU logic gets duplicated 4 times per output pixel

● Analysis is bilinearly interpolated and used to shape final filter kernel

The analysis is done on the 2x2 quad, and this is effectively a form of pass merging.
The analysis could have been done in a separate pass, but that would require two round
trips through memory, and we don’t want that; instead we duplicate a small amount of
work in the shader so we don’t have to go through memory multiple times. Once the
analysis is finished, we bilinearly interpolate it at the position we actually want to
filter at, and that is used to shape the final filter kernel.

This part is set up in the following function (line 156 of ffx_fsr1.h):

A_STATIC void FsrEasuCon(


outAU4 con0,
outAU4 con1,
outAU4 con2,
outAU4 con3,
// This the rendered image resolution being upscaled
AF1 inputViewportInPixelsX,
AF1 inputViewportInPixelsY,
// This is the resolution of the resource containing the input image (useful for dynamic resolution)
AF1 inputSizeInPixelsX,
AF1 inputSizeInPixelsY,
// This is the display resolution which the input image gets upscaled to
AF1 outputSizeInPixelsX,
AF1 outputSizeInPixelsY){

}
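
A minimal CPU-side usage sketch (the resolutions are made-up example values; assumes a C build with A_CPU defined so AU1/AF1 map to uint32_t/float):

// Constants consumed later by FsrEasuF().
AU1 con0[4], con1[4], con2[4], con3[4];
FsrEasuCon(con0, con1, con2, con3,
           1280.0f,  720.0f,   // rendered viewport being upscaled
           1280.0f,  720.0f,   // full size of the input resource
           2560.0f, 1440.0f);  // display resolution to upscale to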

However, that is only the setup; the function where this is really applied, i.e., where each
tap is filtered and accumulated, is (line 239):

void FsrEasuTapF(
inout AF3 aC, // Accumulated color, with negative lobe.
inout AF1 aW, // Accumulated weight.
AF2 off, // Pixel offset from resolve position to tap.
AF2 dir, // Gradient direction.
AF2 len, // Length.
AF1 lob, // Negative lobe strength.
AF1 clp, // Clipping point.
AF3 c){ // Tap color.

}
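Paraphrasing our reading of the surrounding source (a sketch, not verbatim): FsrEasuF invokes this once per tap, passing that tap's offset from the resolve position and its color, then normalizes the accumulated color by the accumulated weight:

// One call per tap, offsets relative to the resolve position 'pp':
FsrEasuTapF(aC, aW, AF2( 0.0,-1.0)-pp, dir, len2, lob, clp, bC); // tap b
FsrEasuTapF(aC, aW, AF2( 1.0,-1.0)-pp, dir, len2, lob, clp, cC); // tap c
// ... 10 more taps ...
pix = aC * AF3_(ARcpF1(aW)); // normalize by the total weight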
EASU Sampling

● The 12-tap pattern is fetched via 4 gather positions


● Set up so the {X,Y} and {Z,W} pairs have the necessary data
● Otherwise we would have to shuffle data around

EASU Analysis

Edge direction is estimated from a central difference: for the analysis, once the taps are in,
the edge direction is estimated using a central difference. The central difference does miss
single-pixel features; however, as we will see later, as feature-length becomes very small the
filter kernel becomes symmetric and non-directional, so we don’t care about directionality for
thin features.

A diagonal diff would have been more expensive and would have a 0.5-texel offset: therefore a
diagonal diff is not used; it would have cost more, and we would have had to deal with a
half-texel offset, which would have made the logic a little more complicated. It is OK to miss
single-pixel features (feature-length forces a small symmetric, non-directional filter in those
cases anyway). So once the edge direction is finished, we look at feature-length, which we
estimate by looking at the 3 texels in the horizontal and 3 texels in the vertical.

Feature-length is estimated by scoring the amount of gradient reversal, looking at what happens
with the luma gradient: if the luma gradient has a reversal, for instance starting at black,
going to white, and returning to black, that is a “Full Reversal”, which has a significant
probability of being a thin feature. Whereas if we look at something with no reversal, say
going from black to white and staying at white, that is probably a large feature, on which we
can use a larger filter kernel.
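
A worked sketch of that score under our own naming (it mirrors the lenX code shown later, with a small epsilon standing in for the approximate reciprocal; full reversal scores 0, no reversal scores 1):

#include <math.h>

// Luma samples b, c, d along one axis ('c' is the center tap).
float reversal_score(float b, float c, float d) {
    float dc = d - c, cb = c - b;
    float m = fmaxf(fabsf(dc), fabsf(cb)) + 1e-6f; // largest local step
    float dirX = d - b;                            // central difference
    float s = fminf(fabsf(dirX) / m, 1.0f);        // saturate
    return s * s;                                  // shaped
}
// Full reversal (black-white-black): reversal_score(0, 1, 0)   == 0
// No reversal (black-grey-white):    reversal_score(0, 0.5, 1) == 1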

EASU and Color Spaces


Most gaming AA ends up with perceptually even gradients on edges

Thus directional analysis works better in perceptual space for games


● A directional analysis is based on horizontal and vertical gradients
● Perceptual as in sRGB (piecewise curve), gamma 2.0, gamma 2.2, etc

This keeps the computation cheap: if we took linear input and converted to perceptual inside
EASU, we would have to do that 12 times, once per tap. Since linear-to-perceptual transforms
are expensive and EASU uses 12 taps, it is much better, and in fact required for good
performance, to factor any linear-to-perceptual translation into the pass prior to EASU.

Highly recommended to run EASU in a perceptual space


● It will work in linear too, just doesn’t look as good on some content

The one compromise, of course, is that if we run in perceptual space, all the filtering runs in
perceptual space too, but as it turns out that is typically acceptable.
EASU Kernel Shaping
Analysis after interpolation produces {direction, length}
● The ‘direction’ used to rotate the filter kernel
● The ‘length’ drives post-rotation kernel scaling, and kernel window adjustment

X scales from {1.0 to sqrt(2.0)} on {axis-aligned to diagonal} direction


● Diagonals get larger kernels as they can sharpen more without banding
Y scales from {1.0 to 2.0} on {small to larger feature length}
● Small axis-aligned features end up with a small symmetric kernel to avoid artifacts
● Longer features get a larger kernel to better restore the edge

So once we have all the analysis finished, we have a {direction, length} pair for the whole
2x2 quad. We use the interpolated direction to rotate the filter kernel, the length to drive
the post-rotation kernel scaling on the X and Y axes, and also the length to adjust the kernel
window (covered further below).
On the X-axis, we go from no scaling to sqrt(2) depending on whether we are axis-aligned or
running on the diagonal. When axis-aligned we don’t scale the X-axis at all, but on a diagonal
we scale by sqrt(2), because we can allow a larger kernel there without seeing any banding.
The banding would have been created by the negative lobe.
The Y-axis goes from no scaling up to double size. We use no scaling for small features, so we
end up with a small symmetric kernel which does not sample outside the feature itself. As the
feature gets larger, we use a longer kernel so we can better restore the edge.
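
A sketch of that shaping logic under our own naming (the real shader derives the scale factors directly from the interpolated dir/len values; diagonality and featureLen here are stand-ins):

#include <math.h>

// diagonality in [0,1]: 0 = axis-aligned edge, 1 = 45-degree diagonal.
// featureLen  in [0,1]: 0 = tiny feature,      1 = long feature.
void shape_kernel(float diagonality, float featureLen,
                  float* scaleX, float* scaleY) {
    *scaleX = 1.0f + (sqrtf(2.0f) - 1.0f) * diagonality; // 1 .. sqrt(2)
    *scaleY = 1.0f + featureLen;                         // 1 .. 2
}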
EASU Kernel

Uses a polynomial approximation to Lanczos. This lives in the function FsrEasuTapF shown
earlier (“Filtering for a given tap for the scalar.”).

● Lanczos is expensive, using {sin(), rcp(), sqrt()}; those are transcendental-class
instructions, which run at quarter rate depending on your hardware, and therefore
they are best avoided if possible.

The EASU kernel itself started as a polynomial approximation to lanczos(2).


So instead it is broken down into a base and a window, similar to the way Lanczos is a sinc
function windowed by another sinc function, and also because we want the window to be
adaptable to the length. When the window is small, the kernel goes from +/- sqrt(2); the
window has been shortened, which truncates the negative lobe: we don’t get as much sharpening,
but we also don’t get the ringing and other problems we would potentially have. The wide
kernel goes from +/- 2, and that kernel has a very strong negative lobe which helps restore
the edge.

Instead base*window is used. This implements an approximation of lanczos(2) without sin(),
rcp(), or sqrt(), taking x^2 directly:

(25/16 * (2/5 * x^2 - 1)^2 - (25/16 - 1)) * (1/4 * x^2 - 1)^2
\________________ base _________________/ \_____ window ____/

Here the window term is shown with w = 1/4; in general, w varies from 1/4 for the {+/- 2}
kernel to 1/2 for the {+/- sqrt(2)} kernel. Note, the general form of the 'base' is

(a*(b*x^2-1)^2 - (a-1)),

where a = 1/(2*b - b^2) and 'b' moves the negative lobe around.
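
As a plain-C sketch (our naming; the shipping shader computes the same thing vectorized, taking x^2 and the window term as inputs):

// EASU kernel weight: polynomial base * adjustable window.
// x2 = x*x; w = 1/4 for the +/-2 kernel, 1/2 for the +/-sqrt(2) kernel.
float easu_weight(float x2, float w) {
    float b = (2.0f / 5.0f) * x2 - 1.0f;
    float base = (25.0f / 16.0f) * b * b - (25.0f / 16.0f - 1.0f);
    float win = w * x2 - 1.0f;
    return base * win * win;
}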

EASU Deringing
The local 2x2 texel quad {min,max} is used to clamp the EASU output

Removes all ringing

Also removes some artifacts of the 12-tap limited window


● Or alternatively, some artifacts of the kernel adaptation
We move on to the deringing step, where we take the local 2x2 texel quad,
the min and max of RGB, and use that to clamp the EASU output.
This removes all the ringing.
It also removes artifacts of the limited 12-tap window.
Therefore, when the scaling is larger and you might see clipping of the window,
it is best to run the deringing step to minimize that.
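
A minimal sketch of that clamp (our naming; the real code computes per-channel min/max of the inner quad {f,g,j,k}):

#include <math.h>

// Clamp the filtered result to the local 2x2 quad's range, per channel.
float dering(float pix, float f, float g, float j, float k) {
    float lo = fminf(fminf(f, g), fminf(j, k));
    float hi = fmaxf(fmaxf(f, g), fmaxf(j, k));
    return fminf(fmaxf(pix, lo), hi);
}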

The analysis part is implemented via two functions, FsrEasuSetF and FsrEasuF. First,
FsrEasuSetF accumulates direction and length:

void FsrEasuSetF(
inout AF2 dir,
inout AF1 len,
AF2 pp,
AP1 biS,AP1 biT,AP1 biU,AP1 biV,
AF1 lA,AF1 lB,AF1 lC,AF1 lD,AF1 lE){

}

Direction is the '+' diff:

 a
bcd
 e

Then, the algorithm takes the magnitude from the abs average of both sides of 'c'. Length
converts gradient reversal to 0, smoothly up to non-reversal at 1, shaped, then adds the
horizontal and vertical terms. This is done as:

AF1 dc=lD-lC;
AF1 cb=lC-lB;
AF1 lenX=max(abs(dc),abs(cb));  // largest local step on either side of 'c'
lenX=APrxLoRcpF1(lenX);         // approximate reciprocal
AF1 dirX=lD-lB;                 // central difference = edge direction
dir.x+=dirX*w;                  // accumulate direction, bilinearly weighted
lenX=ASatF1(abs(dirX)*lenX);    // reversal -> 0, no reversal -> 1
lenX*=lenX;                     // shaped
len+=lenX*w;                    // accumulate length, bilinearly weighted

Then the code repeats this for the y axis, accumulating into the variables dir and len. The helpers come from ffx_a.h:

AF1 APrxLoRcpF1(AF1 a){return AF1_AU1(AU1_(0x7ef07ebb)-AU1_AF1(a));}


A_STATIC AF1 ASatF1(AF1 a){return AMinF1(1.0f,AMaxF1(0.0f,a));}

Here AMinF1 and AMaxF1 are simple minimum- and maximum-computing functions:

A_STATIC AF1 AMinF1(AF1 a,AF1 b){return a<b?a:b;}


A_STATIC AF1 AMaxF1(AF1 a,AF1 b){return a>b?a:b;}

And AF1_AU1 reinterprets the bits of a uint as a float (AU1_AF1 is the reverse bit-cast), while AU1_ is a simple uint cast; the reciprocal approximation above works purely by bit manipulation of the IEEE-754 representation.
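
For illustration, a standalone C equivalent of that reciprocal trick (our naming; same magic constant as APrxLoRcpF1):

#include <stdint.h>
#include <string.h>

// Low-precision 1/x via a magic subtraction in bit space.
static float prx_lo_rcp(float a) {
    uint32_t u;
    memcpy(&u, &a, sizeof u);  // bit-cast float -> uint
    u = 0x7ef07ebbu - u;       // the constant from APrxLoRcpF1
    float r;
    memcpy(&r, &u, sizeof r);  // bit-cast back
    return r;                  // e.g. prx_lo_rcp(2.0f) is roughly 0.47
}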

Finally, the function FsrEasuF is where the heavy lifting happens:

void FsrEasuF(
out AF3 pix,
AU2 ip, // Integer pixel position in output.
AU4 con0, // Constants generated by FsrEasuCon().
AU4 con1,
AU4 con2,
AU4 con3){

}

This code first computes the position of sample “f” from ip, the position of the output pixel
to calculate. Then it applies the 12-tap kernel:

 bc
efgh
ijkl
 no

Remember that gather4 has the following ordering:

a b
r g

For packed FP16, we need either {rg} or {ab} pairs, so the following setup is used for the
gathers in all versions. Also note that the top {a b} row and the bottom {r g} row are
unused (marked 'z'):
//   a b    <- unused (z)
//   r g
// a b a b
// r g r g
//   a b
//   r g    <- unused (z)
// Allowing dead-code removal to remove the 'z's.

Then it applies the simplest multi-channel approximate luma possible (“luma times 2”, in 2
FMA/MAD); each value works out to r*0.5 + g + b*0.5, i.e. half of the r+2g+b approximation
used in the analysis:

AF4 bczzL=bczzB*AF4_(0.5)+(bczzR*AF4_(0.5)+bczzG);
AF4 ijfeL=ijfeB*AF4_(0.5)+(ijfeR*AF4_(0.5)+ijfeG);
AF4 klhgL=klhgB*AF4_(0.5)+(klhgR*AF4_(0.5)+klhgG);
AF4 zzonL=zzonB*AF4_(0.5)+(zzonR*AF4_(0.5)+zzonG);

Next, it accumulates direction and length into the variables dir and len, bilinearly weighted across the 2x2 quad:

AF2 dir=AF2_(0.0);
AF1 len=AF1_(0.0);
FsrEasuSetF(dir,len,pp,true, false,false,false,bL,eL,fL,gL,jL);
FsrEasuSetF(dir,len,pp,false,true ,false,false,cL,fL,gL,hL,kL);
FsrEasuSetF(dir,len,pp,false,false,true ,false,fL,iL,jL,kL,nL);
FsrEasuSetF(dir,len,pp,false,false,false,true ,gL,jL,kL,lL,oL);
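
The four booleans select which corner of the quad each call corresponds to; paraphrasing FsrEasuSetF's internals (a sketch of our reading of the source), each one picks the bilinear weight w from the fractional position pp:

// Inside FsrEasuSetF: pick the bilinear weight for this corner.
AF1 w = AF1_(0.0);
if (biS) w = (AF1_(1.0) - pp.x) * (AF1_(1.0) - pp.y); // corner f
if (biT) w =              pp.x  * (AF1_(1.0) - pp.y); // corner g
if (biU) w = (AF1_(1.0) - pp.x) *              pp.y;  // corner j
if (biV) w =              pp.x  *              pp.y;  // corner k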
