WebGL Forward+ and Clustered Deferred Shading

University of Pennsylvania, CIS 565: GPU Programming and Architecture, Project 5

Thy (Tea) Tran
- LinkedIn, personal website, email
Tested on: Google Chrome, Version 86.0.4240.183 (Official Build) (64-bit) on Windows 10, i7-8750H @ 2.20GHz 22GB, GTX 1070

Live Online

Demo Video/GIF

Features

Forward plus

I tried implementing this parts in 3 different ways.
- The first way is to iterate through all the x, y and z indices of the z-buffer, then have an inner loop to iterate through all the lights to check whether it intersects with this cluster. Constructing the cluster frustum takes time due to the construction of 6 planes making the frustum, and iterating through all lights is problematic when there are large number of lights in a scene.
- The second and third ways are similar to each other. I iterate through all the lights, and for each light, i find the bounding box for the light and find the minimum and maximum x, y, and z indices of the cluster that the light overlaps. Then, iterate through those clusters and update them to include this light to have influence on the geometry within the cluster. However, the difference is that one method calculates the view and width of the frustum slice where the bounding coordinates lie on to find the min and max cluster indicies (x and y), while the other gets the cluster indices from transforming the bounding cordinates to screen space. The former method requires less matrix math and fewer calculations, which makes it faster than the latter method.
For the shader, to test my calculation for the cluster index per fragment, I output the calculated cluster index in each x, y, z direction.

Cluster shader x	Cluster shader y	Cluster shader z

Clustered deferred shading

Reuse clustering logic from with optimizations
Store vertex attributes in g-buffers (position, albedo color and normal)
Read g-buffer in a shader to produce final output

To make sure that I was inputting and getting the right data from the g-buffers, I output the data from g-buffers to the final shader.

Position	Color (albedo)	Normal

Blinn-Phong shading for point lights (deferred shading)

The effects makes the light reflection more intense at some places, while the original implementation makes the light looking more diffuse (spreading out more uniformly)

No Blinn-Phong	Blinn-Phong

Optimizations for g-buffers by using 2-component normals

By storing 2-components of a normal, we can estimate the remaining component by 'normal.z = sqrt(1 - pow(normal.x, 2) - pow(normal.y, 2))' because the normals are normalized. However, this is clearly not perfect, since the third component can originally be positive or negative, but we can't have that exact info when re-constructing the third component. There are some details lost as a result. Another noticeable artifact is the occasional random black specs in the renders.

Normals (ground truth)	Normals (z reconstructed)

Deferred shading (ground truth)	Deferred shading (z-reconstructed)

Performance & Analysis

Why is clustered deferred faster than forward plus, and forward plus faster than forward?
- Clustered deferred is better than forward plus because of the lighting calculation in the final shader. If a scene is complex, having all polygons in a fragment shader and also calculating the lighting per fragment would be an expensive combination (the fragment shader has to do a lot of work.) By using g-buffers in deferred shading, we break down the tasks and hence make it lighter per fragment to calculate lighting. The reason that clustered deferred and forward plus faster than forward is that in the fragment shader, forward would consider all lights in the scene and the lighting calculation hence is expensive. However, the clustering method in both clustered deferred and forward plus makes sure that each fragment only calculates lighting term with lights that actually has impacts on the fragment.
Tradeoffs between forward+ and clustered deferred?
- Clustered deferred requires many g-buffers to store data, and packing/reducing data often requires some additional calculation and potentially loss in details (which happens with reconstructing the third component of the normal above.) Besides, having g-buffers mean requiring more memory usage. It might not be ideal if there are a lot of attributes to store and when the image is huge. However, as the analysis above, supported by the runtime graph comparing forward plus and clustered deferred, it is better to pick deferred when there are a lot of lights in the scene.

Credits

Three.js by @mrdoob and contributors
stats.js by @mrdoob and contributors
webgl-debug by Khronos Group Inc.
glMatrix by @toji and contributors
minimal-gltf-loader by @shrekshao

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
img		img
lib		lib
models/sponza		models/sponza
src		src
.gitignore		.gitignore
INSTRUCTION.md		INSTRUCTION.md
README.md		README.md
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
webpack.config.js		webpack.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WebGL Forward+ and Clustered Deferred Shading

Live Online

Demo Video/GIF

Features

Performance & Analysis

Credits

About

Releases

Packages

Languages

tatran5/Project5-WebGL-Forward-Plus-and-Clustered-Deferred

Folders and files

Latest commit

History

Repository files navigation

WebGL Forward+ and Clustered Deferred Shading

Live Online

Demo Video/GIF

Features

Performance & Analysis

Credits

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages