Name	Name	Last commit message	Last commit date
Latest commit History 639 Commits
LICENSE	LICENSE
README.md	README.md

Awesome 3D Gaussian Splatting Resources

A curated list of papers and open-source resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months. If you have any additions or suggestions, feel free to contribute. Additional resources like blog posts, videos, etc. are also welcome.

Seminal Paper introducing 3D Gaussian Splatting

3D Object Detection
Autonomous Driving
Avatars
Classic work
Compression
Diffusion
Dynamics and Deformation
Editing
Language Embedding
Mesh Extraction and Physics
Misc
Regularization and Optimization
Rendering
Reviews
SLAM
Sparse
Navigation and Autonomous Driving
Poses
Large-Scale

Data
Courses

Open Source Implementations

Blog Posts
Tutorial Videos
Credits

Update Log:

Oct 24, 2024

Added 2 papers: IGS, V^3

Oct 16, 2024

Added one paper:DGD

Sept 07, 2024

Added one paper:MoDGS

May 10, 2024

Added 18 papers: Z-Splat, Dual-Camera, StylizedGS, Hash3D, Revisiting Densification, Gaussian Pancakes, 3D-aware Deformable Gaussians, SpikeNVS, Zero-shot PC completion, SplatPose, DreamScene360, RealmDreamer, Gaussian-ILC, Reinforcment Learning with GGS, GoMAvatar, OccGaussian, LoopGaussian, Review

April 11, 2024

Code release of latentSplat

April 9, 2024

Added 1 paper: EgoLifter

April 8, 2024

Added 3 papers: Robust Gaussian Splatting, SC4D, and MM-Gaussian

April 5, 2024

Added 5 papers: Surface Reconstruction, TCLC-GS, GaSpCT, OmniGS, and Per-Gaussian Embedding,
Fixes

April 2, 2024

Added 11 papers: HO, SGD, HGS, Snap-it, InstantSplat, 3DGSR, MM3DGS, HAHA, CityGaussain, Mirror-3DGS, and Feature Splatting

March 30, 2024

Added 8 papers: Modeling uncertainty, GRM, Gamba, CoherentGS, TOGS, SA-GS, and GaussianCube

March 27, 2024

Added Other Implementation: 360-gaussian-splatting
CVPR '24 labels added
Added 5 papers: Comp4D, DreamPolisher, DN-Splatter, 2D GS, and Octree-GS

March 26, 2024

Added 13 paper: latentSplat, GS on the Move, RadSplat, Mini-Splatting, SyncTweedies, HAC, STAG4D, EndoGSLAM, Pixel-GS, Semantic Gaussians, Gaussian in the Wild, CG-SLAM, and GSDF

March 24, 2024:

Added paper: Gaussian Frosting

March 20, 2024:

Added 4 papers: GVGEN, HUGS, RGBD GS-ICP SLAM, and High-Fidelity SLAM

March 19, 2024:

Added Pointrix
Added 3DGS tutorial by the original authors
Added GauStudio
Added 23 papers: Touch-GS, GGRt, FDGaussian, SWAG, Den-SOFT, Gaussian-Flow, View-Consistent 3D Editing, BAGS, GeoGaussian, GS-Pose, Analytic-Splatting, Seamless 3D Maps, Texture-GS, Recent Advances in 3DGS, Compact 3DGS for Dense Visual SLAM, BrightDreamer, 3DGS-Reloc, Beyond Uncertainty, Motion-Aware 3DGS, Fed3DGS, GaussNav, 3DGS-Calib, and NEDS-SLAM

March 17, 2024:

Update repo name and link for 3DGS.cpp (originally VulkanSplatting)

March 16, 2024:

SplatTV
Added 6 papers: GaussianGrasper, new splitting algorithm, Controllable Text-to-3D Generation, Spring-Mass 3DGS, Hyper-3DGS, and DreamScene

March 14, 2024:

Added 6 papers: SemGauss, StyleGaussian, Gaussian Splatting in Style, GaussCtrl, GaussianImage, and RAIN-GS

March 8, 2024:

Tutorial: Howto capture images for 3DGS
Added 6 papers: SplattingAvatar, DNGaussian, Radiative Gaussians, BAGS, GSEdit, and ManiGaussian

March 8, 2024:

Added 3DGStream Viewer

March 6, 2024:

1 paper added: Splat-Nav

March 5, 2024:

1 paper added: 3DGStream
Code releases
New viewer added

March 2, 2024:

1 paper added: 3D Gaussian Model for Animation and Texturing
New section: Courses that also teach 3DGS.

February 28, 2024:

VastGaussian

February 27, 2024:

2 papers added: Spec-Gaussian and GEA
SC-GS code released

February 24, 2024:

2 papers added: Identifying unnecessary Gaussians and Gaussian Pro

February 23, 2024:

Corrected Authors and updated abstract for EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting

February 21, 2024:

Added one paper: Reshaping SLAM: a Survey

February 20, 2024:

GaussianObject code released
Added one paper: GaussianHair

February 19, 2024:

Blog post added: NeRFs vs. 3DGS.

February 16, 2024:

2 papers added: IM-3D and GES
GaMeS code released

February 14, 2024:

Added viewer: VulkanSplatting - cross-platform, high performance 3DGS renderer in C++ and Vulkan Compute

February 13, 2024:

Code releases: (16th Jan 2024) Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting
3 papers added: 3DGala, ImplicitDeepFake, and 3D Gaussians as a New Vision Era.

February 9, 2024:

1 paper added: HeadStudio

February 8, 2024:

3 papers added: Rig3DGS, Mesh-based GS, and LGM February 6, 2024:
Added 2 papers: SGS-SLAM and 4D Gaussian Splatting

February 5, 2024:

Moved SWAGS to Dynmatics and Deformation section
Added 2 paper: GaussianObject and GaMeSh
GS++ renamed to Optimal Projection

February 2, 2024:

Added 6 papers: VR-GS, Segment Anything, Gaussian Splashing, GS++, 360-GS, and StopThePop
TRIPS code release

January 30, 2024:

Code changes: GaussianAvatars code changed to private

January 29, 2024:

Added 2 papers: LIV-GaussMap and TIP-Editor

January 26, 2024:

Removed retracted paper: Animatable 3D Gaussians for High-fidelity Synthesis of Human Motions
3 papers added: EndoGaussians, PSAvatar, and GauU-Scene

January 25, 2024:

Added viewer: Splatapult - 3d gaussian splatting renderer in C++ and OpenGL, works with OpenXR for tethered VR

January 24, 2024:

Added utility: GSOPs (Gaussian Splat Operators) for SideFX Houdini
Code releases: GaussianAvatars

January 23, 2024:

3 papers added: Amortized Gen3D, Deformable Endoscopic Tissues, Fast dynamic 3D Object Generation
Code releases: Animatable Avatars, Compressed 3D Gaussians, GaussianAvatar

January 13, 2024:

4 papers added: CoSSegGaussians, TRIPS, Gaussian Shadow Casting for Neural Characters and DISTWAR

January 9, 2024:

1 paper added: A Survey on 3D Gaussian Splatting (The first survey)

January 8, 2024:

4 papers added: SWAGS (added paper from 2023 which I forgot to add before, ), first review paper, compressed 3DGS, and an application paper for Characterizing Satellite Geometry.

January 7, 2024:

1 Open source implementation: taichi-splatting - work is originally derived off Taichi 3D Gaussian Splatting, with significant re-organisation and changes.

January 5, 2024:

3 papers added: FMGS, PEGASUS, and Repaint123.

January 2, 2024:

1 paper added: Street Gaussians.

January 2, 2024:

Deblurring Gaussians paper link updated.
SAGA code released.
2 papers from 2023 added: Text2Immersion and 2D-Guided 3DG Segmentation.
Mathematical supplemend of gsplat lib.
Add years in categories.
GSM code released.

December 29, 2023:

1 paper added (apparently missed that one before): Gaussian-Head-Avatar.
Blog post head avatars added.

December 29, 2023:

3 papers added: DreamGaussian4D, 4DGen, and Spacetime Gaussian.

December 27, 2023:

3 papers added: LangSplat, Deformable 3DGS, and Human101.
Blog post added: Comprehensive Review of 3DGS.

December 25, 2023:

Efficient 3D Gaussian Representation for Monocular/Multi-view Dynamic Scenes code released.
GPS-Gaussian code released.

December 24, 2023:

2 papers added: Self-Organization Gaussian Grids and Gaussian Splitting.
Added repo for enhancing Gaussian rendering to model more complex scenes.

December 21, 2023:

3 papers added: Splatter Image, pixelSplat, and align your gaussians.
Gaussian Grouping code released.

December 19, 2023:

2 papers added: GAvatar and GauFRe.

December 18, 2023:

Added utility: SpectacularAI - Conversion scripts for different 3DGS conventions.
SuGaR code released.

December 16, 2023:

Added WebGL viewer 3: Gauzilla.

December 15, 2023:

4 papers added: DrivingGaussian, iComMa, Triplane, and 3DGS-Avatar.
Relightable Gaussians code released.

December 13, 2023:

5 papers added: Gaussian-SLAM, CoGS, ASH, CF-GS, and Photo-SLAM.

December 11, 2023:

2 papers added: Gaussian Splatting SLAM and Denoising Scores for 3D Generation.
ScaffoldGS code released.

December 8, 2023:

2 papers added: EAGLES and MonoGaussianAvatar.

December 7, 2023:

LucidDreamer code released.
9 papers added: GauHuman, HeadGaS, HiFi4G, Gaussian-Flow, Feature-3DGS, Gaussian-Avatar, FlashAvatar, Relightable, and Deblurring Gaussians.

December 5, 2023:

9 papers added: NeuSG, GaussianHead, GaussianAvatars, GPS-Gaussian, Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction, SplaTAM, MANUS, Segment Any, and Language embedded 3D Gaussians.

December 4, 2023:

8 papers added: Gaussian Grouping, MD Splatting, DynMF, Scaffold-GS, SparseGS, FSGS, Control4D, and SC-GS.

December 1, 2023:

4 papers added: Compact3D, GaussianShader, Periodic Vibration Gaussian and Gaussian Shell Maps for Efficient 3D Human Generation.
Created Table of contents for each category and added line breaks.

November 30, 2023:

Added Unreal game engine implementation.
5 papers added: LightGaussian, FisherRF, HUGS, HumanGaussian, CG3D, and Multi Scale 3DGS.

November 29, 2023:

Added two papers: Point and Move and IR-GS.

November 28, 2023:

Added five papers: GaussinEditor, Relightable Gaussians, GART, Mip-Splatting, HumanGaussian.

November 27, 2023:

Added two papers: Gaussian Editing and Compact 3D Gaussians.

November 25, 2023:

Animatable Gaussians project added (paper not yet released).

November 22, 2023:

3 new GS papers added: Animatable, Depth-Regularized, and Monocular/Multi-view 3DGS.
Added some classic papers.
Added another GS paper also called LucidDreamer.

November 21, 2023:

3 new GS papers added: GaussianDiffusion, LucidDreamer, PhysGaussian.
2 more GS papers added: SuGaR, PhysGaussian.

November 21, 2023:

Added the paper GS-SLAM

November 17, 2023:

Added PlayCanvas implementation to Game Engines section.

November 16, 2023:

Deformable 3D Gaussians code released.
Drivable 3D Gaussian Avatars paper added.

November 8, 2023:

Some notes about the 3DGS implementation and unsive/rsal format discussion.

November 4, 2023:

Added 2D gaussian splatting.
Added very detailed (technical) blog post explaining 3D gaussian splatting.

October 28, 2023:

Added Utilities Section.
Added 3DGS Converter for editing 3DGS .ply files in Cloud Compare to Utilities.
Added Kapture (for bundler to colmap model conversion) and Kapture image cropper script with conversion instructions to Utilities.

October 23, 2023:

Added python WebGL viewer 2.
Added Intro to gaussian splatting (and Unity viewer) video blog.

October 21, 2023:

Added python OpenGL viewer.
Added typescript WebGPU viewer.

October 20, 2023:

Made abstracts readable (removed hyphenations).
Added Windows tutorial.
Other minor text fixes.
Added Jupyter notebook viewer.

October 19, 2023:

Added Github page link for Real-time Photorealistic Dynamic Scene Representation.
Re-ordered headings.
Added other unofficial implementations.
Moved Nerfstudio gsplat and fast: C++/CUDA to Unofficial Implementations.
Added Nerfstudio, Blender, WebRTC, iOS & Metal viewers.

October 17, 2023:

GaussianDreamer code released.
Added Real-time Photorealistic Dynamic Scene Representation.

October 16, 2023:

Added Deformable 3D Gaussians paper.
Dynamic 3D Gaussians code released. October 15, 2023: Initial list with first 6 papers.

Seminal Paper introducing 3D Gaussian Splatting:

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Authors: Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, George Drettakis

Abstract

Radiance Field methods have recently revolutionized novel-view synthesis of scenes captured with multiple photos or videos. However, achieving high visual quality still requires neural networks that are costly to train and render, while recent faster methods inevitably trade off speed for quality. For unbounded and complete scenes (rather than isolated objects) and 1080p resolution rendering, no current method can achieve real-time display rates. We introduce three key elements that allow us to achieve state-of-the-art visual quality while maintaining competitive training times and importantly allow high-quality real-time (≥ 30 fps) novel-view synthesis at 1080p resolution. First, starting from sparse points produced during camera calibration, we represent the scene with 3D Gaussians that preserve desirable properties of continuous volumetric radiance fields for scene optimization while avoiding unnecessary computation in empty space; Second, we perform interleaved optimization/density control of the 3D Gaussians, notably optimizing anisotropic covariance to achieve an accurate representation of the scene; Third, we develop a fast visibility-aware rendering algorithm that supports anisotropic splatting and both accelerates training and allows real-time rendering. We demonstrate state-of-the-art visual quality and real-time rendering on several established datasets.

3D Object Detection

2024

1. 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection

Authors: Yang Cao, Yuanliang Jv, Dan Xu

Abstract

Neural Radiance Fields (NeRF) are widely used for novel-view synthesis and have been adapted for 3D Object Detection (3DOD), offering a promising approach to 3D object detection through view-synthesis representation. However, NeRF faces inherent limitations: (i) It has limited representational capacity for 3DOD due to its implicit nature, and (ii) it suffers from slow rendering speeds. Recently, 3D Gaussian Splatting (3DGS) has emerged as an explicit 3D representation that addresses these limitations with faster rendering capabilities. Inspired by these advantages, this paper introduces 3DGS into 3DOD for the first time, identifying two main challenges: (i) Ambiguous spatial distribution of Gaussian blobs – 3DGS primarily relies on 2D pixel-level supervision, resulting in unclear 3D spatial distribution of Gaussian blobs and poor differentiation between objects and background, which hinders 3DOD; (ii) Excessive background blobs – 2D images often include numerous background pixels, leading to densely reconstructed 3DGS with many noisy Gaussian blobs representing the background, negatively affecting detection. To tackle the challenge (i), we leverage the fact that 3DGS reconstruction is derived from 2D images, and propose an elegant and efficient solution by incorporating 2D Boundary Guidance to significantly enhance the spatial distribution of Gaussian blobs, resulting in clearer differentiation between objects and their background (see Fig. 1). To address the challenge (ii), we propose a Box-Focused Sampling strategy using 2D boxes to generate object probability distribution in 3D spaces, allowing effective probabilistic sampling in 3D to retain more object blobs and reduce noisy background blobs. Benefiting from the proposed Boundary Guidance and Box-Focused Sampling, our final method, 3DGS-DET, achieves significant improvements (+5.6 on [email protected], +3.7 on [email protected]) over our basic pipeline version, without introducing any additional learnable parameters. Furthermore, 3DGS-DET significantly outperforms the state-of-the-art NeRF-based method, NeRF-Det, achieving improvements of +6.6 on [email protected] and +8.1 on [email protected] for the ScanNet dataset, and impressive +31.5 on [email protected] for the ARKITScenes dataset. Codes and models are publicly available at: https://github.com/yangcaoai/3DGS-DET.

📄 Paper | 💻 Code (not yet)

Autonomous Driving:

Despite recent advancements in high-fidelity human reconstruction techniques, the requirements for densely captured images or time-consuming per-instance optimization significantly hinder their applications in broader scenarios. To tackle these issues, we present HumanSplat that predicts the 3D Gaussian Splatting properties of any human from a single input image in a generalizable manner. In particular, HumanSplat comprises a 2D multi-view diffusion model and a latent reconstruction transformer with human structure priors that adeptly integrate geometric priors and semantic features within a unified framework. A hierarchical loss that incorporates human semantic information is further designed to achieve high-fidelity texture modeling and better constrain the estimated multiple views. Comprehensive experiments on standard benchmarks and in-the-wild images demonstrate that HumanSplat surpasses existing state-of-the-art methods in achieving photorealistic novel-view synthesis. Project page: https://humansplat.github.io/.

📄 Paper | 🌐 Project Page

Classic work:

1. A Generalization of Algebraic Surface Drawing

Authors: James F. Blinn

Comment:: First paper rendering 3D gaussians.

Abstract

The mathematical description of three-dimensional surfaces usually falls into one of two classifications: parametric and implicit. An implicit surface is defined to be all points which satisfy some equation F (x, y, z) = 0. This form is ideally suited for image space shaded picture drawing; the pixel coordinates are substituted for x and y, and the equation is solved for z. Algorithms for drawing such objects have been developed primarily for first- and second-order polynomial functions, a subcategory known as algebraic surfaces. This paper presents a new algorithm applicable to other functional forms, in particular to the summation of several Gaussian density distributions. The algorithm was created to model electron density maps of molecular structures, but it can be used for other artistically interesting shapes.

📄 Paper

2. Approximate Differentiable Rendering with Algebraic Surfaces

Authors: Leonid Keselman and Martial Hebert

Comment:: First paper to do differentiable rendering optimization of 3D gaussians.

Abstract

Differentiable renderers provide a direct mathematical link between an object’s 3D representation and images of that object. In this work, we develop an approximate differentiable renderer for a compact, interpretable representation, which we call Fuzzy Metaballs. Our approximate renderer focuses on rendering shapes via depth maps and silhouettes. It sacrifices fidelity for utility, producing fast runtimes and high-quality gradient information that can be used to solve vision tasks. Compared to mesh-based differentiable renderers, our method has forward passes that are 5x faster and backwards passes that are 30x faster. The depth maps and silhouette images generated by our method are smooth and defined everywhere. In our evaluation of differentiable renderers for pose estimation, we show that our method is the only one comparable to classic techniques. In shape from silhouette, our method performs well using only gradient descent and a per-pixel loss, without any surrogate losses or regularization. These reconstructions work well even on natural video sequences with segmentation artifacts.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

3. Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling

Authors: Jan U. Müller, Michael Weinmann, Reinhard Klein

Comment: Builds 2D screen-space gaussians from underlying 3D representations.

Abstract

We propose an efficient and GPU-accelerated sampling framework which enables unbiased gradient approximation for differentiable point cloud rendering based on surface splatting. Our framework models the contribution of a point to the rendered image as a probability distribution. We derive an unbiased approximative gradient for the rendering function within this model. To efficiently evaluate the proposed sample estimate, we introduce a tree-based data-structure which employs multi-pole methods to draw samples in near linear time. Our gradient estimator allows us to avoid regularization required by previous methods, leading to a more faithful shape recovery from images. Furthermore, we validate that these improvements are applicable to real-world applications by refining the camera poses and point cloud obtained from a real-time SLAM system. Finally, employing our framework in a neural rendering setting optimizes both the point cloud and network parameters, highlighting the framework’s ability to enhance data driven approaches.

📄 Paper 💻 Code

4. Generating and Real-Time Rendering of Clouds

Authors: Petr Man

Comment: Splatting of anisotropic gaussians. Basically a non-differentiable implementation of 3DGS.

Abstract

This paper presents a method for generation and real-time rendering of static clouds. Perlin noise function generates three dimensional map of a cloud. We also present a twopass rendering algorithm that performs physically based approximation. In the first preprocessed phase it computes multiple forward scattering. In the second phase first order anisotropic scattering at runtime is evaluated. The generated map is stored as voxels and is unsuitable for the real-time rendering. We introduce a more suitable inner representation of cloud that approximates the original map and contains much less information. The cloud is then represented by a set of metaballs (spheres) with parameters such as center positions, radii and density values. The main contribution of this paper is to propose a method, that transforms the original cloud map to the inner representation. This method uses the Radial Basis Function (RBF) neural network.

📄 Paper

Compression:

3D Gaussian Splatting has recently emerged as a highly promising technique for modeling of static 3D scenes. In contrast to Neural Radiance Fields, it utilizes efficient rasterization allowing for very fast rendering at high-quality. However, the storage size is significantly higher, which hinders practical deployment, e.g. on resource constrained devices. In this paper, we introduce a compact scene representation organizing the parameters of 3D Gaussian Splatting (3DGS) into a 2D grid with local homogeneity, ensuring a drastic reduction in storage requirements without compromising visual quality during rendering. Central to our idea is the explicit exploitation of perceptual redundancies present in natural scenes. In essence, the inherent nature of a scene allows for numerous permutations of Gaussian parameters to equivalently represent it. To this end, we propose a novel highly parallel algorithm that regularly arranges the high-dimensional Gaussian parameters into a 2D grid while preserving their neighborhood structure. During training, we further enforce local smoothness between the sorted parameters in the grid. The uncompressed Gaussians use the same structure as 3DGS, ensuring a seamless integration with established renderers. Our method achieves a reduction factor of 17x to 42x in size for complex scenes with no increase in training time, marking a substantial leap forward in the domain of 3D scene distribution and consumption.

📄 Paper | 🌐 Project Page | 💻 Code

Diffusion:

2024:

1. AGG: Amortized Generative 3D Gaussians for Single Image to 3D

Authors: Dejia Xu, Ye Yuan, Morteza Mardani, Sifei Liu, Jiaming Song, Zhangyang Wang, Arash Vahdat

Abstract

Given the growing need for automatic 3D content creation pipelines, various 3D representations have been studied to generate 3D objects from a single image. Due to its superior rendering efficiency, 3D Gaussian splatting-based models have recently excelled in both 3D reconstruction and generation. 3D Gaussian splatting approaches for image to 3D generation are often optimization-based, requiring many computationally expensive score-distillation steps. To overcome these challenges, we introduce an Amortized Generative 3D Gaussian framework (AGG) that instantly produces 3D Gaussians from a single image, eliminating the need for per-instance optimization. Utilizing an intermediate hybrid representation, AGG decomposes the generation of 3D Gaussian locations and other appearance attributes for joint optimization. Moreover, we propose a cascaded pipeline that first generates a coarse representation of the 3D data and later upsamples it with a 3D Gaussian super-resolution module. Our method is evaluated against existing optimization-based 3D Gaussian frameworks and sampling-based pipelines utilizing other 3D representations, where AGG showcases competitive generation abilities both qualitatively and quantitatively while being several orders of magnitude faster.

📄 Paper | 🌐 Project Page| 🎥 Short Presentation

2. Fast Dynamic 3D Object Generation from a Single-view Video

Authors: Zijie Pan, Zeyu Yang, Xiatian Zhu, Li Zhang

Abstract

Generating dynamic three-dimensional (3D) object from a single-view video is challenging due to the lack of 4D labeled data. Existing methods extend text-to-3D pipelines by transferring off-the-shelf image generation models such as score distillation sampling, but they are slow and expensive to scale (e.g., 150 minutes per object) due to the need for back-propagating the information-limited supervision signals through a large pretrained model. To address this limitation, we propose an efficient video-to-4D object generation framework called Efficient4D. It generates high-quality spacetime-consistent images under different camera views, and then uses them as labeled data to directly train a novel 4D Gaussian splatting model with explicit point cloud geometry, enabling real-time rendering under continuous camera trajectories. Extensive experiments on synthetic and real videos show that Efficient4D offers a remarkable 10-fold increase in speed when compared to prior art alternatives while preserving the same level of innovative view synthesis quality. For example, Efficient4D takes only 14 minutes to model a dynamic object.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

3. GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

Authors: Chen Yang, Sikuang Li, Jiemin Fang, Ruofan Liang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

Abstract

Reconstructing and rendering 3D objects from highly sparse views is of critical importance for promoting applications of 3D vision techniques and improving user experience. However, images from sparse views only contain very limited 3D information, leading to two significant challenges: 1) Difficulty in building multi-view consistency as images for matching are too few; 2) Partially omitted or highly compressed object information as view coverage is insufficient. To tackle these challenges, we propose GaussianObject, a framework to represent and render the 3D object with Gaussian splatting, that achieves high rendering quality with only 4 input images. We first introduce techniques of visual hull and floater elimination which explicitly inject structure priors into the initial optimization process for helping build multi-view consistency, yielding a coarse 3D Gaussian representation. Then we construct a Gaussian repair model based on diffusion models to supplement the omitted object information, where Gaussians are further refined. We design a self-generating strategy to obtain image pairs for training the repair model. Our GaussianObject is evaluated on several challenging datasets, including MipNeRF360, OmniObject3D, and OpenIllumination, achieving strong reconstruction results from only 4 views and significantly outperforming previous state-of-the-art methods.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

Authors: Heng Yu, Chaoyang Wang, Peiye Zhuang, Willi Menapace, Aliaksandr Siarohin, Junli Cao, Laszlo A Jeni, Sergey Tulyakov, Hsin-Ying Lee

Authors: Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Munan Ning, Li Yuan

Abstract

Recent one image to 3D generation methods commonly adopt Score Distillation Sampling (SDS). Despite the impressive results, there are multiple deficiencies including multi-view inconsistency, over-saturated and over-smoothed textures, as well as the slow generation speed. To address these deficiencies, we present Repaint123 to alleviate multi-view bias as well as texture degradation and speed up the generation process. The core idea is to combine the powerful image generation capability of the 2D diffusion model and the texture alignment ability of the repainting strategy for generating high-quality multi-view images with consistency. We further propose visibility-aware adaptive repainting strength for overlap regions to enhance the generated image quality in the repainting process. The generated high-quality and multi-view consistent images enable the use of simple Mean Square Error (MSE) loss for fast 3D content generation. We conduct extensive experiments and show that our method has a superior ability to generate high-quality 3D content with multi-view consistency and fine textures in 2 minutes from scratch.

📄 Paper | 🌐 Project Page | 💻 Code (not yet)

Dynamics and Deformation:

Recently, 3D Gaussian, as an explicit 3D representation method, has demonstrated strong competitiveness over NeRF (Neural Radiance Fields) in terms of expressing complex scenes and training duration. These advantages signal a wide range of applications for 3D Gaussians in 3D understanding and editing. Meanwhile, the segmentation of 3D Gaussians is still in its infancy. The existing segmentation methods are not only cumbersome but also incapable of segmenting multiple objects simultaneously in a short amount of time. In response, this paper introduces a 3D Gaussian segmentation method implemented with 2D segmentation as supervision. This approach uses input 2D segmentation maps to guide the learning of the added 3D Gaussian semantic information, while nearest neighbor clustering and statistical filtering refine the segmentation results. Experiments show that our concise method can achieve comparable performances on mIOU and mAcc for multi-object segmentation as previous single-object segmentation methods.

📄 Paper

Language Embedding:

3D Gaussian Splatting (3DGS) has recently revolutionized radiance field reconstruction, achieving high quality novel view synthesis and fast rendering speed without baking. However, 3DGS fails to accurately represent surfaces due to the multi-view inconsistent nature of 3D Gaussians. We present 2D Gaussian Splatting (2DGS), a novel approach to model and reconstruct geometrically accurate radiance fields from multi-view images. Our key idea is to collapse the 3D volume into a set of 2D oriented planar Gaussian disks. Unlike 3D Gaussians, 2D Gaussians provide view-consistent geometry while modeling surfaces intrinsically. To accurately recover thin surfaces and achieve stable optimization, we introduce a perspective-accurate 2D splatting process utilizing ray-splat intersection and rasterization. Additionally, we incorporate depth distortion and normal consistency terms to further enhance the quality of the reconstructions. We demonstrate that our differentiable renderer allows for noise-free and detailed geometry reconstruction while maintaining competitive appearance quality, fast training speed, and real-time rendering.

1. [CVPR '24] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics

Authors: Tianyi Xie, Zeshun Zong, Yuxin Qiu, Xuan Li, Yutao Feng, Yin Yang, Chenfanfu Jiang

Abstract

We introduce PhysGaussian, a new method that seamlessly integrates physically grounded Newtonian dynamics within 3D Gaussians to achieve high-quality novel motion synthesis. Employing a custom Material Point Method (MPM), our approach enriches 3D Gaussian kernels with physically meaningful kinematic deformation and mechanical stress attributes, all evolved in line with continuum mechanics principles. A defining characteristic of our method is the seamless integration between physical simulation and visual rendering: both components utilize the same 3D Gaussian kernels as their discrete representations. This negates the necessity for triangle/tetrahedron meshing, marching cubes, "cage meshes," or any other geometry embedding, highlighting the principle of "what you see is what you simulate (WS2)." Our method demonstrates exceptional versatility across a wide variety of materials--including elastic entities, metals, non-Newtonian fluids, and granular materials--showcasing its strong capabilities in creating diverse visual content with novel viewpoints and movements.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

2. [CVPR '24] SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

Authors: Antoine Guédon, Vincent Lepetit

Abstract

We propose a method to allow precise and extremely fast mesh extraction from 3D Gaussian Splatting. Gaussian Splatting has recently become very popular as it yields realistic rendering while being significantly faster to train than NeRFs. It is however challenging to extract a mesh from the millions of tiny 3D gaussians as these gaussians tend to be unorganized after optimization and no method has been proposed so far. Our first key contribution is a regularization term that encourages the gaussians to align well with the surface of the scene. We then introduce a method that exploits this alignment to sample points on the real surface of the scene and extract a mesh from the Gaussians using Poisson reconstruction, which is fast, scalable, and preserves details, in contrast to the Marching Cubes algorithm usually applied to extract meshes from Neural SDFs. Finally, we introduce an optional refinement strategy that binds gaussians to the surface of the mesh, and jointly optimizes these Gaussians and the mesh through Gaussian splatting rendering. This enables easy editing, sculpting, rigging, animating, compositing and relighting of the Gaussians using traditional softwares by manipulating the mesh instead of the gaussians themselves. Retrieving such an editable mesh for realistic rendering is done within minutes with our method, compared to hours with the state-of-the-art methods on neural SDFs, while providing a better rendering quality.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

3. NeuSG: Neural Implicit Surface Reconstruction with 3D Gaussian Splatting Guidance

Authors: Hanlin Chen, Chen Li, Gim Hee Lee

Abstract

Existing neural implicit surface reconstruction methods have achieved impressive performance in multi-view 3D reconstruction by leveraging explicit geometry priors such as depth maps or point clouds as regularization. However, the reconstruction results still lack fine details because of the over-smoothed depth map or sparse point cloud. In this work, we propose a neural implicit surface reconstruction pipeline with guidance from 3D Gaussian Splatting to recover highly detailed surfaces. The advantage of 3D Gaussian Splatting is that it can generate dense point clouds with detailed structure. Nonetheless, a naive adoption of 3D Gaussian Splatting can fail since the generated points are the centers of 3D Gaussians that do not necessarily lie on the surface. We thus introduce a scale regularizer to pull the centers close to the surface by enforcing the 3D Gaussians to be extremely thin. Moreover, we propose to refine the point cloud from 3D Gaussians Splatting with the normal priors from the surface predicted by neural implicit models instead of using a fixed set of points as guidance. Consequently, the quality of surface reconstruction improves from the guidance of the more accurate 3D Gaussian splatting. By jointly optimizing the 3D Gaussian Splatting and the neural implicit model, our approach benefits from both representations and generates complete surfaces with intricate details. Experiments on Tanks and Temples verify the effectiveness of our proposed method.

📄 Paper

Misc:

In this paper, we address the limitations of Adaptive Density Control (ADC) in 3D Gaussian Splatting (3DGS), a scene representation method achieving high-quality, photorealistic results for novel view synthesis. ADC has been introduced for automatic 3D point primitive management, controlling densification and pruning, however, with certain limitations in the densification logic. Our main contribution is a more principled, pixel-error driven formulation for density control in 3DGS, leveraging an auxiliary, per-pixel error function as the criterion for densification. We further introduce a mechanism to control the total number of primitives generated per scene and correct a bias in the current opacity handling strategy of ADC during cloning operations. Our approach leads to consistent quality improvements across a variety of benchmark scenes, without sacrificing the method's efficiency.

📄 Paper

2023:

1. [CVPRW '24] Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images

Authors: Jaeyoung Chung, Jeongtaek Oh, Kyoung Mu Lee

Abstract

In this paper, we present a method to optimize Gaussian splatting with a limited number of images while avoiding overfitting. Representing a 3D scene by combining numerous Gaussian splats has yielded outstanding visual quality. However, it tends to overfit the training views when only a small number of images are available. To address this issue, we introduce a dense depth map as a geometry guide to mitigate overfitting. We obtained the depth map using a pre-trained monocular depth estimation model and aligning the scale and offset using sparse COLMAP feature points. The adjusted depth aids in the color-based optimization of 3D Gaussian splatting, mitigating floating artifacts, and ensuring adherence to geometric constraints. We verify the proposed method on the NeRF-LLFF dataset with varying numbers of few images. Our approach demonstrates robust geometry compared to the original method that relies solely on images.

📄 Paper | 🌐 Project Page | 💻 Code

2. EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS

Authors: Sharath Girish, Kamal Gupta, Abhinav Shrivastava

Abstract

Recently, 3D Gaussian splatting (3D-GS) has gained popularity in novel-view scene synthesis. It addresses the challenges of lengthy training times and slow rendering speeds associated with Neural Radiance Fields (NeRFs). Through rapid, differentiable rasterization of 3D Gaussians, 3D-GS achieves real-time rendering and accelerated training. They, however, demand substantial memory resources for both training and storage, as they require millions of Gaussians in their point cloud representation for each scene. We present a technique utilizing quantized embeddings to significantly reduce memory storage requirements and a coarse-to-fine training strategy for a faster and more stable optimization of the Gaussian point clouds. Our approach results in scene representations with fewer Gaussians and quantized representations, leading to faster training times and rendering speeds for real-time rendering of high resolution scenes. We reduce memory by more than an order of magnitude all while maintaining the reconstruction quality. We validate the effectiveness of our approach on a variety of datasets and scenes preserving the visual quality while consuming 10-20x less memory and faster training/inference speed.

📄 Paper | 🌐 Project Page | 💻 Code

3. [CVPR '24] COLMAP-Free 3D Gaussian Splatting

Authors: Yang Fu, Sifei Liu, Amey Kulkarni, Jan Kautz, Alexei A. Efros, Xiaolong Wang

Abstract

While neural rendering has led to impressive advances in scene reconstruction and novel view synthesis, it relies heavily on accurately pre-computed camera poses. To relax this constraint, multiple efforts have been made to train Neural Radiance Fields (NeRFs) without pre-processed camera poses. However, the implicit representations of NeRFs provide extra challenges to optimize the 3D structure and camera poses at the same time. On the other hand, the recently proposed 3D Gaussian Splatting provides new opportunities given its explicit point cloud representations. This paper leverages both the explicit geometric representation and the continuity of the input video stream to perform novel view synthesis without any SfM preprocessing. We process the input frames in a sequential manner and progressively grow the 3D Gaussians set by taking one input frame at a time, without the need to pre-compute the camera poses. Our method significantly improves over previous approaches in view synthesis and camera pose estimation under large motion changes.

📄 Paper | 🌐 Project Page | 💻 Code (not yet) | 🎥 Short Presentation

4. iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching

Authors: Yuan Sun, Xuan Wang, Yunfan Zhang, Jie Zhang, Caigui Jiang, Yu Guo, Fei Wang

Abstract

We present a method named iComMa to address the 6D pose estimation problem in computer vision. The conventional pose estimation methods typically rely on the target's CAD model or necessitate specific network training tailored to particular object classes. Some existing methods address mesh-free 6D pose estimation by employing the inversion of a Neural Radiance Field (NeRF), aiming to overcome the aforementioned constraints. However, it still suffers from adverse initializations. By contrast, we model the pose estimation as the problem of inverting the 3D Gaussian Splatting (3DGS) with both the comparing and matching loss. In detail, a render-and-compare strategy is adopted for the precise estimation of poses. Additionally, a matching module is designed to enhance the model's robustness against adverse initializations by minimizing the distances between 2D keypoints. This framework systematically incorporates the distinctive characteristics and inherent rationale of render-and-compare and matching-based approaches. This comprehensive consideration equips the framework to effectively address a broader range of intricate and challenging scenarios, including instances with substantial angular deviations, all while maintaining a high level of prediction accuracy. Experimental results demonstrate the superior precision and robustness of our proposed jointly optimized framework when evaluated on synthetic and complex real-world data in challenging scenarios.

📄 Paper | 💻 Code

Rendering:

Neural Radiance Fields (NeRFs) have demonstrated the remarkable potential of neural networks to capture the intricacies of 3D objects. By encoding the shape and color information within neural network weights, NeRFs excel at producing strikingly sharp novel views of 3D objects. Recently, numerous generalizations of NeRFs utilizing generative models have emerged, expanding its versatility. In contrast, Gaussian Splatting (GS) offers a similar renders quality with faster training and inference as it does not need neural networks to work. We encode information about the 3D objects in the set of Gaussian distributions that can be rendered in 3D similarly to classical meshes. Unfortunately, GS are difficult to condition since they usually require circa hundred thousand Gaussian components. To mitigate the caveats of both models, we propose a hybrid model that uses GS representation of the 3D object's shape and NeRF-based encoding of color and opacity. Our model uses Gaussian distributions with trainable positions (i.e. means of Gaussian), shape (i.e. covariance of Gaussian), color and opacity, and neural network, which takes parameters of Gaussian and viewing direction to produce changes in color and opacity. Consequently, our model better describes shadows, light reflections, and transparency of 3D objects.

📄 Paper | 💻 Code

Reviews:

📄 Paper

SLAM:

The integration of neural rendering and the SLAM system recently showed promising results in joint localization and photorealistic view reconstruction. However, existing methods, fully relying on implicit representations, are so resource-hungry that they cannot run on portable devices, which deviates from the original intention of SLAM. In this paper, we present Photo-SLAM, a novel SLAM framework with a hyper primitives map. Specifically, we simultaneously exploit explicit geometric features for localization and learn implicit photometric features to represent the texture information of the observed environment. In addition to actively densifying hyper primitives based on geometric features, we further introduce a Gaussian-Pyramid-based training method to progressively learn multi-level features, enhancing photorealistic mapping performance. The extensive experiments with monocular, stereo, and RGB-D datasets prove that our proposed system Photo-SLAM significantly outperforms current state-of-the-art SLAM systems for online photorealistic mapping, e.g., PSNR is 30% higher and rendering speed is hundreds of times faster in the Replica dataset. Moreover, the Photo-SLAM can run at real-time speed using an embedded platform such as Jetson AGX Orin, showing the potential of robotics applications.

📄 Paper | 🌐 Project Page | 💻 Code

Sparse:

We introduce the Splatter Image, an ultra-fast approach for monocular 3D object reconstruction which operates at 38 FPS. Splatter Image is based on Gaussian Splatting, which has recently brought real-time rendering, fast training, and excellent scaling to multi-view reconstruction. For the first time, we apply Gaussian Splatting in a monocular reconstruction setting. Our approach is learning-based, and, at test time, reconstruction only requires the feed-forward evaluation of a neural network. The main innovation of Splatter Image is the surprisingly straightforward design: it uses a 2D image-to-image network to map the input image to one 3D Gaussian per pixel. The resulting Gaussians thus have the form of an image, the Splatter Image. We further extend the method to incorporate more than one image as input, which we do by adding cross-view attention. Owning to the speed of the renderer (588 FPS), we can use a single GPU for training while generating entire images at each iteration in order to optimize perceptual metrics like LPIPS. On standard benchmarks, we demonstrate not only fast reconstruction but also better results than recent and much more expensive baselines in terms of PSNR, LPIPS, and other metrics.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

Navigation:

License

MrNeRF/awesome-3D-gaussian-splatting

Folders and files

Latest commit

History

Repository files navigation

Awesome 3D Gaussian Splatting Resources

Table of contents

Seminal Paper introducing 3D Gaussian Splatting:

3D Gaussian Splatting for Real-Time Radiance Field Rendering

3D Object Detection

2024

1. 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection

Autonomous Driving:

2024:

1. Street Gaussians for Modeling Dynamic Urban Scenes

2. TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes

3. OmniRe: Omni Urban Scene Reconstruction

2023:

1. [CVPR '24] DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes

2. [CVPR '24] HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting

Avatars:

2024:

1. GaussianBody: Clothed Human Reconstruction via 3d Gaussian Splatting

2. PSAvatar: A Point-based Morphable Shape Model for Real-Time Head Avatar Creation with 3D Gaussian Splatting

3. Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos

4. HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

5. ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting

6. GaussianHair: Hair Modeling and Rendering with Light-aware Gaussians

7. GVA: Reconstructing Vivid 3D Gaussian Avatars from Monocular Videos

8. [CVPR '24] SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting

9. SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface

10. HAHA: Highly Articulated Gaussian Human Avatars with Textured Mesh Prior

11. [CVPRW '24] Gaussian Splatting Decoder for 3D‑aware Generative Adversarial Networks

12. GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh

13. OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering

14. [CVPR '24] Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

15. [NeurIPS '24] Generalizable and Animatable Gaussian Head Avatar

16. [SIGGRAPH Asia'24] DualGS: Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos

17. [SIGGRAPH Asia'24] V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians

2023:

1. Drivable 3D Gaussian Avatars

2. SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos

3. [CVPR '24] Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling

4. [CVPR '24] GART: Gaussian Articulated Template Models

5. [CVPR '24] Human Gaussian Splatting: Real-time Rendering of Animatable Avatars

6. [CVPR '24] HUGS: Human Gaussian Splats

7. [CVPR '24] Gaussian Shell Maps for Efficient 3D Human Generation

8. GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation

9. [CVPR '24] GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians

10. [CVPR '24] GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis

11. GauHuman: Articulated Gaussian Splatting from Monocular Human Videos

12. HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting

13. [CVPR '24] HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting

14. [CVPR '24] GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians

15. [CVPR '24] FlashAvatar: High-fidelity Head Avatar with Efficient Gaussian Embedding

16. [CVPR '24] Relightable Gaussian Codec Avatars

17. MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar

18. [CVPR '24] ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering

19. [CVPR '24] 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting

20. [CVPR '24] GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning

21. Deformable 3D Gaussian Splatting for Animatable Human Avatars

22. Human101: Training 100+FPS Human Gaussians in 100s from 1 View

23. [CVPR '24] Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

24. HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors

Classic work:

1. A Generalization of Algebraic Surface Drawing

2. Approximate Differentiable Rendering with Algebraic Surfaces

3. Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling

4. Generating and Real-Time Rendering of Clouds

Compression:

2024:

1. [I3D '24] Reducing the Memory Footprint of 3D Gaussian Splatting

2. [CVPR '24] Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis

3. HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

4. [ECCV '24] End-to-End Rate-Distortion Optimized 3D Gaussian Representation

5. 3DGS.zip: A survey on 3D Gaussian Splatting Compression Methods

6. LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming

7. Implicit Gaussian Splatting with Efficient Multi-Level Tri-Plane Representation

2023: