- Download the dataset
- Pre-process the dataset using the pre-processing code in SplaTAM to generate undistorted DSLR depth.
- Place it in
./data/scannetpp
NOTE: ScanNet++ is a great dataset for debugging and testing purposes.
- Download the dataset
- Extract and organize the dataset using the pre-processing script in SimpleRecon
- Place it in
./data/scannet
- Download the dataset
- Render 5-frame videos as in CroCo. You may want to read the instructions.
- Place it in
./data/habitat_5frame
NOTE: We render the 5-frame videos using a minimum covisibility of 0.1. This can improve the rendering speed, but the generated data may not be optimal for training Spann3R.
- Download the dataset
- Place it in
./data/arkit_lowres
NOTE: Due to limited storage, we use low-resolution input to supervise Spann3R. Ideally, you would use a higher resolution, i.e. vga_wide, as in DUSt3R, for training.
NOTE: For Co3D, we use two sampling strategies to train our model: one is the same as in DUSt3R, and the other is our own sampling strategy, as used for the other datasets that contain videos.
- Download the dataset
- Place it in
./data/blendmvg
- Download the dataset. You may want to use the code in SimpleRecon to download the data.
- Use the pre-processing code in SimpleRecon to generate pseudo ground-truth depth
- Place it in
./data/7scenes
- Download the dataset
- Place it in
./data/neural_rgbd
- Download the dataset. Note that we render the depth as in MVSNet and use our own mask annotations for evaluation. You can download our pre-processed DTU data, which contains the rendered depth maps for evaluation, here.
- Place it in
./data/dtu_test
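
Once the steps above are done, it can be handy to confirm that every folder ended up where it is expected. The following is a minimal sketch, not part of the repository: the helper `find_missing_datasets` and the list `EXPECTED_DIRS` are hypothetical names, and the folder names are simply copied from the paths listed above (no path is given above for Co3D, so it is left out). It assumes the script is run from the project root so that `./data` resolves correctly.

```python
from pathlib import Path

# Folder names copied from the "Place it in" steps above.
# Co3D is omitted because no target path is listed for it.
EXPECTED_DIRS = [
    "scannetpp",
    "scannet",
    "habitat_5frame",
    "arkit_lowres",
    "blendmvg",
    "7scenes",
    "neural_rgbd",
    "dtu_test",
]


def find_missing_datasets(data_root="./data", expected=EXPECTED_DIRS):
    """Return the expected dataset folders that are absent or empty.

    Hypothetical helper: it only checks that each folder exists and
    contains at least one entry, not that the contents are valid.
    """
    root = Path(data_root)
    missing = []
    for name in expected:
        folder = root / name
        if not folder.is_dir() or not any(folder.iterdir()):
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = find_missing_datasets()
    if missing:
        print("Missing or empty:", ", ".join(missing))
    else:
        print("All expected dataset folders are present.")
```

If you only prepared a subset of the datasets, pass your own list as `expected` so that the folders you intentionally skipped are not reported.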