Transferring Relative Monocular Depth to Surgical Vision with Temporal Consistency (MICCAI 2024)

This is the official repository for our state-of-the-art approach to monocular depth estimation in surgical vision, as presented in our paper...

Transferring Relative Monocular Depth to Surgical Vision with Temporal Consistency

arXiv

Using Our Models

First, install our package...

pip install git+https://github.com/charliebudd/transferring-relative-monocular-depth-to-surgical-vision

The model may then be used as follows (WEIGHTS_URL.DEPTHANYTHING_SUP_TEMP is the best model):

import matplotlib.pyplot as plt
import torch
from torchvision.io import read_image
from torchvision.transforms.functional import resize

from trmdsv import WEIGHTS_URL, load_model

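# Load the model together with its matching resize and normalisation transforms.
model, resize_for_model, normalise_for_model = load_model(
    model_type="depthanything",
    weights_path=WEIGHTS_URL.DEPTHANYTHING_SUP_TEMP_AUG,
    device="cuda",
)
model.eval()

# Read the image onto the GPU and scale its pixel values to [0, 1].
image = read_image("data/cholec80_sample.png").cuda() / 255.0
original_size = image.shape[-2:]
# Add a batch dimension, then resize and normalise for the model.
image_for_model = normalise_for_model(resize_for_model(image.unsqueeze(0)))

# Run inference without tracking gradients.
with torch.no_grad():
    depth = model(image_for_model)

# Resize the predicted depth back to the original image resolution.
depth = resize(depth, original_size)

# Show the input image and the predicted depth side by side.
plt.subplot(121).axis("off")
plt.imshow(image.cpu().permute(1, 2, 0))
plt.subplot(122).axis("off")
plt.imshow(depth.cpu().permute(1, 2, 0))
plt.show()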
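
Since the model predicts relative depth, the raw output values carry no fixed scale, and rescaling them before plotting can make the result easier to read. The following is a minimal sketch and not part of the `trmdsv` package: `normalise_depth_for_display` is a hypothetical helper that min-max rescales a depth array to [0, 1] with NumPy. It assumes the prediction has already been moved to the CPU and converted to a NumPy array, and the sample values are made up for illustration.

```python
import numpy as np


def normalise_depth_for_display(depth: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Min-max rescale a depth map to [0, 1] (hypothetical helper, for display only)."""
    depth = np.asarray(depth, dtype=np.float32)
    d_min = depth.min()
    d_max = depth.max()
    # Guard against a constant map, which would otherwise divide by zero.
    return (depth - d_min) / max(float(d_max - d_min), eps)


# Stand-in for a model prediction (made-up values, not real model output).
example_depth = np.array([[2.0, 4.0], [6.0, 10.0]])
display_depth = normalise_depth_for_display(example_depth)
print(display_depth)
```

With the usage example above, this could be applied as `plt.imshow(normalise_depth_for_display(depth.cpu().numpy()).squeeze())`, assuming `depth` holds a single prediction. The rescaling only affects how the map is displayed and says nothing about metric distances.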

Recreating Our Results

### awaiting publication ###
