Shruti-Bansal/Dual-Attention-Network-for-Scene-Segmentation

An autonomous vehicle scene segmentation network that can work with multiple modalities such as camera, LiDAR, and RADAR.

Solution Formulation

  1. Recreate the DANet implementation on RGB images of the Cityscapes dataset.
  2. Explore publicly available autonomous-driving datasets that provide semantic segmentation ground truth for RGB images along with LiDAR/depth data.
    • KITTI was identified as a good fit.
  3. Run DANet on the KITTI semantic segmentation dataset.
  4. Obtain LiDAR data from KITTI-360 along with the corresponding segmentation annotations.
  5. Create a custom network (a simplified version of DANet) that is flexible about its input data.
  6. Customize this network for segmentation on RGB images combined with depth data; one possible fusion strategy is sketched after this list.
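
One simple way to make a DANet-style network accept RGB plus depth is early fusion: concatenate depth as a fourth input channel before the stem. The sketch below is hypothetical; the class name, channel counts, and fusion strategy are illustrative, not the repository's actual design.

import torch
import torch.nn as nn

class RGBDInput(nn.Module):
    """Early-fusion stem: concatenate RGB and depth into a 4-channel input.

    Hypothetical illustration of one way to keep the input flexible; the
    repository's custom network may fuse the modalities differently.
    """
    def __init__(self, out_channels=64):
        super().__init__()
        self.stem = nn.Conv2d(4, out_channels, kernel_size=7, stride=2, padding=3)

    def forward(self, rgb, depth):
        # rgb: (B, 3, H, W), depth: (B, 1, H, W)
        x = torch.cat([rgb, depth], dim=1)  # (B, 4, H, W)
        return self.stem(x)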

This repository is adapted from a PyTorch implementation of DANet based on the CVPR 2019 paper Dual Attention Network for Scene Segmentation.
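
DANet fuses two self-attention branches on top of a dilated ResNet backbone: a position attention module (PAM) that lets every pixel attend to every other pixel, and a channel attention module (CAM) that models inter-channel dependencies. Below is a minimal sketch of those two modules following the paper; the reduction factor and the plain softmax in CAM are simplifications, not this repository's exact code.

import torch
import torch.nn as nn

class PositionAttention(nn.Module):
    """Position attention module (PAM): each spatial location attends to all others."""
    def __init__(self, channels, reduction=8):  # reduction factor is illustrative
        super().__init__()
        self.query = nn.Conv2d(channels, channels // reduction, kernel_size=1)
        self.key = nn.Conv2d(channels, channels // reduction, kernel_size=1)
        self.value = nn.Conv2d(channels, channels, kernel_size=1)
        self.gamma = nn.Parameter(torch.zeros(1))  # learned residual weight

    def forward(self, x):
        b, c, h, w = x.shape
        q = self.query(x).flatten(2).transpose(1, 2)   # (B, HW, C')
        k = self.key(x).flatten(2)                     # (B, C', HW)
        attn = torch.softmax(q @ k, dim=-1)            # (B, HW, HW)
        v = self.value(x).flatten(2)                   # (B, C, HW)
        out = (v @ attn.transpose(1, 2)).view(b, c, h, w)
        return self.gamma * out + x

class ChannelAttention(nn.Module):
    """Channel attention module (CAM): channel maps attend to each other."""
    def __init__(self):
        super().__init__()
        self.gamma = nn.Parameter(torch.zeros(1))

    def forward(self, x):
        b, c, h, w = x.shape
        feat = x.flatten(2)                                        # (B, C, HW)
        attn = torch.softmax(feat @ feat.transpose(1, 2), dim=-1)  # (B, C, C)
        out = (attn @ feat).view(b, c, h, w)
        return self.gamma * out + x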

Requirements

  • pytorch, torchvision
conda install pytorch torchvision cudatoolkit=10.1 -c pytorch
  • opencv
pip install opencv-python
  • tensorboard
pip install tensorboard
  • pycocotools
pip install git+https://github.com/cocodataset/cocoapi.git#subdirectory=PythonAPI
  • fvcore
pip install git+https://github.com/facebookresearch/fvcore
  • cityscapesScripts
pip install git+https://github.com/mcordts/cityscapesScripts.git
  • detectron2
pip install git+https://github.com/facebookresearch/detectron2.git@master
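
After installing, a quick sanity check such as the following (illustrative, not part of the repository) confirms that the core dependencies import and that CUDA is visible:

import torch, torchvision, cv2, detectron2

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("torchvision:", torchvision.__version__)
print("opencv:", cv2.__version__)
print("detectron2:", detectron2.__version__)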

Datasets

Detectron2 natively supports a few datasets; these are assumed to exist in a directory called datasets/ under the directory from which you launch the program, with the following structure:

Expected dataset structure for Cityscapes:

cityscapes/
  gtFine/
    train/
      aachen/
        color.png, instanceIds.png, labelIds.png, polygons.json,
        labelTrainIds.png
      ...
    val/
    test/
  leftImg8bit/
    train/
    val/
    test/

Run ./datasets/prepare_cityscapes.py to create labelTrainIds.png.
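
With the directory laid out as above and labelTrainIds.png generated, detectron2's built-in Cityscapes registration should find the data. One way to verify is a quick check like the following, using detectron2's built-in name for the fine-annotation semantic segmentation training split (this requires cityscapesScripts to be installed):

from detectron2.data import DatasetCatalog, MetadataCatalog

# Built-in detectron2 name for the Cityscapes fine semantic-segmentation train split.
samples = DatasetCatalog.get("cityscapes_fine_sem_seg_train")
meta = MetadataCatalog.get("cityscapes_fine_sem_seg_train")

print("number of images:", len(samples))
print("first image:", samples[0]["file_name"])
print("classes:", meta.stuff_classes)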

Training

To train a model, run

python train_net.py --config-file <config.yaml>

For example, to launch end-to-end DANet training with a ResNet-50 backbone on 8 GPUs, run:

python train_net.py --config-file configs/r50.yaml --num-gpus 8
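
train_net.py follows the standard detectron2 training-script pattern. A condensed sketch of that pattern is shown below; the repository's actual script may register extra datasets or hooks.

from detectron2.config import get_cfg
from detectron2.engine import (DefaultTrainer, default_argument_parser,
                               default_setup, launch)

def setup(args):
    cfg = get_cfg()
    cfg.merge_from_file(args.config_file)  # e.g. configs/r50.yaml
    cfg.merge_from_list(args.opts)         # trailing KEY VALUE overrides
    cfg.freeze()
    default_setup(cfg, args)
    return cfg

def main(args):
    cfg = setup(args)
    trainer = DefaultTrainer(cfg)
    trainer.resume_or_load(resume=args.resume)
    return trainer.train()

if __name__ == "__main__":
    args = default_argument_parser().parse_args()
    launch(main, args.num_gpus, args=(args,))  # spawns one process per GPU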

Evaluation

Model evaluation can be done similarly:

python train_net.py --config-file configs/r50.yaml --num-gpus 8 --eval-only MODEL.WEIGHTS checkpoints/model.pth
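
The trailing MODEL.WEIGHTS checkpoints/model.pth pair is a standard detectron2 config override. In the script pattern sketched above, --eval-only typically short-circuits training like this (setup is the function from the training sketch):

from detectron2.checkpoint import DetectionCheckpointer
from detectron2.engine import DefaultTrainer

def main(args):
    cfg = setup(args)
    if args.eval_only:
        model = DefaultTrainer.build_model(cfg)
        # Loads the checkpoint given via MODEL.WEIGHTS on the command line.
        DetectionCheckpointer(model).load(cfg.MODEL.WEIGHTS)
        return DefaultTrainer.test(cfg, model)
    trainer = DefaultTrainer(cfg)
    trainer.resume_or_load(resume=args.resume)
    return trainer.train()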
