This repository contains our code used in our VLR project, Spring 2022. Please take a look at our accompanying report : https://sites.google.com/andrew.cmu.edu/vlr-ldlp/
Given the above two objects to a human would be able to reason that the wall clock would probably go on the wall whereas the table clock would go on a flat surface like the counter or coffee table.
This problem can be thought of in two ways:
- Given an object to search for what are the most likely places to start looking
- Given a new object, what is the most reasonable location to place it in the scene
We used the COCO dataset and whiteout the bounding boxes for one object at a time then make the model predict the class that goes into the whited out box.
run
train_baseline_classifier.py
after updating dataset paths.
Uncomment the last line in the train_baseline_classifier.py
file, and comment out the second last line.
Kindly visit our website above for details for the object detection performance on RetinaNet and UP-DETR.
*This project was done toward completion of the course Visual Learning and Recognition (16-824) in Spring 2022 at Carnegie Mellon University