# dataset_creator

Experiments with CLIP-based image search for dataset creation and foundation models for image autolabeling.

## Setup
```bash
cd ./dataset_creator/
# create the conda environment
conda env create -f environment.yml
# install this package
pip install -e .
# the existing scripts can then be run
python scripts/download_data.py
python scripts/select_dataset.py
python scripts/autolabel_dataset.py
```
## Data Selection

```bash
python scripts/select_dataset.py
```
Rough inference times (RTX 3070 laptop):
- CLIP image/text embedding: ~0.06 s/it (~15 it/s)
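For reference, a minimal sketch of the text-to-image search step, assuming the Hugging Face `transformers` CLIP implementation and the `openai/clip-vit-base-patch32` checkpoint (the actual script may use a different model and directory layout):

```python
# Minimal sketch of CLIP text-to-image search: embed all candidate images
# and a text query, then rank images by cosine similarity. The checkpoint
# and the "data/raw" directory are illustrative assumptions.
from pathlib import Path

import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

device = "cuda" if torch.cuda.is_available() else "cpu"
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32").to(device)
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image_paths = sorted(Path("data/raw").glob("*.jpg"))  # hypothetical directory
images = [Image.open(p).convert("RGB") for p in image_paths]

with torch.no_grad():
    # embed all candidate images and L2-normalize
    image_inputs = processor(images=images, return_tensors="pt").to(device)
    image_emb = model.get_image_features(**image_inputs)
    image_emb = image_emb / image_emb.norm(dim=-1, keepdim=True)

    # embed the text query and L2-normalize
    text_inputs = processor(
        text=["a photo of a dog"], return_tensors="pt", padding=True
    ).to(device)
    text_emb = model.get_text_features(**text_inputs)
    text_emb = text_emb / text_emb.norm(dim=-1, keepdim=True)

# cosine similarity between the query and every image, highest first
scores = (image_emb @ text_emb.T).squeeze(-1)
for idx in scores.argsort(descending=True)[:10]:
    print(f"{scores[idx]:.3f}  {image_paths[idx]}")
```

Image-based search works the same way with a query image embedded via `get_image_features` instead of a text prompt, and similarity-based filtering can threshold pairwise image-image scores to drop near-duplicates.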
Selected images:
## Autolabeling

```bash
python scripts/autolabel_dataset.py
```

TODO:
Rough inference times (RTX 3070 laptop):
- DepthAnything: ~0.35 s/it (~2.8 it/s)
- GroundingSAM: ~17 s/it (scales roughly linearly with the number of instances to detect in `class_onthology`)
- CoCa: ~1 s/it
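As an illustration of the depth branch, here is a minimal sketch using the `transformers` depth-estimation pipeline with a Depth Anything checkpoint; the checkpoint name and the input/output paths are assumptions, not necessarily what the script uses:

```python
# Minimal sketch of depth autolabeling with Depth Anything via the
# Hugging Face depth-estimation pipeline. Checkpoint and paths are
# illustrative assumptions.
from pathlib import Path

from PIL import Image
from transformers import pipeline

depth = pipeline("depth-estimation", model="LiheYoung/depth-anything-small-hf")

out_dir = Path("data/labels/depth")  # hypothetical output directory
out_dir.mkdir(parents=True, exist_ok=True)

for path in sorted(Path("data/selected").glob("*.jpg")):  # hypothetical input
    result = depth(Image.open(path).convert("RGB"))
    # the pipeline returns a PIL depth map ("depth") plus the raw
    # tensor ("predicted_depth"); save the rendered map per image
    result["depth"].save(out_dir / f"{path.stem}.png")
```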
Autolabeled images:
## Features

- script to download files from the internet (Pixabay API)
- CLIP-based image directory search
  - image-based search
  - text-based search
  - similarity-based filtering
- GT autolabeling
  - image captions (based on the CoCa model; see the sketch after this list)
  - bounding boxes + instance segmentation (based on Grounded-SAM)
  - depth (based on Depth Anything)
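A minimal captioning sketch with CoCa via `open_clip`, following the usage shown in the open_clip README; the pretrained tag and the image path are assumptions:

```python
# Minimal sketch of CoCa image captioning with open_clip: encode one
# image and autoregressively generate a caption, then strip the
# special tokens from the decoded string.
import open_clip
import torch
from PIL import Image

model, _, transform = open_clip.create_model_and_transforms(
    "coca_ViT-L-14", pretrained="mscoco_finetuned_laion2B-s13B-b90k"
)
model.eval()

image = transform(Image.open("cat.jpg").convert("RGB")).unsqueeze(0)

with torch.no_grad():
    generated = model.generate(image)

caption = (
    open_clip.decode(generated[0])
    .split("<end_of_text>")[0]
    .replace("<start_of_text>", "")
    .strip()
)
print(caption)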
## References

- CLIP: Learning Transferable Visual Models From Natural Language Supervision
- CoCa: Contrastive Captioners are Image-Text Foundation Models
- Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
- Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
- Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data