Natural-Disaster-Image-Generation-to-raise-Environmental-Awareness

Install

Clone git repo and submodules by running

git clone --recursive [email protected]:gxxu-ml/Natural-Disaster-Image-Generation-to-raise-Environmental-Awareness.git

For already clone repos, fetch the submodules by running

cd Natural-Disaster-Image-Generation-to-raise-Environmental-Awareness
git submodule update --init --recursive

Setup conda environment with

pip install git+https://github.com/borisdayma/dalle-mini.git
conda env create -f env.yaml -n disaster-t2i
conda activate disaster-t2i

Data

UCI-5k data

This dataset contains 5k image-text pairs multimodal-deep-learning-for-disaster-response; GITHUB; Dataset Download Link

Download

cd ./data/multimodal/
./download.sh

Custom 40k data

Download

cd ./data/data40k/
./download.sh

GLIDE

Train

UCI-5k data

cd ./src/
CUDA_VISIBLE_DEVICES=0,1,2,3 OPENAI_LOGDIR=[LOG_DIR] mpiexec -n 4 python main_glide.py --data_dir  ../data/multimodal

Custom data 50k

cd ./src/
CUDA_VISIBLE_DEVICES=0,1,2,3 OPENAI_LOGDIR=[LOG_DIR] mpiexec -n 4 python main_glide.py

Inference

cd ./src/
CUDA_VISIBLE_DEVICES=0,1,2,3 mpiexec -n 4 python sample_glide.py --ckpt [runname] --ckpt_dir [checkpoint parent directory] --save_dir [save results parent directory]

Dalle-mini

Instructions for Dalle-mini training:

If you want to finetune on the UCI-5k dataset, the encoded data is already available after cloning the repo;
If you want to finetune on our custom 40k dataset, you may need to download the data via this link: ;

cd dalle-mini-custom 
curl -L https://ucla.box.com/shared/static/szt6wcypjlqhj8d8885la5bd2jn50k8h --output aug_data.zip
unzip aug_data.zip -d data

Encode the downloaded raw data, and output to a `encode_output` directory

You may need to adjust GPU settings, the default is using all available gpus, and on a batch-size of 128 There will be a output dir created at path dalle-mini-custom/tools/src/encoded_output

cd dalle-mini-custom/tools/src
python encode_dataset_dallemini.py

Training & Finetuning DALLE-mini

To directly train on 5k UCI dataset using default params

cd dalle-mini-custom/tools/train

CUDA_VISIBLE_DEVICES=1,2,3,4,5,6,7 nohup python train.py \
    --model_name_or_path dalle-mini/dalle-mini/model-1reghx5l:latest \
    --tokenizer_name dalle-mini/dalle-mini/model-1reghx5l:latest \
    --dataset_repo_or_path ../src/encoded_data \
    --warmup_steps 1\
    --streaming True \
    --learning_rate 0.00005\
    --num_train_epochs 3\
    --do_train True \
    --do_eval True \
    --output_dir ../aug_finetuned_model1_lr5_5k &

To train on 40k Custom dataset using default params

Follow the Dataset section to properly download and encode the dataset
Suppose with the encoded data at dalle-mini-custom/tools/src/encoded_output1
If you want to run validation along with training, split the .parquet files at encoded_output1 into two subfolders: train and validation, following the format ofencoded_data dir.
Run the following commands

cd dalle-mini-custom/tools/train

CUDA_VISIBLE_DEVICES=1,2,3,4,5,6,7 nohup python train.py \
    --model_name_or_path dalle-mini/dalle-mini/model-1reghx5l:latest \
    --tokenizer_name dalle-mini/dalle-mini/model-1reghx5l:latest \
    --dataset_repo_or_path ../src/encoded_output1 \
    --warmup_steps 1\
    --streaming True \
    --learning_rate 0.0005\
    --num_train_epochs 3\
    --do_train True \
<!--     --do_eval True \ -->
    --output_dir ../aug_finetuned_model1_lr4_adafactor &

Inference using finetuned DALLE-mini model

Run the following command to save the generation of validation prompts, and also reports the clip-score for the validation The validation dir is found under the unzipped data file, that you can download from our provided link.

cd dalle-mini-custom/tools/inference
python inference.py path_to_validation_dir

Note: our code is based on the training script from dalle-mini repo: https://github.com/borisdayma/dalle-mini;

Evaluation

FID Score

Compute FID score between two distributions

python src/fid_score.py /dir/with/images/from/distribution1 /dir/with/images/from/distribution2

--image-size option (default 256) will center crop and resize images from both distributions to specified size

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
dalle-mini @ 2e02683		dalle-mini @ 2e02683
dalle-mini-custom/tools		dalle-mini-custom/tools
data		data
guided-diffusion @ 27c20a8		guided-diffusion @ 27c20a8
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
env.yaml		env.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Natural-Disaster-Image-Generation-to-raise-Environmental-Awareness

Install

Data

UCI-5k data

Download

Custom 40k data

Download

GLIDE

Train

Inference

Dalle-mini

Instructions for Dalle-mini training:

Encode the downloaded raw data, and output to a `encode_output` directory

Training & Finetuning DALLE-mini

To train on 40k Custom dataset using default params

Inference using finetuned DALLE-mini model

Evaluation

FID Score

About

Releases

Packages

Contributors 3

Languages

License

gxxu-ml/Natural-Disaster-Image-Generation-to-raise-Environmental-Awareness

Folders and files

Latest commit

History

Repository files navigation

Natural-Disaster-Image-Generation-to-raise-Environmental-Awareness

Install

Data

UCI-5k data

Download

Custom 40k data

Download

GLIDE

Train

Inference

Dalle-mini

Instructions for Dalle-mini training:

Encode the downloaded raw data, and output to a encode_output directory

Training & Finetuning DALLE-mini

To train on 40k Custom dataset using default params

Inference using finetuned DALLE-mini model

Evaluation

FID Score

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Encode the downloaded raw data, and output to a `encode_output` directory

Packages