Update training data format
wangguanzhi authored Apr 2, 2023
1 parent 112a278 commit e6c06a0
Showing 1 changed file with 16 additions and 0 deletions.
16 changes: 16 additions & 0 deletions README.md
@@ -83,6 +83,22 @@ You can also find two sample env implementations `HuntCowDenseRewardEnv` and `Co

We provide [a superset of 640K video clips](https://drive.google.com/file/d/1cLXC64Cu2EJj2nsb4K0ajl8qqX6l0lKd/view?usp=sharing) that we used for pre-training. You can subsample from this set and extend the start/end timestamps as you like, which is essentially what we did when training MineCLIP.

Format:
```python
# Schematic only: each keyword maps to a list of [video_id, start, end] entries.
{
    "keyword1": [
        [video_id1, start_timestamp1, end_timestamp1],
        [video_id2, start_timestamp2, end_timestamp2],
        [video_id3, start_timestamp3, end_timestamp3],
    ],
    "keyword2": [
        [video_id4, start_timestamp4, end_timestamp4],
        ...
    ],
    ...
}
```
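
As a rough illustration of how this format might be consumed, the sketch below subsamples clips per keyword and widens each clip's time window. It is not the procedure used for MineCLIP: the helper `subsample_clips`, its parameters, the file name in the comment, and the sample values are all hypothetical, and it assumes the file is JSON with numeric timestamps in the same unit as `pad`.

```python
import json
import random


def subsample_clips(index, per_keyword, pad=0.0, seed=0):
    """Pick up to `per_keyword` clips per keyword and widen each by `pad`.

    `index` maps keyword -> list of [video_id, start, end] entries.
    Assumption: start/end are numbers in the same unit as `pad`.
    """
    rng = random.Random(seed)
    result = {}
    for keyword, clips in index.items():
        # Sample without replacement, capped at the number of available clips.
        chosen = rng.sample(clips, min(per_keyword, len(clips)))
        result[keyword] = [
            # Clamp the start at zero so padding never goes negative.
            [video_id, max(0.0, start - pad), end + pad]
            for video_id, start, end in chosen
        ]
    return result


# Tiny made-up example in the documented format.
example_json = """
{
  "cow": [["vid_a", 10.0, 26.0], ["vid_b", 3.0, 19.0], ["vid_c", 40.0, 56.0]],
  "tree": [["vid_d", 0.5, 16.5]]
}
"""
example = json.loads(example_json)
subset = subsample_clips(example, per_keyword=2, pad=1.0)

# For the real download (hypothetical local path, assuming it is JSON):
# with open("clips.json") as f:
#     subset = subsample_clips(json.load(f), per_keyword=100, pad=1.0)
```

Fixing the seed keeps the subsample reproducible across runs; change or drop it if you want a different draw each time.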

# Paper and Citation

Our paper is posted on [arXiv](https://arxiv.org/abs/2206.08853). If you find our work useful, please consider citing us!