Skip to content

Commit

Permalink
Merge pull request #352 from cxh0519/main
Browse files Browse the repository at this point in the history
Update Report-v1.2.0.md
  • Loading branch information
LinB203 authored Jul 25, 2024
2 parents f0667d8 + a6eb95f commit adb2a20
Showing 1 changed file with 4 additions and 3 deletions.
7 changes: 4 additions & 3 deletions docs/Report-v1.2.0.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,9 +11,10 @@ Compared to previous video generation models, Open-Sora-Plan v1.2.0 offers the f
### Open-Source Release
We open-source the Open-Sora-Plan to facilitate future development of Video Generation in the community. Code, data, model are made publicly available.
- Code: All training scripts and sample scripts.
- Model: Both Diffusion Model and CasualVideoVAE [here](https://huggingface.co/LanguageBind/Open-Sora-Plan-v1.2.0).
- Model: Both Diffusion Model and CausalVideoVAE [here](https://huggingface.co/LanguageBind/Open-Sora-Plan-v1.2.0).
- Data: Filtered data [here](https://huggingface.co/datasets/LanguageBind/Open-Sora-Plan-v1.2.0).


## Gallery

93×1280×720 Text-to-Video Generation. The video quality has been compressed for playback on GitHub.
Expand All @@ -26,7 +27,7 @@ We open-source the Open-Sora-Plan to facilitate future development of Video Gene

## Detailed Technical Report

### CasualVideoVAE
### CausalVideoVAE

#### Model Structure

Expand Down Expand Up @@ -162,7 +163,7 @@ Coming soon...

## Future Work and Discussion

#### CasualVideoVAE
#### CausalVideoVAE
We observed that high-frequency motion information in videos tends to exhibit jitter, and increasing training duration and data volume does not significantly alleviate this issue. In videos, compressing the duration while maintaining the original latent dimension can lead to significant information loss. A more robust VAE will be released in the next version.

#### Diffusion Model
Expand Down

0 comments on commit adb2a20

Please sign in to comment.