Merge pull request #352 from cxh0519/main

Update Report-v1.2.0.md
PKU-YuanGroup · Jul 25, 2024 · adb2a20
2 parents f0667d8 + a6eb95f
Showing 1 changed file with 4 additions and 3 deletions.
diff --git a/docs/Report-v1.2.0.md b/docs/Report-v1.2.0.md
@@ -11,9 +11,10 @@ Compared to previous video generation models, Open-Sora-Plan v1.2.0 offers the f
 ### Open-Source Release
 We open-source the Open-Sora-Plan to facilitate future development of Video Generation in the community. Code, data, model are made publicly available.
 - Code: All training scripts and sample scripts.
-- Model: Both Diffusion Model and CasualVideoVAE [here](https://huggingface.co/LanguageBind/Open-Sora-Plan-v1.2.0).
+- Model: Both Diffusion Model and CausalVideoVAE [here](https://huggingface.co/LanguageBind/Open-Sora-Plan-v1.2.0).
 - Data: Filtered data [here](https://huggingface.co/datasets/LanguageBind/Open-Sora-Plan-v1.2.0).
 
+
 ## Gallery
 
 93×1280×720 Text-to-Video Generation. The video quality has been compressed for playback on GitHub.
@@ -26,7 +27,7 @@ We open-source the Open-Sora-Plan to facilitate future development of Video Gene
 
 ## Detailed Technical Report
 
-### CasualVideoVAE
+### CausalVideoVAE
 
 #### Model Structure
 
@@ -162,7 +163,7 @@ Coming soon...
 
 ## Future Work and Discussion
 
-#### CasualVideoVAE
+#### CausalVideoVAE
 We observed that high-frequency motion information in videos tends to exhibit jitter, and increasing training duration and data volume does not significantly alleviate this issue. In videos, compressing the duration while maintaining the original latent dimension can lead to significant information loss. A more robust VAE will be released in the next version.
 
 #### Diffusion Model