<html>
<head>
<style>
h1 {
border-bottom: 1px solid lightgrey;
margin-bottom: 20px;
font-weight: bold;
}
article {
margin-bottom: 50px;
margin-left: 15px;
}
img {
margin: 20px 0px;
}
footer {
margin-top: 50px;
border-top: 1px outset lightgrey;
/* background-color: mediumorchid; */
}
</style>
</head>
<body style="padding:100px;">
<div class="sixteen columns">
<h1>What is VCTUBE?</h1>
<article>
<p>
VCTUBE is an open-source Python library that automatically generates &lt;audio, text&gt; paired speech data
from a given YouTube video URL.
</p>
</article>
</div>
<br>
<div class="sixteen columns">
<h1>Why Do We Need VCTUBE?</h1>
<article>
<p>
Recent studies have shown that Text-to-Speech (TTS) systems based on deep neural networks (e.g.,
Tacotron and Deep Voice) can generate high-quality, human-like speech. <br>
However, training such a model requires a large amount of speech data: it has been reported that at least
10 hours of &lt;audio, text&gt; paired data are needed to generate high-quality speech. <br>
In practice, collecting and processing that much speech data is challenging.
<br>VCTUBE addresses this problem. YouTube hosts many videos, and many of them have subtitles, so
the audio and its subtitle text can be collected together as paired data.
</p>
<div style="text-align:center; margin-bottom: 10px;">
<img src="https://dsail-skku.github.io/VCTUBE.github.io/VCTUBE.jpg" alt="Architecture of VCTUBE's overall process"><br>
<div>The architecture of VCTUBE's overall process.</div>
</div>
</article>
</div>
<div class="sixteen columns">
<h1>How To Use VCTUBE?</h1>
<article>
<p>
<h4>Requirements for VCTUBE<br></h4>
<ul>
<li>Python &gt;= 3.6</li>
<li>FFmpeg</li>
</ul>
<br>
First, install the VCTUBE library with pip:
<!--
<div style="text-align:left; margin-bottom: 10px;">
<img src="https://dsail-skku.github.io/VCTUBE.github.io/install.PNG" width:300 height:700><br>
<div>Python pip command for installing VCTUBE</div>
</div>
-->
<div class="colorscripter-code" style="color:#010101;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important; position:relative !important;overflow:auto"><table class="colorscripter-code-table" style="margin:0;padding:0;border:none;background-color:#fafafa;border-radius:4px;" cellspacing="0" cellpadding="0"><tr><td style="padding:6px;border-right:2px solid #e5e5e5"><div style="margin:0;padding:0;word-break:normal;text-align:right;color:#666;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important;line-height:130%"><div style="line-height:130%">1</div></div></td><td style="padding:6px 0;text-align:left"><div style="margin:0;padding:0;color:#010101;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important;line-height:130%"><div style="padding:0 6px; white-space:pre; line-height:130%">pip3 install vctube</div></div></td><td style="vertical-align:bottom;padding:0 2px 4px 0"><a href="http://colorscripter.com/info#e" target="_blank" style="text-decoration:none;color:white"><span style="font-size:9px;word-break:normal;background-color:#e5e5e5;color:white;border-radius:10px;padding:1px">cs</span></a></td></tr></table></div>
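<div>
The two requirements listed above can be checked from Python before running VCTUBE. The following is a minimal sketch that uses only the standard library; check_requirements is a hypothetical helper written for this page, not part of VCTUBE, and it assumes FFmpeg is installed as an executable named ffmpeg on the PATH.
</div>

```python
import shutil
import sys

MIN_PYTHON = (3, 6)  # minimum version stated in the requirements list


def check_requirements(version_info=None, which=shutil.which):
    """Return a list of problems; an empty list means both checks passed.

    Hypothetical helper, not part of VCTUBE. The parameters exist only so
    the checks can be exercised without changing the real environment.
    """
    if version_info is None:
        version_info = sys.version_info
    problems = []
    found = tuple(version_info[:2])
    if MIN_PYTHON > found:
        # str.format is used instead of f-strings so that this file still
        # parses on the older interpreters it is meant to detect.
        problems.append(
            "Python {}.{} found, but {}.{} or newer is required".format(
                found[0], found[1], MIN_PYTHON[0], MIN_PYTHON[1]
            )
        )
    # Assumption: FFmpeg is reachable as "ffmpeg" on the PATH.
    if which("ffmpeg") is None:
        problems.append("ffmpeg executable not found on PATH")
    return problems
```

<div>
Calling check_requirements() with no arguments inspects the running interpreter and the current PATH; printing each returned string is enough to see what is missing.
</div>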
<br>
Then run VCTUBE with the following commands:
<div class="colorscripter-code" style="color:#010101;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important; position:relative !important;overflow:auto"><table class="colorscripter-code-table" style="margin:0;padding:0;border:none;background-color:#fafafa;border-radius:4px;" cellspacing="0" cellpadding="0"><tr><td style="padding:6px;border-right:2px solid #e5e5e5"><div style="margin:0;padding:0;word-break:normal;text-align:right;color:#666;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important;line-height:130%"><div style="line-height:130%">1</div><div style="line-height:130%">2</div><div style="line-height:130%">3</div><div style="line-height:130%">4</div><div style="line-height:130%">5</div><div style="line-height:130%">6</div><div style="line-height:130%">7</div><div style="line-height:130%">8</div><div style="line-height:130%">9</div><div style="line-height:130%">10</div></div></td><td style="padding:6px 0;text-align:left"><div style="margin:0;padding:0;color:#010101;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important;line-height:130%"><div style="padding:0 6px; white-space:pre; line-height:130%"><span style="color:#a71d5d">from</span> vctube <span style="color:#a71d5d">import</span> VCtube</div><div style="padding:0 6px; white-space:pre; line-height:130%"> </div><div style="padding:0 6px; white-space:pre; line-height:130%">playlist_name <span style="color:#0086b3"></span><span style="color:#a71d5d">=</span> <span style="color:#63a35c">""</span></div><div style="padding:0 6px; white-space:pre; line-height:130%">playlist_url <span style="color:#0086b3"></span><span style="color:#a71d5d">=</span> <span style="color:#63a35c">""</span></div><div style="padding:0 6px; white-space:pre; line-height:130%">lang <span style="color:#0086b3"></span><span style="color:#a71d5d">=</span> <span style="color:#63a35c">""</span> <span style="color:#999999"># ex) ko, en, fr, de ...</span></div><div 
style="padding:0 6px; white-space:pre; line-height:130%"> </div><div style="padding:0 6px; white-space:pre; line-height:130%">vc <span style="color:#0086b3"></span><span style="color:#a71d5d">=</span> VCtube(playlist_name, playlist_url, lang)</div><div style="padding:0 6px; white-space:pre; line-height:130%">vc.download_audio() <span style="color:#999999">#download audios from youtube</span></div><div style="padding:0 6px; white-space:pre; line-height:130%">vc.download_captions() <span style="color:#999999">#download captions from youtube</span></div><div style="padding:0 6px; white-space:pre; line-height:130%">vc.audio_split() <span style="color:#999999">#split audio with captions</span></div></div></td><td style="vertical-align:bottom;padding:0 2px 4px 0"><a href="http://colorscripter.com/info#e" target="_blank" style="text-decoration:none;color:white"><span style="font-size:9px;word-break:normal;background-color:#e5e5e5;color:white;border-radius:10px;padding:1px">cs</span></a></td></tr></table></div>
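<div>
The three calls above are run in a fixed order, and the last one splits the audio using the captions, so it presumably needs both downloads to have finished. The sketch below wraps that order in a small function that stops at the first failing step. run_pipeline is a hypothetical helper written for this page, not part of VCTUBE; it only relies on the three method names shown above.
</div>

```python
# Method names taken from the usage example above, in the order shown there.
PIPELINE_STEPS = ("download_audio", "download_captions", "audio_split")


def run_pipeline(vc, steps=PIPELINE_STEPS):
    """Call each step on ``vc`` in order and return the names that finished.

    Hypothetical helper, not part of VCTUBE. ``vc`` is expected to be a
    VCtube object created as in the example above; any object exposing the
    same three methods works.
    """
    completed = []
    for name in steps:
        try:
            getattr(vc, name)()
        except Exception as exc:
            # Stop at the first failure instead of running later steps on
            # incomplete data, and report which steps had already finished.
            raise RuntimeError(
                "step {!r} failed after {}".format(name, completed)
            ) from exc
        completed.append(name)
    return completed
```

<div>
With the library installed, run_pipeline(VCtube(playlist_name, playlist_url, lang)) performs the same three calls as the example above.
</div>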
<!--
<div style="text-align:left; margin-bottom: 10px;">
<img src="https://dsail-skku.github.io/VCTUBE.github.io/command.jpg" width:300 height:700><br>
</div>
-->
</p>
</article>
</div>
<div class="sixteen columns">
<h1>VCTUBE Example</h1>
<article>
<li>Settings for VCTUBE</li>
<div class="colorscripter-code" style="color:#010101;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important; position:relative !important;overflow:auto"><table class="colorscripter-code-table" style="margin:0;padding:0;border:none;background-color:#fafafa;border-radius:4px;" cellspacing="0" cellpadding="0"><tr><td style="padding:6px;border-right:2px solid #e5e5e5"><div style="margin:0;padding:0;word-break:normal;text-align:right;color:#666;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important;line-height:130%"><div style="line-height:130%">1</div><div style="line-height:130%">2</div><div style="line-height:130%">3</div><div style="line-height:130%">4</div><div style="line-height:130%">5</div></div></td><td style="padding:6px 0;text-align:left"><div style="margin:0;padding:0;color:#010101;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important;line-height:130%"><div style="padding:0 6px; white-space:pre; line-height:130%"><span style="color:#a71d5d">from</span> vctube <span style="color:#a71d5d">import</span> VCtube</div><div style="padding:0 6px; white-space:pre; line-height:130%">playlist_url <span style="color:#0086b3"></span><span style="color:#a71d5d">=</span> <span style="color:#63a35c">"https://www.youtube.com/watch?v=fj5BcN6Blks"</span></div><div style="padding:0 6px; white-space:pre; line-height:130%">playlist_name<span style="color:#0086b3"></span><span style="color:#a71d5d">=</span><span style="color:#63a35c">"TEST"</span></div><div style="padding:0 6px; white-space:pre; line-height:130%">lang <span style="color:#0086b3"></span><span style="color:#a71d5d">=</span> <span style="color:#63a35c">"en"</span> <span style="color:#999999">#ex) ko, en, fr, de...</span></div><div style="padding:0 6px; white-space:pre; line-height:130%">vc <span style="color:#0086b3"></span><span style="color:#a71d5d">=</span> VCtube(playlist_name, playlist_url, lang)</div></div></td><td 
style="vertical-align:bottom;padding:0 2px 4px 0"><a href="http://colorscripter.com/info#e" target="_blank" style="text-decoration:none;color:white"><span style="font-size:9px;word-break:normal;background-color:#e5e5e5;color:white;border-radius:10px;padding:1px">cs</span></a></td></tr></table></div>
<br><br>
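<div>
In the settings above, the variable is named playlist_url but the value is a link to a single video. The sketch below only reports which kind of YouTube link a string looks like, based on its query parameters (v for a video, list for a playlist). describe_youtube_url is a hypothetical helper written for this page; it is not part of VCTUBE and says nothing about which link forms VCTUBE accepts.
</div>

```python
from urllib.parse import parse_qs, urlparse


def describe_youtube_url(url):
    """Return "playlist", "video", or "unknown" based on the query string.

    Hypothetical helper, not part of VCTUBE. A link that carries both
    parameters (a video opened from within a playlist) counts as "playlist".
    """
    query = parse_qs(urlparse(url).query)
    if "list" in query:
        return "playlist"
    if "v" in query:
        return "video"
    return "unknown"
```

<div>
For the URL used in the settings above, describe_youtube_url returns "video".
</div>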
<!--
<div style="text-align:left; margin-bottom: 10px;">
<img src="https://dsail-skku.github.io/VCTUBE.github.io/E.PNG" style="width:550px; height:auto;"><br>
</div>
-->
<li> Result of this process</li>
<div class="colorscripter-code" style="color:#010101;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important; position:relative !important;overflow:auto"><table class="colorscripter-code-table" style="margin:0;padding:0;border:none;background-color:#fafafa;border-radius:4px;" cellspacing="0" cellpadding="0"><tr><td style="padding:6px;border-right:2px solid #e5e5e5"><div style="margin:0;padding:0;word-break:normal;text-align:right;color:#666;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important;line-height:130%"><div style="line-height:130%">1</div><div style="line-height:130%">2</div><div style="line-height:130%">3</div></div></td><td style="padding:6px 0;text-align:left"><div style="margin:0;padding:0;color:#010101;font-family:Consolas, 'Liberation Mono', Menlo, Courier, monospace !important;line-height:130%"><div style="padding:0 6px; white-space:pre; line-height:130%">vc.download_audio()</div><div style="padding:0 6px; white-space:pre; line-height:130%">vc.download_captions()</div><div style="padding:0 6px; white-space:pre; line-height:130%">vc.audio_split()</div></div></td><td style="vertical-align:bottom;padding:0 2px 4px 0"><a href="http://colorscripter.com/info#e" target="_blank" style="text-decoration:none;color:white"><span style="font-size:9px;word-break:normal;background-color:#e5e5e5;color:white;border-radius:10px;padding:1px">cs</span></a></td></tr></table></div><br><br>
<!--
<div style="text-align:left; margin-bottom: 10px;">
<img src="https://dsail-skku.github.io/VCTUBE.github.io/total.PNG" style="width:550px;height:auto;"><br>
</div>
-->
<li> Audio file information</li>
<div style="text-align:left; margin-bottom: 10px;">
<img src="https://dsail-skku.github.io/VCTUBE.github.io/info.PNG" alt="Audio file information" style="width:650px;height:auto;">
</div>
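<div>
Since roughly 10 hours of paired data are mentioned above as the amount needed for high-quality speech, it can be useful to measure how much audio a run has produced. The following is a minimal sketch using only the standard library. It assumes the split clips are WAV files stored somewhere under one directory, which this page does not state; total_hours and TARGET_HOURS are hypothetical names written for this page, not part of VCTUBE.
</div>

```python
import wave
from pathlib import Path

TARGET_HOURS = 10  # the "at least 10 hours" figure quoted on this page


def total_hours(directory):
    """Sum the duration, in hours, of every .wav file under ``directory``.

    Hypothetical helper, not part of VCTUBE. Assumption: the clips are
    uncompressed WAV files that the standard wave module can read.
    """
    seconds = 0.0
    for path in sorted(Path(directory).rglob("*.wav")):
        with wave.open(str(path), "rb") as clip:
            # Duration of one clip = number of frames / frames per second.
            seconds += clip.getnframes() / clip.getframerate()
    return seconds / 3600
```

<div>
Comparing total_hours(output_directory) with TARGET_HOURS shows how far a collection is from that figure, where output_directory stands for wherever the split audio was saved.
</div>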
</article>
</div>
<div class="seven columns">
<h1>Paper URL</h1>
<em><a href="NOt now">INTERSPEECH 2020 (Published: 2020)</a></em>
</div>
<div class="seven columns">
<h1>Our Lab Site</h1>
<em><a href="https://sites.google.com/view/datasciencelab/"> Our lab at Sungkyunkwan University </a></em>
</div>
<div class="seven columns">
<h1>Code URL</h1>
<em><a href="https://github.com/zldzmfoq12/aud-crawler"> The code for VCTUBE is available here </a></em>
</div>
</div>
</body>
<footer>
<img src="https://dsail-skku.github.io/VCTUBE.github.io/Team.jpg" alt="The VCTUBE team"><br>
</footer>
</html>