-
Notifications
You must be signed in to change notification settings - Fork 49
/
LICENSE.txt
208 lines (197 loc) · 5.42 KB
/
LICENSE.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
This corpus was built on data obtained from different sources. The underlying texts
are licensed under the following licenses:
Academic: Multiple sources, all https://creativecommons.org/licenses/by/4.0/:
* Proceedings of DH2017 (see https://dh2017.adho.org/program/abstracts/)
* Linguistic Society of America (see https://journals.linguisticsociety.org/proceedings/index.php/PLSA/article/view/4112/3810)
* PLOS (see https://www.plos.org/terms-of-use)
Biographies: http://creativecommons.org/licenses/by-sa/3.0/ (Source: https://en.wikipedia.org/wiki/Wikipedia:Copyrights)
Court: https://creativecommons.org/licenses/by/4.0/ (multiple public sources, see document metadata)
Essays: https://creativecommons.org/licenses/by-nc-sa/4.0/ (Source: https://openwa.pressbooks.pub/lwtech88readings/)
Fiction: http://creativecommons.org/licenses/by-nc-sa/3.0/ (Source: http://smallbeerpress.com/creative-commons/)
Letters: https://creativecommons.org/licenses/by-nc-sa/4.0/ (multiple CC sources, see document metadata)
Podcasts: https://creativecommons.org/licenses/by-nc-sa/4.0/ (multiple CC sources, see document metadata)
WikiHow: http://creativecommons.org/licenses/by-nc-sa/3.0/ (Source: http://www.wikihow.com/wikiHow:Creative-Commons)
WikiVoyage: https://creativecommons.org/licenses/by-sa/3.0/ (Source: https://wikimediafoundation.org/wiki/Terms_of_Use)
Wikinews/interviews: http://creativecommons.org/licenses/by/2.5/ (Source: https://en.wikinews.org/wiki/Wikinews:Copyright)
reddit: Data available from reddit for non-commercial use only (https://www.reddit.com/help/useragreement/)
All annotations are licensed under the Creative Commons Attribution (CC-BY) version 4.0, although the texts themselves follow the licenses above.
The annotations were produced by the following people:
* Adrienne Isaac
* Akitaka Yamada
* Alex Giorgioni
* Alexandra Berends
* Alexandra Slome
* Amani Aloufi
* Amber Hall
* Amelia Becker
* Andrea Price
* Andrew O'Brien
* Ángeles Ortega Luque
* Aniya Harris
* Anna Prince
* Anna Runova
* Anne Butler
* Arianna Janoff
* Aryaman Arora
* Ayan Mandal
* Ayşenur Sağdıç
* Bertille Baron
* Bradford Salen
* Brandon Tullock
* Brent Laing
* Caitlyn Pineault
* Calvin Engstrom
* Candice Penelton
* Carlotta Hübener
* Caroline Gish
* Charlie Dees
* Chenyue Guo
* Chloe Evered
* Cindy Luo
* Colleen Diamond
* Connor O'Dwyer
* Cristina Lopez
* Cynthia Li
* Dan DeGenaro
* Dan Simonson
* Derek Reagan
* Devika Tiwari
* Didem Ikizoglu
* Edwin Ko
* Eliza Rice
* Emile Zahr
* Emily Pace
* Emma Manning
* Emma Rafkin
* Ethan Beaman
* Felipe De Jesus
* Han Bu
* Hana Altalhi
* Hang Jiang
* Hannah Wingett
* Hanwool Choe
* Hassan Munshi
* Helen Dominic
* Ho Fai Cheng
* Hortensia Gutierrez
* Jakob Prange
* James Maguire
* Janine Karo
* Jehan al-Mahmoud
* Jemm Excelle Dela Cruz
* Jess Godes
* Jessica Cusi
* Jessica Kotfila
* Jingni Wu
* Joaquin Gris Roca
* John Chi
* Jongbong Lee
* Juliet May
* Jungyoon Koh
* Katarina Starcevic
* Katelyn Carroll
* Katelyn MacDougald
* Katherine Vadella
* Khalid Alharbi
* Kristen Cook
* Lara Bryfonski
* Lauren Levine
* Leah Northington
* Lindley Winchester
* Linxi Zhang
* Lucia Donatelli
* Luke Gessler
* Mackenzie Gong
* Margaret Anne Rowe
* Margaret Borowczyk
* Maria Laura Zalazar
* Maria Stoianova
* Mariko Uno
* Mary Henderson
* Maya Barzilai
* Md. Jahurul Islam
* Michael Kranzlein
* Michaela Harrington
* Mingyeong Choi
* Minnie Annan
* Mitchell Abrams
* Mohammad Ali Yektaie
* Naomee-Minh Nguyen
* Negar Siyari
* Nicholas Mararac
* Nicholas Workman
* Nicole Steinberg
* Nitin Venkateswaran
* Parker DiPaolo
* Phoebe Fisher
* Rachel Kerr
* Rachel Thorson
* Rebecca Childress
* Rebecca Farkas
* Riley Breslin Amalfitano
* Rima Elabdali
* Robert Maloney
* Ruizhong Li
* Ryan Mannion
* Ryan Murphy
* Sakol Suethanapornkul
* Sarah Bellavance
* Sarah Carlson
* Sasha Slone
* Saurav Goswami
* Sean Macavaney
* Sean Simpson
* Seyma Toker
* Shane Quinn
* Shannon Mooney
* Shelby Lake
* Shira Wein
* Sichang Tu
* Siddharth Singh
* Siona Ely
* Siyao Peng
* Siyu Liang
* Stephanie Kramer
* Sylvia Sierra
* Talal Alharbi
* Tatsuya Aoyama
* Tess Feyen
* Timothy Ingrassia
* Trevor Adriaanse
* Ulie Xu
* Wai Ching Leung
* Wenxi Yang
* Wesley Scivetti
* Xiaopei Wu
* Xiulin Yang
* Yang Liu
* Yi-Ju Lin
* Yifu Mu
* Yilun Zhu
* Yingzhu Chen
* Yiran Xu
* Young-A Son
* Yu-Tzu Chang
* Yuhang Hu
* Yunjung Ku
* Yushi Zhao
* Zhijie Song
* Zhuosi Luo
* Zhuxin Wang
* Amir Zeldes
* and other annotators who wish to remain anonymous
To credit and find the latest list of annotators, please cite the following URL:
https://gucorpling.org/gum/
For scholarly work referencing the corpus, please cite this paper:
Zeldes, Amir (2017) "The GUM Corpus: Creating Multilayer Resources in the Classroom". Language Resources and Evaluation 51(3), 581–612.
@Article{Zeldes2017,
author = {Amir Zeldes},
title = {The {GUM} Corpus: Creating Multilayer Resources in the Classroom},
journal = {Language Resources and Evaluation},
year = {2017},
volume = {51},
number = {3},
pages = {581--612},
doi = {http://dx.doi.org/10.1007/s10579-016-9343-x}
}
For full license texts of individual sources, see the URLs above.