-
Notifications
You must be signed in to change notification settings - Fork 0
/
README
34 lines (26 loc) · 1.33 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
To generate a single-dataset-style mturk evaluation package for NELL geolocations,
1) Place a copy of Locator-out.tsv in this folder (find it in latlongWorkingDir)
Locator-out.tsv has the following fields:
1 category:concept
2 maxVarianceInMiles
3 latVariance
4 lonVariance
5 lat
6 lon
7 count
8 list of matched names
2) Run make.sh with the following arguments:
1 dataset - give a name to your dataset, to distinguish these files from others
2 Locator-out.tsv - as above
3 minCategoryCount - only consider concepts from categories in Locator-out which have at least this many concepts listed
4 maxVarInMiles - only consider geolocations in Locator-out which have this maxVar or less
5 sampleSize - select this many random concepts from each category in Locator-out (should be equal or less than minCategoryCount)
3) Add a symlink to www/ from somewhere publicly browsable on the web. Make sure you have PHP turned on.
4) Generate gold data:
a. Move www/save.safe.php to www/save.php
b. Visit www/gold.html and do a few concepts in each category
c. Move www/save.php to www/save.safe.php
5) Upload hitinput.txt to Mechanical Turk
6) Double check that the google links work
7) Run Mechanical Turk
8) Follow the directions in gal/README