-
Notifications
You must be signed in to change notification settings - Fork 3
/
stripped.html
1741 lines (1740 loc) · 145 KB
/
stripped.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<h2 id="TOC">Contents</h2>
<ul>
<li><a href="#summary">Summary</a></li>
<li><a href="#background">Background</a><ul>
<li><a href="#first-surveymonkey-survey-s1">First SurveyMonkey survey (S1)</a></li>
<li><a href="#second-surveymonkey-survey-s2">Second SurveyMonkey survey (S2)</a></li>
<li><a href="#google-surveys-survey-gs">Google Surveys survey (GS)</a></li>
<li><a href="#wikimedia-foundation-new-readers-survey-nr">Wikimedia Foundation New Readers survey (NR)</a></li>
</ul></li>
<li><a href="#other-surveys">Other surveys</a></li>
<li><a href="#motivation">Motivation</a></li>
<li><a href="#survey-questions">Survey questions</a><ul>
<li><a href="#survey-questions-for-the-first-surveymonkey-survey-s1">Survey questions for the first SurveyMonkey survey (S1)</a></li>
<li><a href="#survey-questions-for-the-second-surveymonkey-survey-s2">Survey questions for the second SurveyMonkey survey (S2)</a></li>
<li><a href="#survey-questions-for-the-google-surveys-survey-gs">Survey questions for the Google Surveys survey (GS)</a></li>
<li><a href="#survey-questions-for-the-new-readers-survey-nr">Survey questions for the New Readers survey (NR)</a></li>
</ul></li>
<li><a href="#results">Results</a><ul>
<li><a href="#s1q1-number-of-wikipedia-pages-read-per-week">S1Q1: number of Wikipedia pages read per week</a></li>
<li><a href="#s1q2-affinity-for-wikipedia-in-search-results">S1Q2: affinity for Wikipedia in search results</a></li>
<li><a href="#s1q3-section-vs-whole-page">S1Q3: section vs whole page</a></li>
<li><a href="#s1q4-search-functionality-on-wikipedia-and-surprise-at-lack-of-wikipedia-pages">S1Q4: search functionality on Wikipedia and surprise at lack of Wikipedia pages</a></li>
<li><a href="#s1q5-behavior-on-pages">S1Q5: behavior on pages</a></li>
<li><a href="#s2q1-number-of-wikipedia-pages-read-per-week">S2Q1: number of Wikipedia pages read per week</a></li>
<li><a href="#s2q2-multiple-choice-of-articles-read">S2Q2: multiple-choice of articles read</a></li>
<li><a href="#s2q3-free-response-of-articles-read">S2Q3: free response of articles read</a></li>
<li><a href="#s2q4-free-response-of-surprise-at-lack-of-wikipedia-pages">S2Q4: free response of surprise at lack of Wikipedia pages</a></li>
<li><a href="#gs-google-surveys">GS (Google Surveys)</a></li>
<li><a href="#data-validation-using-known-total-united-states-wikipedia-pageviews">Data validation using known total United States Wikipedia pageviews</a></li>
<li><a href="#nrq1-do-you-use-the-internet-and-nrq4-have-you-ever-heard-of-wikipedia">NRQ1: Do you use the Internet? and NRQ4: Have you ever heard of Wikipedia?</a></li>
<li><a href="#nrq7-how-often-do-you-use-wikipedia">NRQ7: How often do you use Wikipedia?</a></li>
<li><a href="#comparison-against-united-states-audiences">Comparison against United States audiences</a></li>
<li><a href="#data-validation-against-known-total-country-traffic">Data validation against known total country traffic</a></li>
</ul></li>
<li><a href="#takeaway-huge-gap-between-heavy-users-and-general-us-audience-plus-predictors-of-heavy-use">Takeaway: Huge gap between heavy users and general US audience, plus predictors of heavy use</a><ul>
<li><a href="#confirming-the-gap-with-numbers">Confirming the gap with numbers</a></li>
<li><a href="#qualitative-differences-in-other-aspects-of-wikipedia-engagement">Qualitative differences in other aspects of Wikipedia engagement</a></li>
<li><a href="#predictors-of-audiences-with-high-proportions-of-heavy-users">Predictors of audiences with high proportions of heavy users</a></li>
</ul></li>
<li><a href="#takeaway-effect-on-impact-estimates-for-pageviews">Takeaway: Effect on impact estimates for pageviews</a><ul>
<li><a href="#upgrading-estimate-of-impact-based-on-reader-quality">Upgrading estimate of impact based on reader quality</a></li>
<li><a href="#downgrading-estimate-of-impact-through-reach">Downgrading estimate of impact through reach</a></li>
</ul></li>
<li><a href="#takeaway-comparison-with-demographic-gaps-in-the-us-and-worldwide">Takeaway: Comparison with demographic gaps in the US and worldwide</a><ul>
<li><a href="#gender-within-the-united-states">Gender within the United States</a></li>
<li><a href="#age-within-the-united-states">Age within the United States</a></li>
<li><a href="#cross-country-comparison-in-perspective">Cross-country comparison in perspective</a></li>
</ul></li>
<li><a href="#survey-making-lessons">Survey-making lessons</a></li>
<li><a href="#further-questions">Further questions</a></li>
<li><a href="#further-reading">Further reading</a></li>
<li><a href="#summaries-of-responses-exports-for-surveymonkey-weblink-for-google-surveys">Summaries of responses (exports for SurveyMonkey, weblink for Google Surveys)</a></li>
<li><a href="#acknowledgements">Acknowledgements</a></li>
<li><a href="#survey-cost">Survey cost</a></li>
<li><a href="#document-source-and-versions">Document source and versions</a></li>
<li><a href="#license">License</a></li>
</ul>
<h2 id="summary">Summary</h2>
<ul>
<li><p>Wikipedia consumption is skewed pretty heavily towards a certain profile of people. Vipul’s Facebook friends and Slate Star Codex readers sample much more heavily from that profile than general United States Internet users. (<a href="#takeaway-huge-gap-between-heavy-users-and-general-us-audience-plus-predictors-of-heavy-use">more</a>)</p></li>
<li><p>Tentatively, these heavy users tend to be more “elite” and influential. This led us to revise upward our estimate of the impact per pageview, and revise downward our estimate of the reach and broad appeal of Wikipedia. (<a href="#takeaway-effect-on-impact-estimates-for-pageviews">more</a>)</p></li>
<li><p>The gap between elite samples of Wikipedia users and general United States Internet users is significantly greater than the gap between the different demographics within the United States that we measured. It is comparable to the gap between United States Internet users and Internet users in low-income countries. (<a href="#takeaway-comparison-with-demographic-gaps-in-the-us-and-worldwide">more</a>)</p></li>
<li><p>Vipul Naik and I are interested in understanding how people use Wikipedia. One reason is that we are getting more people to work on editing and adding content to Wikipedia. We want to understand the impact of these edits, so that we can direct efforts more strategically. We are also curious!</p></li>
<li><p>We conducted surveys of Wikipedia usage. We collected survey responses from audience segments include Slate Star Codex readers, Vipul’s Facebook friends, and a few United States audiences through SurveyMonkey Audience and Google Surveys (known at the time as Google Consumer Surveys). Our survey questions measured how heavily people use Wikipedia, what sort of pages they read or expected to find, the relation between their search habits and Wikipedia, and other actions they took within Wikipedia.</p></li>
<li><p>Different audience segments responded very differently to the survey. Notably, the SurveyMonkey and Google Surveys audiences (which are closer to being representative of the general United States Internet-using population) appear to use Wikipedia a lot less than Vipul’s Facebook friends and Slate Star Codex readers. Their consumption of Wikipedia is also more passive: they are less likely to explicitly seek Wikipedia pages when searching for a topic, and less likely to engage in additional actions on Wikipedia pages. Even the college-educated SurveyMonkey audience used Wikipedia very little.</p></li>
<li><p>After we conducted the survey and published the first version of this blog post, the Wikimedia Foundation’s Global Reach team published the results of their New Readers surveys in Nigeria, India, Mexico, Brazil, and Egypt. These surveys have some overlap with the surveys we conducted. Unsurprisingly, Wikipedia usage in the New Readers surveys is lower than for the general United States audiences in our surveys.</p></li>
<li><p>This was my first exercise in survey construction. I learned a number of lessons about survey design in the process.</p></li>
<li><p>All the survey questions, as well as the breakdown of responses for each of the audience segments, are described in this post. Links to PDF exports of response summaries are at the end of the post.</p></li>
</ul>
<h2 id="background">Background</h2>
<h3 id="first-surveymonkey-survey-s1">First SurveyMonkey survey (S1)</h3>
<p>At the end of May 2016, <a href="http://lesswrong.com/user/VipulNaik/overview/">Vipul Naik</a> and I created a <a href="https://meta.wikimedia.org/wiki/Research:Wikipedia_usage_survey" title='Vipul Naik and Issa Rice. "Research:Wikipedia usage survey". May 31, 2016. Wikimedia Meta Wiki.'>Wikipedia usage survey</a> to gauge the usage habits of Wikipedia readers and editors. <a href="https://en.wikipedia.org/wiki/SurveyMonkey">SurveyMonkey</a> allows the use of different “collectors” (i.e. survey URLs that keep results separate), so we circulated several different URLs among four locations to see how different audiences would respond. The audiences were as follows:</p>
<ul>
<li><p>SurveyMonkey’s United States audience with no demographic filters (62 responses, 54 of which are full responses).</p>
<p>Acquisition cost: $100 ($2 per response for 50 responses, 4 extra responses given gratis)</p></li>
<li><p>Vipul Naik’s timeline (<a href="https://www.facebook.com/vipulnaik.r/posts/10208540131276697" title="Vipul Naik. May 30, 2016. Archived at http://archive.is/M35mf.">post asking people to take the survey</a>; 70 responses, 69 of which are full responses). For background on Vipul’s timeline audience, see <a href="http://vipulnaik.com/facebook/" title="Vipul Naik. “Facebook”.">his page on how he uses Facebook</a>.</p>
<p>Acquisition cost: None</p></li>
<li><p>The <a href="https://lists.wikimedia.org/mailman/listinfo/analytics" title="Wikimedia. “Analytics”.">Wikipedia Analytics mailing list</a> (<a href="https://lists.wikimedia.org/pipermail/analytics/2016-May/005219.html" title="Vipul Naik. “Survey for Wikipedia readers”. May 31, 2016. Analytics mailing list.">email linking to the survey</a>; 7 responses, 6 of which are full responses). Note that due to the small size of this group, the results below should not be trusted, unless possibly when the votes are decisive.</p>
<p>Acquisition cost: None</p></li>
<li><p>Slate Star Codex (<a href="http://slatestarcodex.com/2016/06/02/links-616-linkandescence/" title="Scott Alexander. “Links 6/16: Linkandescence”. June 2, 2016. Slate Star Codex.">post that links to the survey</a>; 618 responses, 596 of which are full responses).</p>
<p>While Slate Star Codex isn’t the same as LessWrong, we think there is significant overlap in the two sites’ audiences (see e.g. <a href="http://lesswrong.com/lw/nor/2016_lesswrong_diaspora_survey_analysis_part_two/" title="ingres. “2016 LessWrong Diaspora Survey Analysis: Part Two (LessWrong Use, Successorship, Diaspora)”. June 10, 2016. LessWrong.">the recent LessWrong diaspora survey results</a>).</p>
<p>Acquisition cost: None</p></li>
<li><p>In addition, although not an actual audience with a separate URL, several of the tables we present below will include an “H” group; this is the heavy users group of people who responded by saying they read 26 or more articles per week on Wikipedia. This group has 179 people: 164 from Slate Star Codex, 11 from Vipul’s timeline, and 4 from the Analytics mailing list.</p></li>
</ul>
<p>We ran the survey from May 30 to July 9, 2016 (although only the Slate Star Codex survey had a response past June 1).</p>
<h3 id="second-surveymonkey-survey-s2">Second SurveyMonkey survey (S2)</h3>
<p>After we looked at the survey responses on the first day, Vipul and I decided to create a second survey to focus on the parts from the first survey that interested us the most. The second survey was only circulated among SurveyMonkey’s audiences:</p>
<ul>
<li><p>SurveyMonkey’s US audience with no demographic filters (54 responses).</p>
<p>Acquisition cost: $50 ($1 per response for 50 responses, 4 extra responses given gratis)</p></li>
<li><p>SurveyMonkey’s US audience with the following filters: ages 18–29 with a college or graduate degree (50 responses).</p>
<p>Acquisition cost: $125 ($2.50 per response for 50 responses)</p></li>
</ul>
<p>We first ran the survey on the unfiltered audience again because the wording of our first question was changed and we wanted to have the new baseline. We then chose to filter for young college-educated people because our prediction was that more educated people would be more likely to read Wikipedia. The SurveyMonkey demographic data does not include education, and we hadn’t seen the Pew Internet Research surveys in the next section, so we were relying on our intuition and some demographic data from past surveys) for the “college-educated” part. Our selection of the age group was based on the fact that young people in our first survey gave more informative free-form responses in survey 2 (SurveyMonkey’s demographic data <em>does</em> include age).</p>
<h3 id="google-surveys-survey-gs">Google Surveys survey (GS)</h3>
<p>We ran a third survey on Google Surveys (known at the time as Google Consumer Surveys) with a <em>single question</em> that was a word-to-word replica of the first question from the second survey. The main motivation here was that on Google Surveys, a single-question survey costs only 10 cents per response, so it was possible to get to a large number of responses at relatively low cost, and achieve more confidence in the tentative conclusions we had drawn from the SurveyMonkey surveys.</p>
<h3 id="wikimedia-foundation-new-readers-survey-nr">Wikimedia Foundation New Readers survey (NR)</h3>
<p>In late 2016 (after initial publication of this post), the Wikimedia Foundation’s Global Reach team published the results of <a href="https://meta.wikimedia.org/wiki/Global_Reach/Insights">New Readers phone surveys</a> in the following five countries: <a href="https://meta.wikimedia.org/wiki/Global_Reach/Nigeria_survey">Nigeria</a>, <a href="https://meta.wikimedia.org/wiki/Global_Reach/India_survey">India</a>, <a href="https://meta.wikimedia.org/wiki/Global_Reach/Mexico_Survey">Mexico</a>, <a href="https://meta.wikimedia.org/wiki/Global_Reach/Brazil_survey">Brazil</a>, and <a href="https://meta.wikimedia.org/wiki/Global_Reach/Egypt_survey">Egypt</a>. For the first three countries, results of additional in-person surveys have also been published. The questions in these surveys have some overlap with the questions in our surveys, so we have updated our post to include a discussion of these surveys and how the results compare with ours.</p>
<h2 id="other-surveys">Other surveys</h2>
<p>Several demographic surveys regarding Wikipedia have been conducted, targeting both editors and users. The surveys we found most helpful were the following:</p>
<ul>
<li>The <a href="https://web.archive.org/web/20130717211630/http://wikipediastudy.org/">2010 Wikipedia survey</a> by the Collaborative Creativity Group and the Wikimedia Foundation. The explanation before the bottom table on page 7 of the <a href="https://web.archive.org/web/20131209060146/http://wikipediastudy.org/docs/Wikipedia_Overview_15March2010-FINAL.pdf">overview PDF</a> has “Contributors show slightly but significantly higher education levels than readers”, which provides weak evidence that more educated people are more likely to engage with Wikipedia.</li>
<li>The <a href="https://upload.wikimedia.org/wikipedia/commons/8/8a/Global_South_User_Survey_2014_-_Full_Analysis_Report.pdf">Global South User Survey 2014</a> by the Wikimedia Foundation</li>
<li><a href="http://www.pewinternet.org/files/old-media/Files/Reports/2011/PIP_Wikipedia.pdf">Pew Internet Research’s 2011 survey</a>: “Education level continues to be the strongest predictor of Wikipedia use. The collaborative encyclopedia is most popular among internet users with at least a college degree, 69% of whom use the site.” (page 3)</li>
<li><a href="http://www.pewinternet.org/2007/04/24/wikipedia-users/">Pew Internet Research’s 2007 survey</a>.</li>
</ul>
<p>We found the Pew Internet Research surveys after conducting our own two surveys (and during the write-up of this document).</p>
<p>There is also the New Readers survey mentioned earlier, that we examine in detail in this post.</p>
<h2 id="motivation">Motivation</h2>
<p>Vipul and I ultimately want to get a better sense of the value of a Wikipedia pageview (one way to measure the impact of content creation), and one way to do this is to understand how people are using Wikipedia. As we focus on <a href="http://contractwork.vipulnaik.com">getting more people to work on editing Wikipedia</a> – thus causing more people to read the content we pay and help to create – it becomes more important to understand who is reading the content, and how they engage with it.</p>
<p>For some previous discussion, see also Vipul’s answers to the following Quora questions:</p>
<ul>
<li><a href="https://www.quora.com/What-are-the-various-parameters-that-affect-the-value-of-a-pageview/answer/Vipul-Naik" title="April 16, 2016.">What are the various parameters that affect the value of a pageview?</a></li>
<li><a href="https://www.quora.com/Whats-the-relative-social-value-of-1-Quora-pageview-as-measured-by-Quora-stats-http-www-quora-com-stats-and-1-Wikipedia-pageview-as-measured-at-say-Wikipedia-article-traffic-statistics/answer/Vipul-Naik" title="August 30, 2014.">What’s the relative social value of 1 Quora pageview (as measured by Quora stats http://www.quora.com/stats) and 1 Wikipedia pageview (as measured at, say, Wikipedia article traffic statistics)?</a></li>
</ul>
<p>Wikipedia allows relatively easy access to pageview data (especially by using tools developed for this purpose, including <a href="http://wikipediaviews.org/">one that Vipul made</a>), and there are some surveys that provide demographic data (see <a href="#previous-surveys">“Previous surveys”</a> above). However, after looking around, it was apparent that the kind of information our survey was designed to find was not available.</p>
<p>We were also driven by our curiosity of how people use Wikipedia.</p>
<h2 id="survey-questions">Survey questions</h2>
<h3 id="survey-questions-for-the-first-surveymonkey-survey-s1">Survey questions for the first SurveyMonkey survey (S1)</h3>
<p>For reference, here are the survey questions for the first survey. A dummy/mock-up version of the survey can be found here: <a class="uri" href="https://www.surveymonkey.com/r/PDTTBM8">https://www.surveymonkey.com/r/PDTTBM8</a>.</p>
<p>The survey introduction said the following:</p>
<blockquote>
<p>This survey is intended to gauge Wikipedia use habits. This survey has 3 pages with 5 questions total (3 on the first page, 1 on the second page, 1 on the third page). Please try your best to answer all of the questions, and make a guess if you’re not sure.</p>
</blockquote>
<p>And the actual questions:</p>
<ol style="list-style-type: decimal">
<li><p>How many distinct Wikipedia pages do you read per week on average?</p>
<ul>
<li>less than 1</li>
<li>1 to 10</li>
<li>11 to 25</li>
<li>26 or more</li>
</ul></li>
<li><p>On a search engine (e.g. Google) results page, do you explicitly seek Wikipedia pages, or do you passively click on Wikipedia pages only if they show up at the top of the results?</p>
<ul>
<li>I explicitly seek Wikipedia pages</li>
<li>I have a slight preference for Wikipedia pages</li>
<li>I just click on what is at the top of the results</li>
</ul></li>
<li><p>Do you usually read a particular section of a page or the whole article? (Multiple options can be selected)</p>
<ul>
<li>Particular section</li>
<li>Whole page</li>
</ul></li>
<li><p>How often do you do the following? (Choices: Several times per week, About once per week, About once per month, About once per several months, Never/almost never.)</p>
<ul>
<li>Use the search functionality on Wikipedia</li>
<li>Be surprised that there is no Wikipedia page on a topic</li>
</ul></li>
<li><p>For what fraction of pages you read do you do the following? (Choices: For every page, For most pages, For some pages, For very few pages, Never. These were displayed in a random order for each respondent, but displayed in alphabetical order here.)</p>
<ul>
<li>Check (click or hover over) at least one citation to see where the information comes from on a page you are reading</li>
<li>Check how many pageviews a page is getting (on an external site or through the Pageview API)</li>
<li>Click through/look for at least one cited source to verify the information on a page you are reading</li>
<li>Edit a page you are reading because of grammatical/typographical errors on the page</li>
<li>Edit a page you are reading to add new information</li>
<li>Look at the “See also” section for additional articles to read</li>
<li>Look at the editing history of a page you are reading</li>
<li>Look at the editing history solely to see if a particular user wrote the page</li>
<li>Look at the talk page of a page you are reading</li>
<li>Read a page mostly for the “Criticisms” or “Reception” (or similar) section, to understand different views on the subject</li>
<li>Share the page with a friend/acquaintance/coworker</li>
</ul></li>
</ol>
<p>For the SurveyMonkey audience, there were also some demographic questions (age, gender, household income, US region, and device type). These questions were not filled by respondents at the time of the survey, but rather, are filled in by respondents in order to be able to participate in these surveys. You can learn more on the <a href="https://contribute.surveymonkey.com/home">SurveyMonkey Contribute page</a>.</p>
<h3 id="survey-questions-for-the-second-surveymonkey-survey-s2">Survey questions for the second SurveyMonkey survey (S2)</h3>
<p>For reference, here are the survey questions for the second survey. A dummy/mock-up version of the survey can be found here: <a class="uri" href="https://www.surveymonkey.com/r/28BW78V">https://www.surveymonkey.com/r/28BW78V</a>.</p>
<p>The survey introduction said the following:</p>
<blockquote>
<p>This survey is intended to gauge Wikipedia use habits. Please try your best to answer all of the questions, and make a guess if you’re not sure.</p>
<p>This survey has 4 questions across 3 pages.</p>
<p>In this survey, “Wikipedia page” refers to a Wikipedia page in any language (not just the English Wikipedia).</p>
</blockquote>
<p>And the actual questions:</p>
<ol style="list-style-type: decimal">
<li><p>How many distinct Wikipedia pages do you read (at least one sentence of) per week on average?</p>
<ul>
<li>Fewer than 1</li>
<li>1 to 10</li>
<li>11 to 25</li>
<li>26 or more</li>
</ul></li>
<li><p>Which of these articles have you read (at least one sentence of) on Wikipedia (select all that apply)? (These were displayed in a random order except the last option for each respondent, but displayed in alphabetical order except the last option here.)</p>
<ul>
<li>Adele</li>
<li>Barack Obama</li>
<li>Bernie Sanders</li>
<li>China</li>
<li>Donald Trump</li>
<li>Google</li>
<li>Hillary Clinton</li>
<li>India</li>
<li>Japan</li>
<li>Justin Bieber</li>
<li>Justin Trudeau</li>
<li>Katy Perry</li>
<li>Taylor Swift</li>
<li>The Beatles</li>
<li>United States</li>
<li>World War II</li>
<li>None of the above</li>
</ul></li>
<li><p>What are some of the Wikipedia articles you have most recently read (at least one sentence of)? Feel free to consult your browser’s history.</p></li>
<li><p>Recall a time when you were surprised that a topic did not have a Wikipedia page. What were some of these topics?</p></li>
</ol>
<p>As with the SurveyMonkey Audience responses for S1, the responses for S2 also came with demographic information that the respondents had previously filled in.</p>
<h3 id="survey-questions-for-the-google-surveys-survey-gs">Survey questions for the Google Surveys survey (GS)</h3>
<p>This survey had exactly one question. The wording of the question was exactly the same as that of the first question of the second survey.</p>
<ol style="list-style-type: decimal">
<li><p>How many distinct Wikipedia pages do you read (at least one sentence of) per week on average?</p>
<ul>
<li>Fewer than 1</li>
<li>1 to 10</li>
<li>11 to 25</li>
<li>26 or more</li>
</ul></li>
</ol>
<p>One slight difference was that whereas in the second SurveyMonkey survey, the order of the options was fixed, the Google Surveys survey did a 50/50 split between that order and the exact reverse order. Such splitting is a best practice to deal with any order-related biases, while still preserving the logical order of the options. You can read more on the <a href="http://www.pewresearch.org/methodology/u-s-survey-research/questionnaire-design/">questionnaire design page of the Pew Research Center</a>.</p>
<p>Overall, Google Surveys in the United States is reasonably close to representative of the voting US population and the Internet-using population. For more information on the reliability and validity of Google Surveys responses, you can look at the citations on the <a href="https://en.wikipedia.org/wiki/Google_Surveys">Wikipedia page for Google Surveys</a>. Also, the sample size of the survey was largest. Therefore, among the surveys we did, this survey comes closest to approximating the behavior of the Internet-using population in the United States.</p>
<p>The GS responses come with inferred demographic and geographic data (age, gender, income level, location). The geographic data is generally reliable because it is based on IP address, but inferred age and gender data is not as reliable as the self-reported data that we get from SurveyMonkey Audience. For more on the accuracy of the inferred data, see the <a href="http://www.people-press.org/2012/11/07/a-comparison-of-results-from-surveys-by-the-pew-research-center-and-google-consumer-surveys/">Pew Research Center’s comparison</a>.</p>
<h3 id="survey-questions-for-the-new-readers-survey-nr">Survey questions for the New Readers survey (NR)</h3>
<p>We will compare the results of our surveys with the results of the New Readers surveys. To shed light on this comparison, we include below the list of questions in the New Readers phone survey.</p>
<p>Not all questions were presented in all surveys. The <a href="https://meta.wikimedia.org/wiki/Global_Reach/Egypt_survey">Egypt survey</a>, which is the more recent, had the longest list of questions, and we provide this list below. The numbering is mostly based on the Egypt survey, though off by one for later questions due to a question missing from the Egypt survey.</p>
<ol style="list-style-type: decimal">
<li><p>Do you use the Internet?</p>
<ul>
<li>Yes</li>
<li>Said no, but uses Facebook</li>
<li>No</li>
</ul></li>
<li><p>What do you use the Internet for the most? (for those who said Yes to Q1)</p>
<ul>
<li>Look up info</li>
<li>Social media</li>
<li>Entertainment</li>
<li>News</li>
<li>Others</li>
</ul></li>
<li><p>What’s the biggest reason you don’t use the Internet? (for those who said No to Q1)</p>
<ul>
<li>Too expensive</li>
<li>Not sure it’s useful</li>
<li>Not sure what it is</li>
<li>Other</li>
</ul></li>
<li><p>Have you ever heard of Wikipedia?</p>
<ul>
<li>Yes</li>
<li>No</li>
</ul></li>
<li><p>Where did you find out about Wikipedia?</p>
<ul>
<li>Internet</li>
<li>School</li>
<li>Friends and family</li>
<li>Radio or TV</li>
<li>Not sure</li>
</ul></li>
<li><p>What do you use Wikipedia for?</p>
<ul>
<li>School</li>
<li>Work</li>
<li>Entertainment</li>
<li>Other</li>
</ul></li>
<li><p>How often do you use Wikipedia?</p>
<ul>
<li>Daily</li>
<li>Weekly</li>
<li>Monthly</li>
<li>Rarely</li>
<li>Never</li>
</ul></li>
<li><p>How interested are you in reading Wikipedia? (for those who answered “Rarely” or “Never” to the previous question)</p>
<ul>
<li>Not interested</li>
<li>Somewhat</li>
<li>Very interested</li>
</ul></li>
<li><p>What’s the largest barrier keeping you from reding Wikipedia? (for those who answere “Very interested” to Q8)</p>
<ul>
<li>Don’t trust content</li>
<li>Expensive data</li>
<li>Not interesting enough</li>
<li>Can’t find it</li>
<li>Other</li>
</ul></li>
<li><p>What would make you more likely to use Wikipedia? (for those who answered “Not interested” to Q8)</p>
<ul>
<li>Trusted the content</li>
<li>Cheaper data</li>
<li>More interesting articles</li>
<li>Known how to find it</li>
<li>None</li>
</ul></li>
<li><p>Do you have a mobile phone? (This question was in some other country surveys though not in the Egypt one. Hence the numbering for later questions is one more than the numbering in the actual Egypt survey)</p>
<ul>
<li>Yes</li>
<li>No</li>
</ul></li>
<li><p>Can you use the Internet with your phone?</p>
<ul>
<li>Yes</li>
<li>No</li>
</ul></li>
<li><p>How do you access the Internet on your phone?</p>
<ul>
<li>Cellular</li>
<li>Wifi and cell</li>
<li>Wifi only</li>
<li>No Internet</li>
<li>Not sure</li>
</ul></li>
<li><p>What is your usual network speed?</p>
<ul>
<li>2G / Edge</li>
<li>3G</li>
<li>Better than 3G</li>
<li>Not sure</li>
</ul></li>
<li><p>Do you download and use Apps?</p>
<ul>
<li>Yes</li>
<li>No</li>
</ul></li>
<li><p>What is your gender?</p>
<ul>
<li>Male</li>
<li>Female</li>
</ul></li>
<li><p>What is your age?</p>
<ul>
<li>Under 18</li>
<li>19–31</li>
<li>31–50</li>
<li>over 50</li>
<li>Prefer not to say</li>
</ul></li>
<li><p>What is your location?</p>
<ul>
<li>Urban</li>
<li>Rural</li>
<li>Not sure</li>
</ul></li>
<li><p>What is your geographical zone? (options specific to Egypt)</p></li>
</ol>
<p>Our later analysis will focus on the first, fourth, and seventh question, which are together comparable against the first question of S1, S2, and GS.</p>
<h2 id="results">Results</h2>
<p>In this section we present the highlights from each of the survey questions. If you prefer to dig into the data yourself, there are also some <a href="#summaries-of-responses-exports-for-surveymonkey-weblink-for-google-surveys">exported PDFs below</a> provided by SurveyMonkey. Most of the inferences can be made using these PDFs, but there are some cases where additional filters are needed to deduce certain percentages.</p>
<p>For the SurveyMonkey surveys, we use the notation “S<em>n</em>Q<em>m</em>” to mean “survey <em>n</em> question <em>m</em>”. The Google Surveys survey question is referred to as GS, and the New Readers questions are referred to with the notation “NRQ<em>m</em>” for question <em>m</em> of the survey.</p>
<h3 id="s1q1-number-of-wikipedia-pages-read-per-week">S1Q1: number of Wikipedia pages read per week</h3>
<p>Here is a table that summarizes the data for Q1. Note that SMM and SMF don’t add up to SM as some respondents did not specify their gender.</p>
<table style="border-bottom: 2px solid #333; border-collapse: collapse; border-top: 2px solid #333; margin: 0px auto;">
<caption>How many distinct Wikipedia pages do you read per week on average? SM = SurveyMonkey audience, V = Vipul Naik’s timeline, SSC = Slate Star Codex audience, AM = Wikipedia Analytics mailing list, SMM = SurveyMonkey males, SMF = SurveyMonkey females</caption>
<thead>
<tr class="header" style="border-bottom: 1px dotted #ccc;">
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">Response</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SM (N=62)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">V (N=70)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SSC (N=618)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">AM (N=7)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMM (N=28)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMF (N=26)</th>
</tr>
</thead>
<tbody>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">less than 1</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">42%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">25%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">58%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">1 to 10</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">45%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">40%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">37%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">29%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">46%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">42%</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">11 to 25</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">13%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">43%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">36%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">29%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">26 or more</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">16%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">27%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">57%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">pgs/wk lower</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.88</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">9.29</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">11.35</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">16.65</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.65</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0.42</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">pgs/wk upper</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">8.17</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">19.56</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">20.81</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">23.50</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">12.10</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.70</td>
</tr>
</tbody>
</table>
<p>The “pgs/wk lower” is obtained as the average pages read per week if everybody read at the lower end of their estimate (so the respective estimates are 0, 1, 11, and 26).</p>
<p>The “pgs/wk upper” is obtained as the average of pages read per week if everybody read at the upper end of their estimate, except the “26 or more” case where we assume a value of 30 (so the respective estimates are 1, 10, 25, and 30).</p>
<p>Here are some highlights from the first question that aren’t apparent from the table:</p>
<ul>
<li><p>Of the people who read fewer than 1 distinct Wikipedia page per week (26 people), 68% were female even though females were only 48% of the respondents. (Note that gender data is only available for the SurveyMonkey audience.)</p></li>
<li><p>Filtering for high household income ($150k or more; 11 people) in the SurveyMonkey audience, only 2 read fewer than 1 page per week, although most (7) of the responses still fall in the “1 to 10” category.</p></li>
</ul>
<p>The comments indicated that this question was flawed in several ways: we didn’t specify which language Wikipedias count nor what it meant to “read” an article (the whole page, a section, or just a sentence?).</p>
<p>One comment questioned the “low” ceiling of 26; in fact, I had initially made the cutoffs 1, 10, 100, 500, and 1000, but Vipul suggested the final cutoffs because he argued they would make it easier for people to answer (without having to look it up in their browser history).</p>
<p>It turned out this modification was reasonable because the “26 or more” group was a minority.</p>
<h3 id="s1q2-affinity-for-wikipedia-in-search-results">S1Q2: affinity for Wikipedia in search results</h3>
<p>We asked Q2, “On a search engine (e.g. Google) results page, do you explicitly seek Wikipedia pages, or do you passively click on Wikipedia pages only if they show up at the top of the results?”, to see to what extent people preferred Wikipedia in search results.</p>
<p>The main implication to this for people who do content creation on Wikipedia is that if people do explicitly seek Wikipedia pages (for whatever reason), it makes sense to give them more of what they want. On the other hand, if people don’t prefer Wikipedia, it makes sense to update in favor of diversifying one’s content creation efforts while still keeping in mind that raw pageviews indicate that content will be read more if placed on Wikipedia (see for instance <a href="http://reducing-suffering.org/the-value-of-wikipedia-contributions-in-social-sciences/#Readership" title="Brian Tomasik. “The Value of Wikipedia Contributions in Social Sciences” § Readership. 2015. Essays on Reducing Suffering.">Brian Tomasik’s experience</a>, which is similar to my own, or <a href="http://www.gwern.net/Wikipedia%20and%20Other%20Wikis" title="gwern branwen. “Wikipedia and Other Wikis”. February 16, 2013.">gwern’s page comparing Wikipedia with other wikis</a>).</p>
<p>The following table summarizes our results. Wikipedia has been shortened to WP to conserve column width.</p>
<table style="border-bottom: 2px solid #333; border-collapse: collapse; border-top: 2px solid #333; margin: 0px auto;">
<caption>On a search engine (e.g. Google) results page, do you explicitly seek Wikipedia pages, or do you passively click on Wikipedia pages only if they show up at the top of the results? SM = SurveyMonkey audience, V = Vipul Naik’s timeline, SSC = Slate Star Codex audience, AM = Wikipedia Analytics mailing list, H = heavy users (26 or more articles per week) of Wikipedia.</caption>
<thead>
<tr class="header" style="border-bottom: 1px dotted #ccc;">
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">Response</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SM (N=62)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">V (N=70)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SSC (N=618)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">AM (N=7)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">H (N=179)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMM (N=28)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMF (N=26)</th>
</tr>
</thead>
<tbody>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Explicitly seek WP</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">19%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">60%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">63%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">57%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">79%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">25%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">12%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Slight preference for WP</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">29%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">39%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">34%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">43%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">20%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">39%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">23%</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Just click on top results</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">52%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">35%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">65%</td>
</tr>
</tbody>
</table>
<p>One error on my part was that I didn’t include an option for people who avoided Wikipedia or did something else. This became apparent from the comments. For this reason, the “Just click on top results” options might be inflated. In addition, some comments indicated a mixed strategy of preferring Wikipedia for general overviews while avoiding it for specific inquiries, so allowing multiple selections might have been better for this question.</p>
<h3 id="s1q3-section-vs-whole-page">S1Q3: section vs whole page</h3>
<p>This question is relevant for Vipul and me because the <a href="https://github.com/vipulnaik/contractwork/blob/master/new-article-pool.mediawiki" title='Vipul Naik. "List of new articles that any interested author can work on". GitHub.'>work Vipul funds</a> is mainly whole-page creation. If people are mostly reading the introduction or a particular section like the “Criticisms” or “Reception” section (see <a href="#s1q5-behavior-on-pages">S1Q5</a>), then that forces us to consider spending more time on those sections, or to strengthen those sections on weak existing pages.</p>
<p>Responses to this question were fairly consistent across different audiences, as can be see in the following table.</p>
<table style="border-bottom: 2px solid #333; border-collapse: collapse; border-top: 2px solid #333; margin: 0px auto;">
<caption>Do you usually read a particular section of a page or the whole article? SM = SurveyMonkey audience, V = Vipul Naik’s timeline, SSC = Slate Star Codex audience, AM = Wikipedia Analytics mailing list, H = Heavy users (26 or more articles per week) of Wikipedia, SMM = SurveyMonkey male, SMF = SurveyMonkey female.</caption>
<thead>
<tr class="header" style="border-bottom: 1px dotted #ccc;">
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">Response</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SM (N=62)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">V (N=70)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SSC (N=618)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">AM (N=7)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">H (N=179)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMM (N=28)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMF (N=26)</th>
</tr>
</thead>
<tbody>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Section</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">73%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">80%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">74%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">86%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">70%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">68%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">73%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Whole</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">34%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">23%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">33%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">29%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">37%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">39%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">31%</td>
</tr>
</tbody>
</table>
<p>People were allowed to select more than one option for this question. The comments indicate that several people do a combination, where they read the introductory portion of an article, then narrow down to the section of their interest.</p>
<h3 id="s1q4-search-functionality-on-wikipedia-and-surprise-at-lack-of-wikipedia-pages">S1Q4: search functionality on Wikipedia and surprise at lack of Wikipedia pages</h3>
<p>We asked about whether people use the search functionality on Wikipedia because we wanted to know more about people’s article discovery methods. The data is summarized in the following table.</p>
<table style="border-bottom: 2px solid #333; border-collapse: collapse; border-top: 2px solid #333; margin: 0px auto;">
<caption>How often do you use the search functionality on Wikipedia? SM = SurveyMonkey audience, V = Vipul Naik’s timeline, SSC = Slate Star Codex audience, AM = Wikipedia Analytics mailing list, H = heavy users (26 or more articles per week) of Wikipedia.</caption>
<thead>
<tr class="header" style="border-bottom: 1px dotted #ccc;">
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">Response</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SM (N=62)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">V (N=69)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SSC (N=613)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">AM (N=7)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">H (N=176)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMM (N=28)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMF (N=26)</th>
</tr>
</thead>
<tbody>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Several times per week</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">8%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">32%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">57%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">55%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">About once per week</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">19%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">17%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">21%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">15%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">21%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">19%</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">About once per month</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">15%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">13%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">12%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">About once per several months</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">13%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">12%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">9%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">5%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">7%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">19%</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Never/almost never</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">45%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">43%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">24%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">23%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">43%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">50%</td>
</tr>
</tbody>
</table>
<p>Many people noted here that rather than using Wikipedia’s search functionality, they use Google with “wiki” attached to their query, DuckDuckGo’s “!w” expression, or some browser configuration to allow a quick search on Wikipedia.</p>
<p>To be more thorough about discovering people’s content discovery methods, we should have asked about other methods as well. We did ask about the “See also” section in <a href="#s1q5-behavior-on-pages">S1Q5</a>.</p>
<p>Next, we asked how often people are surprised that there is no Wikipedia page on a topic to gauge to what extent people notice a “gap” between how Wikipedia exists today and how it <em>could</em> exist. We were curious about what articles people specifically found missing, so we followed up with <a href="#s2q4-free-response-of-surprise-at-lack-of-wikipedia-pages">S2Q4</a>.</p>
<table style="border-bottom: 2px solid #333; border-collapse: collapse; border-top: 2px solid #333; margin: 0px auto;">
<caption>How often are you surprised that there is no Wikipedia page on a topic? SM = SurveyMonkey audience, V = Vipul Naik’s timeline, SSC = Slate Star Codex audience, AM = Wikipedia Analytics mailing list, H = heavy users (26 or more articles per week) of Wikipedia, SMM = SurveyMonkey males, SMF = SurveyMonkey females.</caption>
<thead>
<tr class="header" style="border-bottom: 1px dotted #ccc;">
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">Response</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SM (N=62)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">V (N=69)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SSC (N=613)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">AM (N=7)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">H (N=176)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMM (N=28)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMF (N=26)</th>
</tr>
</thead>
<tbody>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Several times per week</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">29%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">6%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">About once per week</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">8%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">22%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">18%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">34%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4%</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">About once per month</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">18%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">36%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">34%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">29%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">31%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">18%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">15%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">About once per several months</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">21%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">22%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">27%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">19%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">29%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">15%</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Never/almost never</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">52%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">20%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">19%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">29%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">10%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">36%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">65%</td>
</tr>
</tbody>
</table>
<p>Two comments on this question (out of 59) – both from the SSC group – specifically bemoaned deletionism, with one comment calling deletionism “a cancer killing Wikipedia”.</p>
<h3 id="s1q5-behavior-on-pages">S1Q5: behavior on pages</h3>
<p>This question was intended to gauge how often people perform an action for a specific page; as such, the frequencies are expressed in page-relative terms.</p>
<p>The following table presents the scores for each response, which are weighted by the number of responses. The scores range from 1 (for every page) to 5 (never); in other words, the lower the number, the more frequently one does the thing.</p>
<table style="border-bottom: 2px solid #333; border-collapse: collapse; border-top: 2px solid #333; margin: 0px auto;">
<caption>For what fraction of pages you read do you do the following? Note that the responses have been shortened here; see the <a href="#survey-questions-for-the-first-survey">“Survey questions”</a> section for the wording used in the survey. Responses are sorted by the values in the SSC column. SM = SurveyMonkey audience, V = Vipul Naik’s timeline, SSC = Slate Star Codex audience, AM = Wikipedia Analytics mailing list, H = heavy users (26 or more articles per week) of Wikipedia, SMM = SurveyMonkey males, SMF = SurveyMonkey females</caption>
<thead>
<tr class="header" style="border-bottom: 1px dotted #ccc;">
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">Response</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SM (N=54)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">V (N=69)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SSC (N=596)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">AM (N=7)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">H (N=169)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMM (N=28)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMF (N=26)</th>
</tr>
</thead>
<tbody>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Check <span class="math inline">≥1</span> citation</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.57</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.80</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.91</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.67</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.69</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.43</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.73</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Look at “See also”</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.65</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.93</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.92</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.67</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.76</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.43</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.88</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Read mostly for “Criticisms” or “Reception”</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.35</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.12</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.34</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.83</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.14</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.32</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.38</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Click through <span class="math inline">≥1</span> source to verify information</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.80</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.07</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.47</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.17</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.36</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.86</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.73</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Share the page</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.11</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.72</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.86</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.67</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.79</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.11</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.12</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Look at the talk page</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.31</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.28</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.03</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.00</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.86</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.21</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.42</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Look at the editing history</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.35</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.32</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.12</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.33</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.92</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.36</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.35</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Edit a page for grammatical/typographical errors</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.50</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.41</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.22</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.67</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.02</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.54</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.46</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Edit a page to add new information</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.61</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.55</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.49</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.83</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.34</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.57</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.65</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Look at editing history to verify author</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.50</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.65</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.48</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.67</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.73</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.46</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.54</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Check how many pageviews a page is getting</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.63</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.88</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.96</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.17</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.92</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.68</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.58</td>
</tr>
</tbody>
</table>
<p>The table above provides a good ranking of how often people perform these actions on pages, but not the distribution information (which would require three dimensions to present fully). In general, the more common actions (scores of 2.5–4) had responses that clustered among “For some pages”, “For very few pages”, and “Never”, while the less common actions (scores above 4) had responses that clustered mainly in “Never”.</p>
<p>One comment (out of 43) – from the SSC group, but a different individual from the two in S1Q4 – bemoaned deletionism.</p>
<h3 id="s2q1-number-of-wikipedia-pages-read-per-week">S2Q1: number of Wikipedia pages read per week</h3>
<p>Note the wording changes on this question for the second survey: “less” was changed to “fewer”, the clarification “at least one sentence of” was added, and we explicitly allowed any language. We have also presented the survey 1 results for the SurveyMonkey audience in the corresponding rows, but note that because of the change in wording, the correspondence isn’t exact.</p>
<table style="border-bottom: 2px solid #333; border-collapse: collapse; border-top: 2px solid #333; margin: 0px auto;">
<caption>How many distinct Wikipedia pages do you read (at least one sentence of) per week on average? SM = SurveyMonkey audience with no demographic filters, CEYP = College-educated young people of SurveyMonkey, S1SM = SurveyMonkey audience with no demographic filters from the first survey, SMM = SurveyMonkey males, SMF = SurveyMonkey females, CEYPM = College-educated young males of SurveyMonkey, CEYPF = College-educated young females of SurveyMonkey.</caption>
<thead>
<tr class="header" style="border-bottom: 1px dotted #ccc;">
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">Response</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SM (N=54)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">CEYP (N=50)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">S1SM (N=62)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMM (N=25)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SMF (N=26)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">CEYPM (N=24)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">CEYPF (N=26)</th>
</tr>
</thead>
<tbody>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Fewer than 1</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">37%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">32%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">42%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">32%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">42%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">29%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">35%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">1 to 10</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">48%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">64%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">45%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">40%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">54%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">67%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">62%</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">11 to 25</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">7%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">13%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">16%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">26 or more</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">7%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">12%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4%</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">pgs/wk lower</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.07</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.38</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.88</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">5.28</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.58</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.11</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.66</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">pgs/wk upper</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">9.02</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">7.82</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">8.17</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">11.92</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">7.02</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">7.99</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">7.75</td>
</tr>
</tbody>
</table>
<p>The “pgs/wk lower” is obtained as the average pages read per week if everybody read at the lower end of their estimate (so the respective estimates are 0, 1, 11, and 26). The “pgs/wk upper” is obtained as the average of pages read per week if everybody read at the upper end of their estimate, except the “26 or more” case where we assume a value of 30 (so the respective estimates are 1, 10, 25, and 30).</p>
<p>Comparing SM with S1SM, we see that probably because of the wording, the percentages have drifted in the direction of more pages read. It might be surprising that the young educated audience seems to have a smaller fraction of heavy users than the general population. However note that each group only had ~50 responses, and that we have no education information for the SM group.</p>
<h3 id="s2q2-multiple-choice-of-articles-read">S2Q2: multiple-choice of articles read</h3>
<p>Our intention with this question was to see if people’s stated or recalled article frequencies matched the actual, revealed popularity of the articles. Therefore we present the <a href="https://web.archive.org/web/20160714023739/http://wikipediaviews.org/displayviewsformultipleyears.php?tag=Pages%20in%20SurveyMonkey%20second%20survey">pageview data</a> along with the percentage of people who said they had read an article.</p>
<table style="border-bottom: 2px solid #333; border-collapse: collapse; border-top: 2px solid #333; margin: 0px auto;">
<caption>Which of these articles have you read (at least one sentence of) on Wikipedia (select all that apply)? SM = SurveyMonkey audience with no demographic filters, CEYP = College-educated young people of SurveyMonkey. Columns “2016” and “2015” are desktop pageviews in millions. Note that the 2016 pageviews only include pageviews through the end of June. The rows are sorted by the values in the CEYP column followed by those in the SM column.</caption>
<thead>
<tr class="header" style="border-bottom: 1px dotted #ccc;">
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">Response</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">SM (N=54)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">CEYP (N=50)</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">2016</th>
<th align="left" style="border-bottom: 1px solid #333; font-weight: normal; padding: 0px 10px; vertical-align: top;">2015</th>
</tr>
</thead>
<tbody>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">None</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">37%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">40%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">—</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">—</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">World War II</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">17%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">22%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.6</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">6.5</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Barack Obama</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">17%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">20%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.0</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">7.7</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">United States</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">17%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">18%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.3</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">9.6</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Donald Trump</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">15%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">18%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14.0</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">6.6</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Taylor Swift</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">9%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">18%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.7</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">5.3</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Bernie Sanders</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">17%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">16%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.3</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.8</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Japan</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">11%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">16%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.6</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.7</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Adele</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">6%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">16%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.0</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4.0</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Hillary Clinton</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">19%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.8</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.5</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">China</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">13%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.9</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">5.2</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">The Beatles</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">11%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">14%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">1.4</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.0</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Katy Perry</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">9%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">12%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">0.8</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.4</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Google</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">15%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">10%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">3.0</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">9.0</td>
</tr>
<tr class="odd" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">India</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">13%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">10%</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">2.4</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">6.4</td>
</tr>
<tr class="even" style="border-bottom: 1px dotted #ccc;">
<td align="left" style="padding: 0px 10px; vertical-align: top;">Justin Bieber</td>
<td align="left" style="padding: 0px 10px; vertical-align: top;">4%</td>