-
Notifications
You must be signed in to change notification settings - Fork 20
Guide to Ethical Data Collections Practices #59
Comments
The Datasheets for Datasets paper (mentioned in #10) advises that dataset creators answer ~60 questions (!) regarding motivation, curation (composition, collection, and data cleaning), and integration (uses, distribution, and maintenance). Would it makes sense to focus on a select number of these, especially those questions related specifically to data from other people? I copied these verbatim, but we can edit further… (Motivation)
(Composition)
(Collection Process)
(Uses)
(Maintenance)
The full list is here. Of note, this paper is frequently referenced in the Partnership on AI’s About ML project which ultimately aims to establish documentation standards across industries for the transparency of entire ML systems—both datasets and models. |
Thank you so much @ellennickles, this is fantastic. (And thank you for summarizing, super helpful.) I plan on discussing this in class tomorrow! |
The question came up today in class: "What if I want to collect data? Is there a helpful guide / document of tips / common strategies for ethical data collection?". Please add your suggestions here:
Also, nothing these two topics I referenced:
Duke University MTMC
Atlanta Asks Google Whether It Targeted Black Homeless People
The text was updated successfully, but these errors were encountered: