{"id":4408,"date":"2020-10-22T17:06:53","date_gmt":"2020-10-22T21:06:53","guid":{"rendered":"https:\/\/commons.princeton.edu\/ant347-f20\/?p=4408"},"modified":"2020-10-22T17:06:53","modified_gmt":"2020-10-22T21:06:53","slug":"big-data-or-any-data-is-never-objective","status":"publish","type":"post","link":"https:\/\/commons.princeton.edu\/ant347-f20\/big-data-or-any-data-is-never-objective\/","title":{"rendered":"Big Data (or any data) is never objective"},"content":{"rendered":"<p>I want to consider the question raised and answered by boyd and Crawford on page 666: \u201cDo numbers speak for themselves? We believe the answer is \u2018no\u2019\u201d and link it to their second point, that claims to objectivity that come with Big Data are misleading.<\/p>\n<p>People are always going to need words and description to write about data. Data needs to be graphed, labeled, and plotted; figures need captions and descriptions to let the audience know what \u201ctruth\u201d to take from the numbers. If the numbers spoke for themselves, why would anyone bother writing research papers? One could just give the numbers and have everyone draw the \u201ctrue conclusion\u201d from the data.<\/p>\n<p>Numbers always require a description, and thus a context, to give them meaning. The number -2 (arguably raw data) is useless without a description \u2013 is it a change in temperature, a drop in stock points, the velocity of a bird? Because of this requirement for a description, data is never objective. When deciding what data to collect, you describe and categorize what you\u2019re looking for, already shaping the outcome of the dataset and the conclusions you\u2019re able to draw. Then, in processing, even more layers of interpretation are heaped onto \u201craw\u201d numbers; researchers prune away numbers and data that they don\u2019t want in search of some true pattern. 
Arguably, the most crucial step in data science is data cleaning \u2013 you\u2019re irrevocably shaping the final result by choosing what to keep and what to discard. Most people, however, fail to realize how much pruning and cleaning matter to data analysis; the attention is on the final, \u201cobjective\u201d results, not on the critical work done in the filtering steps.<\/p>\n<p>Layers of meaning always need to be attached to numbers through words and by people, and this is no different for Big Data; the unique and dangerous thing about Big Data is that it attempts to obscure just how interpretive it really is.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I want to consider the question raised and answered by boyd and Crawford on page 666: \u201cDo numbers speak for themselves? We believe the answer is \u2018no\u2019\u201d and link it to their second point, that claims to objectivity that come with Big Data are misleading. People are always going to need words and description to 
[&hellip;]<\/p>\n","protected":false},"author":3128,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[],"class_list":["post-4408","post","type-post","status-publish","format-standard","hentry","category-post-production"],"_links":{"self":[{"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/posts\/4408","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/users\/3128"}],"replies":[{"embeddable":true,"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/comments?post=4408"}],"version-history":[{"count":1,"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/posts\/4408\/revisions"}],"predecessor-version":[{"id":4409,"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/posts\/4408\/revisions\/4409"}],"wp:attachment":[{"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/media?parent=4408"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/categories?post=4408"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/commons.princeton.edu\/ant347-f20\/wp-json\/wp\/v2\/tags?post=4408"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}