{"id":113,"date":"2020-11-19T09:16:15","date_gmt":"2020-11-19T09:16:15","guid":{"rendered":"https:\/\/blogs.ncl.ac.uk\/opening-research\/?p=113"},"modified":"2020-11-19T12:40:16","modified_gmt":"2020-11-19T12:40:16","slug":"what-data-is","status":"publish","type":"post","link":"https:\/\/blogs.ncl.ac.uk\/opening-research\/2020\/11\/what-data-is\/","title":{"rendered":"Guest Post: Rethinking What Data Is"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"681\" src=\"https:\/\/blogs.ncl.ac.uk\/opening-research\/files\/2020\/11\/data.jpg\" alt=\"\" class=\"wp-image-121\" srcset=\"https:\/\/blogs.ncl.ac.uk\/opening-research\/files\/2020\/11\/data.jpg 1024w, https:\/\/blogs.ncl.ac.uk\/opening-research\/files\/2020\/11\/data-300x200.jpg 300w, https:\/\/blogs.ncl.ac.uk\/opening-research\/files\/2020\/11\/data-768x511.jpg 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p><em>This is our first guest post on the Opening Research blog. We are keen to hear from colleagues across the research landscape so please <a href=\"mailto:rdm@ncl.ac.uk?subject=Blog%20post\">do get in touch<\/a> if you&#8217;d like to write a post. But the honor of debut guest blogger goes to David Johnson, PGR in History, Classics and Archaeology.<\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator\" \/>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\"><p>The trainings on open publishing and data storage fundamentally changed my perspective on what constitutes data.<\/p><\/blockquote>\n\n\n\n<p>Coming to start my PhD from a background in history and the humanities, I really didn\u2019t give the idea of data much thought.\u00a0 I knew I was expected to present evidence about my topic in order to defend my research and my ideas, but in my mind there was a fundamental difference between the kind of evidence I was going to work with and \u2018data\u2019.\u00a0 Data was something big and formal, a collection of numbers and formulae that people other than me collated and manipulated using advanced software.\u00a0 Evidence was the warm and fuzzy bits of people&#8217;s lives that I would be collecting in order to try and say something meaningful about them, not something to \u2018crunch\u2019, graph, or manipulate.\u00a0 This was a critical misconception that I am pleased to say I have come to terms with now.<\/p>\n\n\n\n<p>What I had to do was get away from the very numerical interpretation of the term \u2018data\u2019, and start to think in broader terms about the definition of the word.&nbsp; When I was asked about a data plan for my initial degree proposal, I said I didn\u2019t have one.&nbsp; I simply didn\u2019t think I was going to need one.&nbsp; In fact, I had already developed a basic data plan without realising what it was called.&nbsp; My initial degree proposal included going through a large volume of domestic literature and gathering as many examples of emotional language as I could find to create a lexicon of emotions words in use during the nineteenth century.&nbsp; In retrospect, it\u2019s obvious that effort was fundamentally based in data analysis, but my notion of what \u2018data\u2019 was prevented me from seeing that at the time.&nbsp;<\/p>\n\n\n\n<p>What changed my mind was some training I went to as part of my PhD programme, which demonstrates how important it is to engage with that training with an open mind.&nbsp; The trainings on open publishing and data storage fundamentally changed my perspective on what constitutes data.&nbsp; Together these two training events prompted me to reconsider the way I approached the material I was collecting for my project.&nbsp; My efforts to compile a vocabulary of emotions words from published material during the nineteenth century was not just a list of word, but was a data set that should be preserved and made available.&nbsp; Likewise, the ever-growing pile of diary entries demonstrating the lived emotional experiences of people in the nineteenth century constitutes a data set.&nbsp; Neither of these are in numerical form, yet they both can be qualitatively and quantitatively evaluated like other forms of data.<\/p>\n\n\n\n<p>I suspect I am not alone in carrying this misconception as far into my academic work as I have.&nbsp; I think what is required for many students is a rethinking of what constitutes data.&nbsp; Certainly in the hard sciences, and perhaps in the social sciences there is an expectation of working with traditional forms of data such as population numbers, or statistical variations from a given norm, but in the humanities we may not be as prepared to think in those terms.&nbsp; Yet whether analysing an author\u2019s novels, assessing parish records, or collecting large amounts of diary writings as I am, the pile of text still constitutes a form of data, a body of material that can be subjected to a range of data analysis tools.&nbsp; If I had been able to make this mind shift earlier in my degree, I might have been better able to manage the evidence I collected, and also make a plan to preserve that data for the long term.&nbsp; That said, it\u2019s still better late than never, and I am happy say I have made considerable progress since I rethought my notions of what data was.&nbsp; I have put my lexicon data set out on the Newcastle Data Repository, so feel free to take a look at <a href=\"https:\/\/doi.org\/10.25405\/data.ncl.11830383.v1\">https:\/\/doi.org\/10.25405\/data.ncl.11830383.v1<\/a>.<\/p>\n\n\n\n<p><em>Image credit: JD Handcock Photos: <a href=\"http:\/\/photos.jdhancock.com\/photo\/2012-09-28-001422-big-data.html\">http:\/\/photos.jdhancock.com\/photo\/2012-09-28-001422-big-data.html <\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>This is our first guest post on the Opening Research blog. We are keen to hear from colleagues across the research landscape so please do get in touch if you&#8217;d like to write a post. But the honor of debut guest blogger goes to David Johnson, PGR in History, Classics and Archaeology. The trainings on &hellip; <a href=\"https:\/\/blogs.ncl.ac.uk\/opening-research\/2020\/11\/what-data-is\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Guest Post: Rethinking What Data Is<\/span><\/a><\/p>\n","protected":false},"author":7194,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9,2,3],"tags":[13,12,8,10,11],"class_list":["post-113","post","type-post","status-publish","format-standard","hentry","category-guest-post","category-open-access","category-open-data","tag-data-management-planning","tag-guest-post","tag-open-research","tag-open-research-data","tag-training"],"_links":{"self":[{"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/posts\/113","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/users\/7194"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/comments?post=113"}],"version-history":[{"count":7,"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/posts\/113\/revisions"}],"predecessor-version":[{"id":125,"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/posts\/113\/revisions\/125"}],"wp:attachment":[{"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/media?parent=113"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/categories?post=113"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.ncl.ac.uk\/opening-research\/wp-json\/wp\/v2\/tags?post=113"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}