What is Open data?
From Paolo Mangiafico, Duke’s Director of Digital Information Strategy:
Open Access is about more than just the publications that are the results of research – it’s also about the data generated during the research process.
While publications have always been “public” by definition (even if not universally accessible), data has more frequently been made available only on request, or when there’s some reason to question the published results.
But there are good reasons to make more data more open more often. A 2009 report from the National Academy of Sciences titled “Ensuring the Integrity, Accessibility, and Stewardship of Research Data in the Digital Age” makes the case this way:
The advance of knowledge is based on the open flow of information. Only when a researcher shares data and results with other researchers can the accuracy of the data, analyses, and conclusions be verified. Different researchers apply their own perspectives to the same body of information, which reduces the bias inherent in individual perspectives. Unrestricted access to the data used to derive conclusions also builds public confidence in the processes and outcomes of research. Furthermore, scientific, engineering, and medical research is a cumulative process. New ideas build on earlier knowledge, so that the frontiers of human understanding continually move outward.
Researchers use each other’s data and conclusions to extend their own ideas, making the total effort much greater than the sum of the individual efforts.
Openness speeds and strengthens the advance of human knowledge.” (p. 59)
While not all data should be kept and not all data can be shared, policies, processes, and infrastructure are being developed in many fields and at many institutions to promote openness of research data wherever possible. One example based here at Duke, the Dryad repository, part of the National Evolutionary Synthesis Center, is working with stakeholders from journals and scientific societies to develop data sharing policies, and a place to deposit data underlying scientific publications. Similar policies have been adopted by funding agencies and are becoming an expectation in many fields.
Making your data openly accessible can also bring more attention to your work.
One study that examined the citation history of 85 cancer microarray clinical trial publications found that
48% of trials with publicly available microarray data received 85% of the aggregate citations. Publicly available data was significantly (p = 0.006) associated with a 69% increase in citations, independently of journal impact factor, date of publication, and author country of origin using linear regression.” (Piwowar HA, Day RS, Fridsma DB, 2007 “Sharing Detailed Research Data Is Associated with Increased Citation Rate.” PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308)
Want to learn more about open data and how you can share the results of your work? Here are some starting points:
For more information, see the Open Access at Duke web site.
3 Responses to What is Open data?
Leave a Reply Cancel reply
Policy on Electronic Course Content
For help deciding whether course content in Blackboard or some other digital form is fair use or requires copyright permission, consult this policy document adopted by the Academic Council in February 2008.
Search the Scholarly Communications Blog
Categories
- Authors' Rights
- Copyright in the Classroom
- Copyright Information Notes
- Copyright Issues and Legislation
- Data
- Digital Rights Management
- Fair Use
- international IP
- Libraries
- Licensing
- Open Access and Institutional Repositories
- Open Access topics
- Orphan works
- Public Domain
- Scholarly Publishing
- Technologies
- Traditional Knowledge
- Uncategorized
- User Generated Content
Archives
Recent Comments
- Academic publisher on Finding out who your friends are
- Martina Periodicos on The GSU decision — not an easy road for anyone
- Jeff Malaguilla on The six million dollar fair use standard
- Kevin Smith on “the radical disaggregation of scholarship” | Marygrove Library News on Meet me at the intersection
- friends and foes at Attempting Elegance on Finding out who your friends are


As Duke University’s first Scholarly Communications Officer, Kevin Smith’s principal role is to teach and advise faculty, administrators and students about copyright, intellectual property licensing and scholarly publishing.
RSS Feed

[...] available via open access leads to greater impact, as measured by number of citations. In an earlier blog post on open data, I mentioned a study that showed similar effects for the data underlying the [...]
[...] a lot of talk about open data as of late (see the Panton Principles, Kevin Smith’s blog post “What is Open Data”, and Michael Gurstein’s blog posts here and here). I’ve also recently read about numerous [...]
[...] journal that showcases student and alumni work at UIowa. The open access journal, and the idea of open data (a growing concept in Scholarly Communications) generally, could become a real watershed moment in [...]