You are browsing the archive for Panton Principles.

Yet another Open Access inquiry

January 30, 2013 in Announcements, Panton Principles, Research

Hot on the heels of the recent House of Lords inquiry, there is also a separate Business, Innovation and Skills (BIS) Committee inquiry into the new Research Councils UK open access mandate focusing on economic aspects. There were only 70 or so written evidence submissions to the House of Lords inquiry and few were from active researchers. Other countries around the world are closely following developments with UK policy so it is globally important that the UK mandate remains strong.
For this new BIS inquiry we think you might want to submit written evidence. You need not be a UK resident or national. In fact, since the UK contributes 6% of the world’s academic research output (and 14% of the highly cited output) we’re all stakeholders in this. Open access benefits the world, academics and non-academics alike.
   
The Committee will consider a range of topics including:
  • The Government’s acceptance of the recommendations of the Finch Group Report ‘Accessibility, sustainability, excellence: how to expand access to research publications’, including its preference for the ‘gold’ over the ‘green’ open access model;
  • Rights of use and re-use in relation to open access research publications, including the implications of Creative Commons ‘CC-BY’ licences;
  • The costs of article processing charges (APCs) and the implications for research funding and for the taxpayer; and
  • The level of ‘gold’ open access uptake in the rest of the world versus the UK, and the ability of UK higher education institutions to remain competitive.
They are not particularly looking for general endorsements of Open Access. That is thankfully a given, unchangeable policy direction. As I understand it they are looking for relevant evidence to the points above, only.    
Written evidence should be sent to the Committee, as an MS Word document, by e-mail to biscommem@parliament.uk.  The deadline for BIS submissions is 7 February 2013, further details here.
 
The Open Knowledge Foundation is particularly concerned about the confusion in many recent blog posts in certain quarters over what Creative Commons licences actually do. Some have been attempting to portray the Creative Commons Attribution licence (CC BY)  as against ‘author rights’ or against ‘academic freedom‘. It would be good to make clear the benefits of CC BY, perhaps even specifically in terms of economics and economic benefits.

 

This is a rare opportunity for for our voices to be heard in a policy-guiding process. We should not waste this opportunity. Commercial academic publishers will almost certainly be submitting their viewpoints and interests, so we should equally ensure that our interests in intelligent openness are represented here too.

   

^Ross    OKFN Panton Fellow

A recap of some of the activity at OKFestival 2012

September 27, 2012 in events, OKFest, Panton Principles, Research

 

The following post is by Ross Mounce, one of the two OKFN Panton Fellows.

Wow! Where to begin… In this post I shall attempt to summarise some of OKFestival 2012 that was held in Helsinki just last week from the 17th to the 22nd of September.

Some Background:

I had been to the Open Knowledge Conference last year (in Berlin), where I gave an invited talk on Open Palaeontology and met lots of brilliant people in the Open Science community like Bjoern Brembs, Cameron Neylon & Peter Murray-Rust. But this year the event was even bigger, and even better – teaming up with the annual Open Government Data Camp for a mega-event.

The Event Itself:

The Aalto University buildings of the venue were wonderfully modern and well equipped for this event (inc. great WiFi which was essential for such a digital event as this). I got to Helsinki with our other Panton Fellow – Sophie Kershaw on the Tuesday, and caught the tail end of the Data Journalism session that day including an excellent, inspirational talk on shippr.org amongst other things. It detailed the amazing knowledge and insight gained from tracking the movement of ships with open data. I couldn’t help thinking that academics could learn a lot from these open data visualization experts (myself included!). This is one of the huge benefits of the conference – bringing together a melange of humanities, scientists, economists, governments, the World Bank (there were at least 20 representatives from this organisation here!), corporates (e.g. IBM), journalists, entrepeuners, and designers all connected by a shared utility of openness.

An interesting example of Shippr data – ships turn off their beacons once they pass the point for fear of pirates…

Wednesday – the Science & Academia session

I really liked the way that the conference had an introductory session to the days parallel events in the morning from 10am – 11am. If one was unsure of which stream to go to – these Morning Plenaries gave each topic stream a chance to pitch their events in a short slot to the awaiting audience. I thought this was very helpful given there were 13 separate topic streams at the conference!

I was involved in two sessions on this Science day. Firstly the Open Access discussion panel chaired by Peter Murray-Rust, the video for which is here with Tim Hubbard (Sanger Institute), Carlos Russel (World Bank), Tom Olijhoek & Mark MacGillivray (Open Access Index) and myself (University of Bath & OKFN Panton Fellow):

It’s a long video. We covered many topics including altmetrics, the lack of access for independent researchers and ivory-tower academics, the role of libraries, ‘illegally’ posting one’s own work up on the internet, incentives for OA and much more… with excellent contributions from the audience including Puneet Kishnor from Creative Commons and Matt Todd from the Open Source Drug Discovery team amongst many others.

Then after this there was the research data session with contributions from Mark Wainwright on CKAN, Mark Hahnel on Figshare and Joss Winn of the Orbital project.

Finally we finished with the Panton Fellowships Session with talks from myself on content mining for phylogenetic tree data, Open Access licencing and the various costs of Gold Open Access options:

and Sophie Kershaw on her Open Science Training Initiative (OSTI) at the Oxford University DTC:

The day was then rounded off with a hugely inspirational talk from Matt Todd who had travelled all the way from Australia(!), summarising his Open Source Drug Discovery work in the main lecture theatre, followed by a lovely traditional Finnish meal & social mixer afterwards in Ravintola Lasipalatsi.

Fast forward to 04:30 to see the start of Matt Todd’s talk

Thursday

Probably everyone’s highlight of the conference was Hans Rosling’s fantastic key note presentation which I urge you all to watch – it was brilliant, and thrilling to be there live in the audience for.

Friday

If there’s one thing that impresses me most of all about OKFestival, it’s this: it’s not just about talking – they do things here too. Lots of ‘hacking’ sessions on Friday to create new tools and collate awesome new data. Most conferences are extremely boring in that it’s just talk after talk after talk. Things get done here, new collaborations are started, fresh links across disciplinary boundaries are made connecting journalism with academia, economic development with open architectural design, and other incredible trans-disciplinary mashups. It’s a joy to behold.

I’m really glad I came to OKFestival, as ever I got a lot out of it.

Next year it’ll be in Switzerland.

Perhaps we’ll see you there?

Ross

cross-posted and modified from a previously posted version here

Working Group Response to Royal Society Science as a Public Enterprise

July 10, 2012 in Panton Principles, Publications

Those of you following #openscience news over the last few weeks won’t have failed to notice that the Royal Society in the UK recently released their Science as a Public Enterprise Report in strong support of open science. The working group submitted our collaboratively drafted response during the consultation period, which you can read below or download with other responses.

What ethical and legal principles should govern access to research results and data? How can ethics and law assist in simultaneously protecting and promoting both public and private interests:

The presiding principle should be that all outputs of publicly funded research are released and made publicly accessible as soon as is practicable and reasonable. Our position would be that all data, code and algorithms supporting published scientific results should be released openly alongside that publication in accordance with the Panton Principles for Open Data in Science [1].

We acknowledge that there are reasons why outputs should not be released but these are restricted to a small set of issues including but not necessarily limited to personal privacy, personal endangerment, risks to the research itself, danger to the environment. All of these are much larger issues which deserve consideration.

[1] www.pantonprinciples.org

2 a) How should principles apply to publicly-funded research conducted in the public interest:

The principles and caveats discussed above should apply to all such research.

2 b) How should principles apply to privately-funded research involving data collected about or from individuals and/or organisations (e.g. clinical trials)?

If the full economic cost is covered by a private company then research is ‘private’ and there shouldn’t be an expectation of data release in the manner previously discussed. However, even private funding is often from donors that have expectations of research being done in the public interest e.g. charities supporting medical research. Therefore, the better distrinction may be between public interest research and commercial R&D (as addressed below).

There should be dicussion between donors and funders as to data release policies in these cases and where the funder is in agreement the data should be made available according to the principles above while prioritising privacy and anonymity of research subjects (see Q2d)

In the case of partial private funding there may be grey areas. This includes cases where research is privately funded but heavily subsidised by institutions such that companies are not bearing the full economic cost. These may need consideration case by case but in terms of any published results in the scientific literature, the default should be that the data to back up those claims is publicly available.

2 c) How should principles apply to research that is entirely privately-funded but with possible public implications?

If private funding leads to research claims around public policy areas e.g. health, environment or planning, then data to support that claim should be made available in a publicly accessible manner. This is imperative if research claims are intended to influence public policy.

2 d) How should principles apply to research or communication of data that involves the promotion of the public interest but which might have implications from the privacy interests of citizens?

The privacy of citizens should come before the need for open data, but methods to protect privacy and still release data in the public interest should be explored and considered where possible.

3. What activities are currently under way that could improve the sharing and communication of scientific information?

There are numerous barriers to effective sharing and communication of scientific information which are being adressed by ongoing projects and activities.

Getting the data in shareable form: One of the most urgent needs to improve sharing and communication of scientific information is to improve capture and collection of data. Technology is required alongside standards for formatting and sharing data and money must be invested in designing and building user friendly data capture devices. For example, the efficiency of data capture and collection could be improved by scientific workflow systems such a Taverna and VisTrails.

Encouraging sharing and communication: This requires work on the attitude and expectations of scientists, establishment of community norms around data sharing and a reward system which recognises the worth of dataset publication as well as journal articles.

Work on reward systems to ensure that scientists get more recognition and benefit for sharing data is ongoing. Systems such as microattribution and other forms of recognition following reuse of shared data are being developed by STARMETRICS (NIH) [1], REF (HEFCE) [2] and Altmetrics [3] among others.

This work will be essential for creating incentives and community norms encouraging sharing and release of data. Researchers reservations must be addressed, which may include fears of getting scooped, releasing ‘dirty’ and unedited data in which people may find mistakes, possible harm due to caveats and interpretation notes being detached from the dataset and the risk of misinterpretation. The interaction of incentives, culture, and individuals will make the difference in driving greater accessibility to scientific information.

Data Publication: An important step in improving sharing is enabling scientific datasets to be published as citeable and objects and publications such as BMC Research Notes [4] and F1000 data publications (among others) are allowing this to happen.

Some fields, such as crystallography, recognised the value of data papers far earlier than others. There are currently initiatives ongoing in other areas which have traditionally not followed this publication model e.g. meteorology [5].

Much scientific information does not make it into a paper or on its own would not merit a data publication but may still be of use to others. For instance, most scientists do not publish negative results but this may lead to unecessary duplication by other labs, which reduces the efficiency of research. Projects such as FigShare [6] enable such data to be made accessible and their use should be encouraged.

Standards: Each scientific discipline will differ in what research data it feels can or cannot be released and when and how that release should happen. Encouragement of disciplines to articulate norms around reasonable exceptions to release has started via projects such as Data Dryad [7].

A difficult issue is what rights researchers have to first use of any data they collect. It may be necessary to set an upper limit to time of publication, particularly in fields where long term data collection is the norm. However, the default should be that data is released as soon as practicable on a timescale deemed reasonable by the community.

On top of this, standards related to the formatting and content of datasets must be considered to increase their usefulness to multiple stakeholders including other researchers and the public. Initiatives such as Biosharing [8] in the life sciences aim to catalogue bioscience data reporting standards and policies.

Principles: Principles for the release of data should be clearly articulated e.g. the Panton Principles for Open Data in Science [9] provide recommendations for the release of fully open data. There is a difference between publicly accessible publication and open publication of data, this organisation would strongly promote the latter. As per the Panton Principles introduction:

Science is based on building on, reusing and openly criticising the published body of scientific knowledge. For science to effectively function, and for society to reap the full benefits from scientific endeavours, it is crucial that science data be made open. By open data in science we mean that it is freely available on the public internet permitting any user to download, copy, analyse, re-process, pass them to software or use them for any other purpose without financial, legal, or technical barriers other than those inseparable from gaining access to the internet itself. To this end data related to published science should be explicitly placed in the public domain.

Data management and repositories: Numerous projects are ongoing in the field of improving scientific data repositories and research data management (Dryad [10], JISC UMF [11]), building sustainable infrastructures to draw data from different sources and make it available (DataONE [12]), looking at improving the quality and availability of scientific data more generally (CoDATA [13]).

[1] https://www.starmetrics.nih.gov/
[2] http://www.hefce.ac.uk/research/ref/
[3] http://altmetrics.org/manifesto/
[4] http://www.biomedcentral.com/bmcresnotes/
[5[ http://www.jisc.ac.uk/whatwedo/programmes/reppres/sue/ojims.aspx
[6] http://figshare.com/
[7] http://www.datadryad.org/jdap
[8] http://www.biosharing.org/
[9] http://pantonprinciples.org/
[10] http://datadryad.org/
[11] http://www.jisc.ac.uk/whatwedo/programmes/umf.aspx
[12] http://www.dataone.org/about
[13] http://www.codata.org/

5. What additional challenges are there in making data usable by scientists in the same field, scientists in other fields, ‘citizen scientists’ and the general public?

The form in which the data is published is a major factor in its reuseability e.g. file formats, use of standard ontologies. Many intitiatives are defining data standards (see Q3) which will improve the situation for other researchers.

The barriers to making data useable by citizens are higher. Data visualisation and provision of suitable narratvies to accompany datasets would be useful, although language barriers would also need to be addressed.

6 a) What might be the benefits of more widespread sharing of data for the productivity and efficiency of scientific research?

More widespread sharing of data increases the efficiency of research through:

  • Reduction of duplication – Datasets are discoverable and can be reused e.g. data deposited in the NCBI Gene Expression Omnibus (GEO) database was reused in around 1150 papers from PubMed during 2007- 2010 [1].Sharing of negative results could reduce duplication significantly.
  • Ease of replication – Release of full datasets as opposed to summaries in papers enable more effective replication and scope for discovering errors, partcularly if related code and algorithms are also released.
  • Ease of critique and reanalysis – Thorough and rapid public critique of data would be made easier, possibly leading to more discussions such as the recent asrsenic life debate in the blogosphere.
  • Crowdsourcing analysis – Sharing data with other scientists can be particularly beneficial when rapid analysis of data is required e.g. the recent E.coli outbreak in Europe where E.coli genome comparisons were crowd sourced in the public domain [2]. This demonstrates its inherent ability to increase efficiency compared to multiple closed labs performing the same analysis. Citizen science such as Galaxy Zoo or PlanetHunters also allows the crowdsourced analysis of scientific data by members of the public, which can only happen if data is publicly shared.
  • Less time wasted searching for data: The discovery of data under the current system of publication can be a time consuming process. Once a suitable publication is found, the full data may need to be requested from the authors and that which is included in the publication may not be openly reuseable due to licensing. Finding out if data is reuseable takes time, which can reduce productivity and efficiencly. Tools such as the data status request service Is It Open Data? [2], which archives responses from data providers, may be useful in this regard but still takes time.

[1] http://researchremix.wordpress.com/2011/05/19/nature-letter/ [2] https://github.com/ehec-outbreak-crowdsourced/BGI-data-analysis/wiki [3] http://www.isitopendata.org/

6 b) What might be the benefits of more widespread sharing of data for new sorts of science?

Genomics as a science would have been impossible without data sharing via databases such as GenBank and the same applies to other scientific fields and their respective data sharing methods e.g. astronomy and crystallography.

It is difficult to predict what new fields may emerge from a world with greater availability of data. However, the release of open datasets will enable the utilisation of technology which already exists but is restricted by the availability of useable data e.g. semantic web technologies using linked data could discover obscure connections between datasets and research findings. Text mining and data mining tools have huge potential for new discoveries if allowed access to large sets of scientific information.

6 c) What might be the benefits of more widespread sharing of data for public policy?

Evidence based public policy will be more credible if the data supporting it is freely available (see also Q2d). This could lead to more informed debate from a wider range of stakeholders who all have access to the evidence.

6 d) What might be the benefits of more widespread sharing of data for other social benefits?

More diverse contributions to scientific research and debate would be possible, including increased public engagement. Members of the public with specific interests would have direct access to high quality scientific information e.g. patient groups who would like access to the latest research data on treatments.

6 e) What might be the benefits of more widespread sharing of data for innovation and economic growth?

The opportunities to increase the efficiency and productivity of reasearch discussed in Q6a could lead to an acceleration of innovation.

6 f) What might be the benefits of more widespread sharing of data for public trust in the processes of science?

The recent climate data controversy led to concern among the public about the management and availability of scientific data. To get to the stage where an FOI is required to retrieve the output of publicly funded research does not reflect well on the transparency and accessibility of science and thus, to some people, its credibility.

Widespread sharing of data, particularly in areas of great public interest, should reduce these concerns.

7. How should concerns about privacy, security and intellectual property be balanced against the proposed benefits of openness?

Privacy of research subjects should be prioritised as per Q2d. This is particularly pertinent to medical trials and informant-based social sciences and discussions on how best to approach data release are ongoing in these fields.

Security may be a valid reason for not publishing data but must be justified and will need to be examined on a case by case basis.

8. What should be expected and/or required of scientists (in companies, universities or elsewhere), research funders, regulators, scientific publishers, research institutions, international organisations and other bodies?

A common vision is required to enable the necessary technologial and cultural changes for widespread sharing of scientific information in the manner discussed above to be realised. There is already a topdown push from research funders e.g. a group of 17 health research funders have released a joint statement on data sharing [1] and publishers are beginning to offer more opportunities for publishing datasets (see Q3). More such organisations should be expected to join in the positive promotion of data sharing. This will enable a scaling up of efforts as more scientists are required to share data in order receive funding or a journal publication.

From scientists themselves there are various grass roots level projects. Many initiatives are listed in Q3. Additionally, the Open Data in Science Working Group at the Open Knowledge Foundation [2] is a community of scientists who aim to promote open data in science and build apps, tools and datasets to allow people to easily publish, find and reuse scientific data. The realisation that the extra work required to publish useful shared data is worth the effort may take some time. The recent winners of the 2011 BMC Open Data Award admitted that “credit..must go to persistent, anonymous referee.., who demanded—twice—that we also publish the background data” [3].

Scientists should not only be required to adhere to data sharing policies set down by organisations but should be expected to give thought to managing all of their data and how they might share it in a way that maximises its impact. This will become more likely as tools make sharing easier and incentives are provided in the form of recognition and rewards for data sharing.

[1] http://www.wellcome.ac.uk/About-us/Policy/Spotlight-issues/Data-sharing/Public-health-andepidemiology/ WTDV030689.htm [2] http://science.okfn.org/ [3] http://blogs.openaccesscentral.com/blogs/bmcblog/entry/on_the_unbearable_lightness_of

Other comments:

In many cases the costs of sharing data are negligible, but in some cases there may be an argument that the cost of effective sharing is too high compared to the potential for reuse e.g. some raw datasets are out of date and not easily digitised. Others are impossibly large e.g. primary data from the LHC or image data from next-gen sequencing machines. In this case, it would be acceptable to share only what data is available and deemed useable e.g. summary data.

However, cost barriers are reducing over time and in the future it may be possible to publish datasets that are currently unfeasible, so effective data management is essential to preserved as much output of research as possible.

Panton Fellowships

February 13, 2012 in Panton Principles, Research

The following post is by Laura Newman, a Community Coordinator at the Open Knowledge Foundation and Coordinator of the Panton Fellowships. It has been cross-posted from the OKFN main blog.

Funding for scientists who promote open data in science.

  • Panton Fellows receive £8,000 over one year
  • A small discretionary budget for travel and related expenses may also be available.
  • How would you promote open data in science?
  • See the Panton Principles’ website for full details and how to apply.

Details

The Panton Fellowships are designed to support scientists who promote open data. Following our previous announcement, this is a reminder that the deadline for applications is Friday 24th February

  • The Panton Fellowships are for scientists who promote open data in science.
  • Panton Fellowships are designed to be flexible, and there is scope for Fellows to carry out a wide variety of activities. Applicants are encouraged to propose their own work plan.
  • Panton Fellows may wish to initiate discussion about the role and value of openness, explore practical solutions for making data open, and push for change in scientific practices.

Panton Fellowships are open to all applicants, and are particularly suited to graduate students and early-stage career stage scientists.

Please Note: Panton Fellowships are not full-time positions and are not intended to cover full economic costs. Fellows will continue to work and/or study at their current institution for the duration of the Fellowship. You should ensure that you have permission from all relevant employers/funders.

Why?

We firmly believe that open data means better science. Panton Fellowships were created in order to support scientists who are interested in open data, particularly whilst they are launching their career. The scheme is overseen by a distinguished Advisory Board, which includes:

  • Dr Rufus Pollock – Co-founder and Director of the Open Knowledge Foundation
  • Dr Peter Murray-Rust – Emeritus Reader of Chemistry at Cambridge
  • Dr Cameron Neylon – Senior Scientist in Biomolecular Sciences at the ISIS Neutron Scattering facility
  • John Wilbanks – Senior Fellow at the Ewing Marion Kauffman Foundation
  • Dr Tim Hubbard – Representative of Bioinformatics at the Sanger Institute

Dr Cameron Neylon commented  on the ‘real potential’ of the Fellowships to influence practice  surrounding open data in the scientific community.

‘Panton Fellowships  will allow those who are still deeply involved in research to think closely about the policy and technical issues surrounding open data’, said Dr Neylon. By allowing scientists the scope both to explore the ‘big picture’ and also to work on specific technical solutions to individual problems, the Panton Fellowships have the potential to make a real impact upon the practice of open data in science.

How to Apply

Full details on how to apply can be found at the Panton Principles website.

  • Applicants should send a CV and covering letter to jobs [@] okfn.org by 24th February explaining as a Panton Fellow, what you would do, make or change.
  • To be eligible, applicants should have the relevant rights to work in the UK, and reasonably expect to be working and/or studying in the UK until March 2013.
  • For further details, see the website

Panton Principles in Portuguese

December 16, 2011 in Panton Principles

We are pleased to announce that the Panton Principles for Open Data in Science are now available in Portuguese!

Many thanks to Vinicius Siqueira for translating, Carolina Rossini and Everton Zanella Alvarenga for proof reading.

The open data in science working group look forward to working closely with the OKFN Brazil chapter in the future!

Ross Mounce on “Open Palaeontology” @OKCon Berlin 2011

August 11, 2011 in Collaborations, OKCon, Panton Principles

The following is a guest blogpost by Ross Mounce, currently a PhD writing on “The Importance of Fossils in Phylogeny” at the University of Bath, in UK. As his approach includes application of informatics techniques to palaeontological data, Ross’s research interests are also oriented towards Openness in Data, Access and Science. Ross attended the Open Knowledge Conference in Berlin, 2011, where he gave a talk on Open Palaeontology.

Ross Mounce:

“A few weeks ago, I gave a talk at the Open Knowledge Conference 2011, on ‘Open Palaeontology’ – based upon 18 months experience as a lowly PhD student trying, and mostly failing to get usable digital data from palaeontological research papers. As you might well have inferred already from that last sentence; it’s been an interesting ride.

The main point of my talk was the sheer stupidity/naivety of the way in which data is supplied (or in some cases, not at all!) with or within research papers. Effective science operates through the accumulation of knowledge and data, all advances are incremental and build upon the work of others – the Panton Principles probably sum it up far better than I could. Any such barriers to the accumulation of knowledge/data therefore impede the progress of science.

Whilst there are numerous barriers to academic research (access to research papers being perhaps the most well-known and well-publicised), the issue that most aggravates me, is not the access to these papers, but the actual papers themselves – especially in the digital context of the 21st century. They are only barely adequate (at best) for communicating research data and this is a major problem for the future legacy of our published work… and my research project.

My PhD thesis title is quite broad: ‘The Importance of Fossils in Phylogeny’. Given this title and (wide) scope, I need to look at a lot of papers, in a lot of different journals, and extract data from these articles to re-analyse; to assess the importance of fossils in phylogeny; to place them on a meta-scale. There are long established data formats for the particular type of data I wish to extract. So well established and easy to understand there’s even a Wikipedia page here describing the most commonly used data format (nexus). There exist multiple databases set aside specifically to host this type of data e.g. TreeBASE and MorphoBank. Yet despite all this standardisation and provisioning for paleomorphological phylogenetic data – far less than 1% of all data published on, is actually readily-available in a standardised, digital, usable format.

In most cases the data is there; you just have to dig very very hard to release it from the pdf file it’s usually buried in (and then spend unnecessary and copious amounts of time, manually reformatting and validating it). See the picture below for a typical example (and yes, it is sadly printed sideways, this is a common and silly practice that publishers use to inappropriately squeeze data matrices into papers): 7BHO

I hope you’ll agree with me that this is clearly absurd and hugely inefficient. As I explain in my presentation (also available below this post) the data, as originally analysed/used, comes in a much richer, more usable, digital, standardised format. Yet when published it gets stripped of all useful metadata and converted into a flat, inextricable and significantly obfuscated table. Why? It’s my belief that this practice is a lazy unwanted vestigial hangover from the days of paper-based (only) publishing, in which this might have been the only way in which to convey the data with the paper. But in 2011, I can confidently say that the vast majority of researchers read and use the digital versions of research papers – so why not make full and proper use of the digital format to aid scientific communication? I argue, not to axe paper copies. But to make sure that digital versions are more than just plain pdf versions of the paper copy, as they can and should be.

With this goal in mind, I set about writing an Open Letter to the rest of my research community to explain why we need to richly-digitise our published research data ASAP. Naturally, I wouldn’t get very far just by myself, so I enlisted the support of a variety of academic friends via Facebook, and (inspired by OKFN pads I’d seen) we concocted a draft letter together using an Etherpad. The result of this was a fairly basic Drupal-based website that we launched http://supportpalaeodataarchiving.co.uk/ and disseminated via mailing lists, Twitter, Academia.edu as far and wide as we possibly could, hoping just hoping, that our fellow academics would read, take note and support our cause.

Surprisingly, it worked to an extent and a lot of big names in Palaeontology signed our Open Letter in support of our cause; then things got even better when a Nature journalist (Ewen Callaway) got interested in our campaign and wrote an article for Nature News about it, which can be found here. A huge thanks must go to everyone who helped out with the campaign, it has generated truly International support, as can be demonstrated on the map below:

(View Open Letter Signatures in a larger map)

It’s far too soon to know the true impact of the campaign. Journal editorial boards can be very slow to change their editorial policies, especially if it requires a modicum of extra effort on the part of the publisher. Additionally, once the editorial policy does change at a journal, it can only apply to articles submitted from henceforth and thus articles already in the submission pipeline don’t get affected by any new guidelines. It’s not uncommon for delays of a year between submission and publishing in palaeontology, so for this and other reasons, I’m not expecting to see visible change until 2012, but I think we might have helped get the ball rolling, if nothing else… The Paleontological Society journals (Paleobiology and Journal of Paleontology) have recently adopted mandatory data submission to the Dryad repository, and the Journal of Vertebrate Paleontology has also improved their editorial policy with respect to certain types of data, but these are just a few of many journals that publish palaeontological articles. I’m very much hoping that other journals will follow suit in the next few months and years by taking steps to improve the way in which research data is communicated, for the good of everyone; authors, publishers, funders and readers.

Below you can find the Prezi I used to convey some of that (and more) at OKCon 2011. Huge thanks to the conference organisers for inviting me to give this talk. It was the most professionally run conference I’ve ever been to, by far. If the conference is on next year – I’ll be there for sure!” Ross Mounce

The invited talk, given on Friday 1st July 2011 at the Open Knowledge Conference (Berlin) by Ross Mounce: Open Palaeontology on Prezi

Panton Principles Banner on BioMedCentral

June 29, 2011 in Panton Principles

In order to spread the message that ‘Open Data Means Better Science’, BioMedCentral have kindly hosted a web banner advertising the Panton Principles.

The original file and a screenshot of it in all its glory on BMC may be found below! Hopefully, this will direct more people to read and support the Panton Principles, which have already been publicly by more than 150 people.

Panton Discussion – Richard Poynder

February 12, 2011 in Panton Principles

The first Panton discussion, held at the Panton Arms between Richard Poynder, Peter Murray-Rust, Jordan Hatcher, Rufus Pollock and Alberto Sicilia, is now online here.

Check out the audio recording and full transcript for a wide ranging discussion on open data and science. You can read more about the discussion and the involvement of working group members at Richard Poynder’s and Graham Steel’s blogs.

Please create an account to get started.

#openscience on Twitter

PeroMHCPeroMHC: RT @recology_: .@ethanwhite @PLOS & why not use @disqus comments for easy login? @elife does #openscience
1 week ago from web
recology_recology_: .@ethanwhite @PLOS & why not use @disqus comments for easy login? @elife does #openscience
1 week ago from TweetDeck
ethanwhiteethanwhite: .@PLOS if you want to encourage more post-publication discussion I'd recommend notifying all authors of new comments. #openscience
1 week ago from web
neuinfoneuinfo: check out @LitRoost - emerging science 2.0 - "A free research platform for geeks and #OpenScience" @ http://t.co/Kd3wWAqOh2
1 week ago from HootSuite