Archive for the 'Open Access' Category

Journals changing publisher, but can the rights change as well?

Journal cover Animal ConservationFrom a perspective as a repository manager I do like the Cambridge University Press journals a lot. Albeit no immediate OA, after a year the author is allowed to post the publisher’s version/PDF of his article in an institutional repository. We adhere to this policy on behalf of our authors. So we post all the publications in Cambridge journal articles to our repository after this 12 month embargo period. Sounds simple and it actually is that simple.

Recently I ran a check on this policy using the DOI’s, rather than the ISSN’s we normally use, on the metadata we collect of all our researcher’s publications. The DOI string of Cambridge journal articles all start with the prefix 10.1017. I came across an article published in the journal Animal Conservation from 2004 which was not OA on our repository. Further checking this article I found out that the DOI of the article resolved to the Wiley Online library, where the article came online only in 2006, instead of the Cambridge website. Rather odd. Checking the copyright and archiving policy of this journal at the Sherpa Romeo site, they referred to the rather limited Wiley copyrights and self archiving possibilities for this journal. Sherpa Romeo implies that this is applicable for all content of this journal. I was rather disappointed.

However, that Cambridge DOI bothered me, so I checked the Cambridge site for the journal and could find the article there as well. The DOI however, resolves to the Wiley online journals site. Clearly the journal changed from publisher, that happens all the time. But on changing from publisher it appears that the authors’ copyrights changed as well. Especially since the Wiley site also hosts the complete backfile going back to Volume 1, issue 1. That the authors self archiving rights changed on change of publisher for the journal can’t be the case because they’re based on the original publishing agreement, but the Wiley site and Sherpa Romeo do imply that the Wiley copyright and self archiving policies apply to all content of the journal. That can’t be true, can it? But here I have an article hosted at two publishers websites with two very different self archiving policies.

Of course we adhere to the Cambridge self archiving policy for this article. There is therefore now a third copy copy of this article available on the Web, proudly presented in Wageningen Yield.

These are strange ways of publishers and copyrights.

The unofficial guide for authors

Recently I co-authored a book on scientific publishing. It is available from LuLu for less than € 6,-. When that’s too much for you, you can download it for free. The book is published under a CC-BY-NC licence.

From the cover:

Most scientific journals provide guidelines for authors - how to format references and prepare artwork, how many copies of the paper to submit and to which address. However, most official guidelines say little about how you should design and produce your paper and the chances that it will be accepted. This book provides a comprehensive but focused guide to producing scientific information - from research design to publication. It provides practical tips and answers to some of the most frequently asked questions: Why do we publish in the first place? What is OA publishing and why bother about it? What is the h-index? What is a Journal Impact Factor and does it matter? How can I increase my research production efficiency? Why should I use OS software tools for academic work? How can I produce graphics that will impress? How can I brainstorm good titles? How can I select a suitable journal and where can I find out more about it? How can I get into the reviewers’ heads?

How Google could help the Open Access world a little

It was back in 2008 when Google Scholar launched the feature that identified free available versions of articles of the Web. In the early days these were indicated by green triangles in front of the reference. Nowdays free available copies are listed in the right hand column. Many of these versions are Open Access versions of articles properly submitted to preprint servers and subject or institutional repositories. Other free versions of the papers identified by Google Scholar are publishers versions of articles posted to personal websites, dropboxes and you name it. Whatever the rights are, if you need a copy of these papers, and don’t have access through your universities library subscriptions, this Google Scholar feature is a very useful tool. In scholarly search classes I always stress this very useful feature of Google Scholar to my students.

In our institution’s bibliography I would love to include a functionality to refer for each article to the so called document clusters in Google Scholar. Consider the following publication the link to the full text included in the record leads you to Science Direct. Whether you can access the paper on SD, depends on the subscriptions. Sometimes you can’t. Therefore it would be nice if we could include a link to the document cluster in Google Scholar. For this paper you get some 29 versions of the paper, but above all 6 of these are free versions of this paper posted on various websites. That’s really helpful.

In AgrisWeb, I learned from Johannes Keizer yesterday, that they link to Google trough a search for the title words. This works quite well, but it could be done better.

Consider the idea that Google Scholar had an API. If we could query that API on the basis of the DOI or PMID or ISSN in combination with volume, issue and pages or any other combination of standard bibliographic metadata. Yes, something like an openURL. And GoogleScholar would only return the correct Google Scholar ID for that article -that number 12564475196117890153 in the link- we could construct various links. Linking to the Google Scholar document cluster is one. Retrieving the Google Scholar citations is another.

Google doesn’t like metadata too much is an often heard argument. But the Google Books API works swell with ISBN numbers, OCLC numbers or LOC numbers. That API is talking metadata. Libraries a massive stores of metadata. So Anurag Acharya please. The pleas for a Google Scholar API are abound. Mostly for retrieval of citations, but for the OA movement those document clusters are really more important! Perhaps you could launch this Google Scholar API as a present for the Open Access week coming up in October?

National Library of the Netherlands discloses its Google Books Contract

After the successful disclosure of the agreement between the British Library and Google Books on the basis of the Freedom of Information Act, the National Library of the Netherlands (KB) also disclosed their agreement with Google Ireland today. Albeit the director of the KB tweeted a day ago that not all public information needed to be available on the Web, it was decided to publish the agreement on the Web since there were two WOB (a Dutch version of FOIA) procedures underway to get insight in the agreement.

Albeit I am not a lawyer, a few thins caught my eye. The agreement is very similar to the agreement between Google and the British Library. Bert Zeeman pondered the idea of standard Google contracts in this respect. This seems to go for the exception of the number of volumes in the public domain that will be digitized, 250,000 in the UK and 160,000 in the Netherlands (clause 2.1).

What struck me as interesting was the use of the libraries digital copies, clause 4.8 “the library may provide all or any portion of the library digital copy… to (a) academic institutions or research or public libraries, ….” But we are not able to “providing search or hosting services substantially similar to those provided by Google, including but not limited to those services substantially similar to Google book search”. I guess that leaves out the other academic libraries in the Netherlands to include these digital copies in their discovery tools. It is tempting, but I see problems on the horizon. We seem to be left with separate information silos whereas integration with the rest of the collection would be really interesting. It becomes more explicit in clause 4.9 where it is stated that “nothing in this agreement restricts the library from allowing Europeana to crawl the standard metadata of the digital copies provided to library by Google.” We would be more interested in the data rather than the metadata.

But then again, it is up to the lawyers to see what’s allowed and what’s not. But then again, again, after fifteen years all restrictions on the use or distribution terminate (clause 4.7), a bit long according to the open rights group. However, we have experience with building academic library collections, it takes ages. Those fifteen years are over in the wink of a young girl’s eye.

The Impact Factor of Open Access journals

In the world of Open Access publishing the golden road has received a great deal of attention. At least this is what our researchers seem to remember. Of course there are other roads to open access, but I want to present the impact factors of the journals facilitating the golden road to open access. This blogpost lists all open access journals included in DOAJ and assigned an Journal Impact Factor in the JCR 2009. The reason for this, is that our researchers see publishing in open access journals as the simplest way of achieving open access to their work, but on the other hand they are required for judgement of the citation impact that they publish in journals covered by Web of Science and therefore the Journal Citation Reports (JCR).

In the past there have been studies on citation impact of the open access journals that have actually received a journal impact factor from Thomson Reuters Scientific (formerly ISI). The first was by (McVeigh 2004) followed by (Vanouplines and Beullens 2008) (in Dutch, and not openly accessible) and recently by (Giglia 2010). These consecutive studies showed an increasing number of open access journals that received an Journal Impact Factor from Thomson Reuters. McVeigh reported 239 OA journals for the JCR 2004, Vanouplines reported 295 OA journals for the JCR 2005 and Giglia reported 385 OA journals for the JCR 2008 (there are some methodological issues that make these figures not entirely comparable).

The pitfall of these studies is that although they showed interesting figures and additional analyses, none of these studies actually published the list of open access journals that received an impact factor. The sole purpose of this blogpost is to publish this actual list. The probable reason for the previous authors is that the impact factors are proprietary information from Thomson Reuters. You are not allowed to publish these figures. On the other hand most publishers, use it in all their marketing outings for their journals. So the journal impact factor is virtually information in the public domain.

To avoid any intellectual property problems with Thomson Reuters I have included the ScimagoJR and Scopus SNIP indicator for the journals rather than the Journal Impact Factor. The correlation for this set of journals between SNIP and IF was 0.94 and between SJR and IF was 0.96. In total 619 journals from DOAJ were present in the JCR 2009 report (Science and Social Science & Humanities version deduplicated). The growth in journal coverage is due to the growth in OA journals and the significant expansion of journal coverage in 2008. On the other hand looking at the journal list of Scopus indexed journals I note that they include some 1365 journals open access journal which have a ScimagoJR or SNIP.

For the current table I matched the journal list from DOAJ downloaded on December 13th 2010, with the deduplicated list of the JCR 2009 indexed journals. This journal set of 619 journals was matched against the journal list from journalmetrics.com to include the ScimagoJR 2009 and SNIP2009 as well. For each journal the subject categories indicated by DOAJ were included. The journals were sorted alphabetically on subjects and descending IF within a subject. For the following table journals with multiple subject assignments in DOAJ were included in their different categories as well. This expanded the list to 782 lines. Finally the column with impact factors was removed, showing only the ScimagoJR and SNIP for the journals. A few journals were not assigned a ScimagoJR or SNIP, but these were assigned a Journal Impact Factor. In some cases this was due to differences in journal coverage between Scopus and Web of Science, but in a few cases this appears also the problem of different ISSN assignments by the respective databases.

Download: List of open access journals that are assigned an Impact Factor in the JCR 2009 showing their respective SNIP and ScimagoJR for 2009.

Have fun with this list

References

Giglia, E. (2010). The Impact Factor of Open Access journals: data and trends. ELPUB 2010 International Conference on Electronic Publishing, Helsinki (Finland), 16-18 June 2010. http://dhanken.shh.fi/dspace/bitstream/10227/599/72/2giglia.pdf and http://hdl.handle.net/10760/14666.

McVeigh, M.E. (2004). Open Access Journals in the ISI Citation Databases: Analysis of Impact Factors and Citation Patterns A citation study from Thomson Scientific, Thomson Scientific. http://science.thomsonreuters.com/m/pdfs/openaccesscitations2.pdf

Vanouplines, P. & R. Beullens (2008). De impact van open access tijdschriften. IK Intelectueel Kapitaal 7(5): 14-17. (In Dutch, Not OA available)

Possibly related posts
Another expansion of journal coverage by Thomson

Open Access: Just Publish

I do sincerely apologize for this boring video, a few talking heads is not the right medium to pass a message. An important message that is. But I couldn’t find any palatable alternatives on YouTube. Has nobody tried to make an attractive, short film on this subject?Anyway, a couple of bigshots from the Dutch University world passing the message on the importance of Open Access. They talk in Dutch, but this version has English sub-titles.

And so does science grinds to a halt

Copyright and Creative Commons for an article in Studies in Mycology

This morning I had to look up the citations to an article. It did no show up in WoS immediately so I had to look a bit around to trace it’s exact details. I found the article as an open access article on Highwire. No problem.

However, I was struck by the extensive and confusing copyright statements at the top of the abstract. On the first line is has the classic copyright sign © which indicates to me “all rights reserved” in this case to the CBS fungal biodiversity Centre. But the all rights reserved sign was followed immediately with their own worded Creative Commons license. CC 3.0 in this case.

I was little bemused by the third clause “”You may not alter, transform, or build upon this work”. Isn’t that what science is all about? Building on previous work?

Another annoying fact is that the DOI is not working.  But this is the link to the abstract, there are plenty of similar examples in this “Studies in Mycology” to be found.

The changing face of Elsevier Science

The last couple of days I had the pleasure to attend the Elsevier Development Partners meeting. The exact products they are working on might be of interest to some people, but that’s up to Elsevier to announce. But what was really the big surprise at this meeting -which lasted 3 days- was the tone from Elsevier. It was all about open Science. They clearly wanted to open up. There was a lot of talk about sharing information, making mash-ups possible, Application programming Interfaces (API). Elsevier Science wanted to move away from the double barred information silo to become an open solution provider in the scholarly world. If Elsevier is thinking and acting in this direction, then change will become a major issue for the entire scientific publishing industry and that is good news for libraries who want to remain a vital service in the future as well.

This change will take time. It doesn’t happen overnight. But Raphael Sidi just announced the other day on his blog the Elsevier Article API at the programmable Web. So, Elsevier is not only talking, they are acting up on it as well.

Let other publishers follow this example!

October 14th, 2008: Open Access Day

Just a post to support the idea of an International Open Access day. I wonder with Bert Zeeman which Dutch University will take the lead in the Netherlands to organize some sort of event on this subject during October 14th.

Google and the academic Deep Web

Blogging on Peer-Reviewed ResearchHagendorn and Santelli (2008) just published an interesting article on the comprehensiveness of indexing of academic repositories by Google. This article triggers this me to write up some observations I was intending to make for quite some time already. It addresses the question I got from a colleague of mine, who observed that the deep web apparently doesn’t exist anymore.

Google has made a start to index flash files. Google has made a start to retrieve information that is hidden behind search forms on the web, i.e. started to index information contained in databases. Google and OCLC exchange information on books scanned, and those contained in Worldcat. Google so it seems has indexed the Web comprehensively with 1 trillion indexed webpages. Could there possibly be anything more to be indexed?

The article by Hagendorn and Santelli shows convincingly that Google still has not indexed all information that is contained in OAISTER, the second largest archive of open access article information. Only Scientific Commons is more comprehensive. They tested this with the Google Research API using the University Research Program for Google Search. They only checked whether the URL was present. This approach only partially reveals some information on depth of the Academic Deep Web. But those are staggering figures already. But reality bites even more.

A short while ago I taught a Web Search class for colleagues at the University Library at Leiden. For the purpose of demonstrating what the Deep or Invisible Web actually constitutes I used and example from their own repository. It is was a thesis on Cannabis from last year and deposited as one huge PDF of 14 MB. Using Google you can find the metadata record. With Google Scholar as well. However, if you try to search for a quite specific sentence on the beginning pages of the actual PDF file Google gives not the sought after thesis. You find three other PhD dissertations. Two of those defended at the same university that same day, but not the one on Cannabis.

Interestingly, you are able to find parts of the thesis in Google Scholar, eg chapter 2, chapter 3 etc. But those are the parts of the thesis contained in different chapters that have been published elsewhere in scholarly journals. Unfortunately, none of these parts in Google Scholar refers back to the original thesis that is in Open Access or have been posted as OA journal article pre-prints in the Leiden repository. In Google Scholar most of the materials is still behind toll gates at publishers websites.

Is Google to blame for this incomplete indexing of repositories? Hagendorn and Santelli point the finger to Google indeed. However, John Wilkin, a colleague of them, doesn’t agree. Just as Lorcan Dempsey didn’t. And neither do I.

I have taken an interest in the new role of librarians. We are no longer solely responsible for bringing external –documentary- resources from outside into the realm of our academic clientele. We have also the dear task of bringing the fruits of their labour as good as possible for the floodlights of the external world. Be it academic or plain lay interest. We have to bring the information out there. Open Access plays an important role in this new task. But that task doesn’t stop at making it simply available on the Web.

Making it available is only a first, essential step. Making it rank well is a second, perhaps even more important step. So as librarians we have to become SEO experts. I have mentioned this here before, as well as at my Dutch blog.

So what to do about this chosen example from the Leiden repository. Well there is actually a slew of measures that should be taken. First of course is to divide the complete thesis in parts, at chapter level. Albeit publishers give permission only to publish articles, of which most theses in the beta sciences exists in the Netherlands, when the thesis is published as a whole. On the other hand, nearly 95% of the publishers allow publication of pre-prints and peer reviewed post prints. The so called Romeo green road. So it is up to the repository managers, preferably with the consent from the PhD candidate, to tear up the thesis in its parts –the chapters, which are the pre-print or post-prints of articles- and archive the thesis on chapter level as well. This makes the record for this thesis with a number of links to far more digestible chunks of information better palatable for the search engine spiders and crawlers. The record for the thesis thus contains links to the individual chapters deposited elsewhere in the repository.

Interesting side effect of this additional effort at the repository side is that the deposit rates will increase considerably. This applies for most Universities in the Netherlands, for our collection of theses as well. Since PhD students are responsible of the lion’s share of academic research at the University, depositing the individual chapters as article preprints in the repository will be of major benefit to the OA performance university. It will require more labour at the side of repository management, but if we take this seriously it is well worth the effort.

We still have to work at the visibility of the repositories really hard, but making the information more palatable is a good start.

Reference:
Hagedorn, K. and J. Santelli (2008). Google still not indexing hidden web URLs. D-Lib Magazine 14(7/8). http://www.dlib.org/dlib/july08/hagedorn/07hagedorn.html