Sunday, April 25, 2021

Scholarly articles in @Wikidata and its link with #Fatcat

 "Dam-site selection by beavers in an eastern Oregon basin" is a scholarly paper. It has a DOI that is not functional, it is known to Researchgate and consequently we can find a PDF of the paper. 

This paper is cited by a paper I have an interest in. I am adding its references into Wikidata (this is its Scholia) and I added the "Dam-site selection" paper to make a complete reference.  The result is something like this.

The PDF of the paper is available for download and as is to be expected, the Way Back machine of the Internet Archive has a copy as well for the URL of the download page. And then there is Internet Archive's Fatcat

"Fatcat is a versioned, publicly-editable catalog of research publications: journal articles, conference proceedings, pre-prints, blog posts, and so forth. The goal is to improve the state of preservation and access to these works by providing a manifest of full-text content versions and locations."

Wikidata is used as a source of Fatcat and as I include an additional paper, it will at some stage be picked up by Fatcat. If there is one thing to wish for, it is a function where entering a Wikidata Qid will trigger Fatcat to update its data based on the Wikidata info. If I can have two more wishes, one would be an icon for Fatcat. The second would be a Fatcat identifier making it easy to link from Wikidata to Fatcat.
Thanks,
        GerardM

2 comments:

Klaas 'z4us' van B. V said...

Link to 'DOI' does not work :(

Bryan Newbold said...

Hi GerardM!

Having a simple "click to import/update from Wikidata" feature is a great idea, both for papers and for journals. The goal is definitely to minimize duplicate human editing. Having a system where the norm is for contributors to edit on wikidata, and then auto-import in to fatcat, makes a lot of sense to me.

There are identifiers in Fatcat for both "works" and "releases" (a version/edition of a work). The intent is not for these is not to supplant DOIs or other public identifiers, but they can be useful for things like linking between databases like Wikidata. It is also possible to try an "lookup" a Fatcat release by wikidata QID:

https://fatcat.wiki/release/lookup

As you mention, the Wayback Machine has a copy of this paper from the BYU site. I always find it interesting to see why there was a capture:

http://web.archive.org/web/collections/20210715000000*/https://scholarsarchive.byu.edu/cgi/viewcontent.cgi?article=2103&context=gbn

In this case, there have been at least 3 captures. BYU themselves have a paid archive-it.org account; Microsoft Academic included a URL in their graph ("MAG") dataset, which Internet Archive crawled as part of scholarly preservation efforts; and the paper was linked from Wikipedia so there is an additional crawl from the IA/Wikipedia outlink crawling project.