Saturday, May 12, 2018

#Wikidata - #Copyright and linked data

There are many points of view when it comes to copyright and data. In the Wikipedia world the discussion is different because each text has its own copyright. Data is different because you can not own ie copyright a separate fact.

When data is open or opened up, it follows that much of the data that exist in multiple sources is identical. When the data is the same, it has two benefits. The first is quality. When multiple sources agree on something, it is more likely to be correct. The second is copyright; whose copyright?

Every now and again, the license used by Wikidata is questioned. Typically by Wikipedians who think they know their stuff. They will be the first to tell you the importance of sources and, indeed many factoids in Wikidata do not have a source. When a factoid is sourced, a statement like John Doe died on Friday, 13th, that factoid only links to the source and hardly to the place where it came to the attention of the person or the bot adding it to Wikidata.

When I add the fact that someone is a member of the Somalian parliament, when a list is used like this one, that information is sourced, there is no added value except for a name being on a list. It has been in the news that in the last year parliamentarians have been murdered, there is no article for them and consequently even in Wikipedia it is only a name on a list, no added value, no arguable reason for copyright.

Value is in the links, it is in knowing the same data to be true in many sources. Claiming copyright, particularly in data, is predatory. It prevents people from bringing facts together. Only when facts are brought together informed knowledge exists. Only in linked data, sourced data, there is a handle on fake facts and fake news.
Thanks,
      GerardM
Post a Comment