Linking Name Authorities and Wikipedia Entries

I love the idea of linked data. As a comic book, science fiction, and fantasy geek the relationships between different works, titles, series, and the characters in them are fascinating to me and the idea of being able to explore all those relationships just by following links is basically Nerdvana. I will also confess to having spent entire evenings following link to link to link on TV Tropes  (it’s roughly as addictive as breathing oxygen) and reading Night Court/ Doctor Who crossover fan fiction on Archive of Our Own (yes, it really is a thing).

But I try to read papers on linked data and they start to talk about FRBR and triples and RDF and my eyes glaze over. I just don’t get it very well without seeing it work. But I found a way to at least create links between data that doesn’t require a massive database or knowing SPARQL or having access to paid subscription tools. I don’t think it’s real linked data because it’s one way and there’s not a described relationship, but it’s a start, and it’s something I understand so I’m going to roll with it for now and try to build on this later.

The short version is that there is a really easy way to add Library of Congress Control Numbers (LCCNs, the unique identifiers for Name Authority Records, or NARs) to Wikidata pages, which are in turn linked to Wikipedia pages. So that’s what I’m going to describe here.

The first thing we need to do is find a Wikipedia entry without an LCCN. The LCCNs are shown in the Authority Control template of the Wikipedia page. As a counter-example of what we’re looking for, I chose the Wikipedia page for Weird Al Yankovic. Of course Weird Al’s page has his LCCN listed, because he’s Polka Megastar Weird Al Yankovic. The arrow shows the location of the LCCN, which links to the Name Authority Record. Some Wikipedia pages don’t have the Authority Control template, and that is fixable, but involves editing the page and is beyond what we’re doing here so for now we’re just looking at pages with that template.

weird_al

In keeping with the novelty music theme, I found a Wikipedia page without an LCCN. That page is for Barnes & Barnes, the musical duo best known for their 1978 hit song “Fish Heads” (the video was directed by Bill Paxton, no joke, which is another reason I want linked data to take off because the world needs to know this when they Google Bill Paxton.). As you can see below, there’s a MusicBrainz ID, but no LCCN for Barnes & Barnes on their Wikipedia page. We’re going to change that.

barnes_and_barnes

We need to get the LCCN so we search authorities.loc.gov to find Barnes & Barnes Name Authority Record. Once we find the right record we copy the LCCN, but please make sure you have the right record so you don’t add an incorrect LCCN. In this case the 670 is for Barnes & Barnes 1980 album Voobaha (the reissue includes “The Vomit Song”), so we’ve got the right record. I’m trying to keep this open and accessible to everyone, but if you’re not familiar with NARs and searching authorities it might be wise to get a foundational knowledge of those before trying this. Copy the LCCN (shown here as LC control no: ).

barnes_and_barnes_record.PNG

And then go back to the Barnes & Barnes Wikipedia page and click on the Wikidata item link.

barnes_and_barnes_wikidata_item

This takes you to the Barnes & Barnes Wikidata page, which is where we’re going to add the LCCN we copied earlier. Scroll down to Identifiers, and then click on + add statement at the bottom.

barnes_and_barnes_wikidata_page.PNG

This will open up the box, just type in LCCN and the Library of Congress authority ID will appear. Click on that option.

barnes_and_barnes_LCCN.PNG

Then paste the LCCN you copied earlier into the text box. For this entry we need to (1) delete the space between the n and the 9, since URLs don’t like spaces. If there is no space in the LCCN you copied, you can just paste the number into the box as is. Then (2) click on save. If you’re not logged in, you’ll have to do a captcha to save.

LCCN_added.PNG

And once we’ve saved, we can see the LCCN in the Wikidata page. It’s a link, and you can click on it to make sure you’ve entered the right number (recommended just in case, I really suggest avoiding creating incorrect links). And there it is.

lccn_added.PNG

This won’t appear in the Wikipedia page for Barnes & Barnes right away (it seems to take about 24 hours to refresh, but that’s just a guess), but when it does we get this:

fish_heads_woohoo.PNG

Success! I didn’t add the Worldcat Identies, I suspect that was done automatically when the LCCN was added, but I’m not sure and there may be more people interested in adding links to Barnes & Barnes than I suspect. And now the Wikidata page (and the Wikipedia page through it) is linked to the Library of Congress Name Authorities. The link is only one way (we didn’t create a link from the NAR page to the Wikidata page), and there’s no relationship described so I’m pretty sure it’s not linked data, but it’s at least something I understand and can tinker with for five or ten minutes while I’m waiting on something else.

On the other hand so what, who cares? It’s a fair question. And I don’t know what this accomplishes at the moment, beyond the obvious (anyone who does know, please comment). But it at least connects NARs to a big open database, and hopefully people experimenting with that database will find a use for it. Until then, I’m happy just doing it for the sake of doing it.

If you’re looking to experiment with this yourself, for some reason it seems like a lot of musical groups don’t have their LCCN on their Wikipedia page. And they’re easy to pull out of authorities.loc.gov, just do a keyword authorities (all) search for music group and then pick one and check the Wikipedia page.

Thanks!

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s