Here's a list of bands in Wikidata that are missing MusicBrainz links

musicbrainz
wikidata

#1

Here’s the list, and here’s the Wikidata query that generated it. It lists bands (not solo artists) with a Discogs link on file, but no MB artist ID. I chose this query because the set isn’t super huge (446 bands right now), and because the Discogs link can help with identification—especially for bands that only have a Wikipedia page in languages that you don’t speak.

Want to help? You don’t need a Wikimedia account. Here’s how to add an MBID:

  1. Click the link in the “band” column.
  2. In the new tab that opens, there are links to the artist’s Wikipedia and Discogs pages. Try to find the artist in MusicBrainz.
  3. If you find the band, copy the MBID from the URI for their artist page.
  4. In the Wikidata page, click “add statement”.
  5. Click on the “property” field. Select “MusicBrainz artist ID”.
  6. Paste the MBID (not the whole URI) into the next field.
  7. Click “publish”.
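
Steps 3 and 6 depend on separating the MBID from the rest of the URI. Here is a minimal Python sketch of that step, assuming the artist page URI contains the MBID as a standard UUID (as in `https://musicbrainz.org/artist/<MBID>`). The helper name `extract_mbid` is made up for illustration.

```python
import re
from typing import Optional

# An MBID is a UUID: 8-4-4-4-12 hexadecimal digits.
MBID_PATTERN = re.compile(
    r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}",
    re.IGNORECASE,
)


def extract_mbid(uri: str) -> Optional[str]:
    """Return the first MBID found in a MusicBrainz URI (lowercased), or None."""
    match = MBID_PATTERN.search(uri)
    return match.group(0).lower() if match else None


# Placeholder UUID, not a real artist.
print(extract_mbid("https://musicbrainz.org/artist/01234567-89ab-cdef-0123-456789abcdef"))
```

The value it returns is what goes into the “MusicBrainz artist ID” field in step 6.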

To see bands that have neither MBID nor Discogs ID on file, I use this query. I set a limit of 200 results, but as of now there are more than 38,000 bands in Wikidata that have no MBID on file.
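
The query itself is only linked, so the following is a guess at its general shape, not the exact query. It is a hypothetical Python helper that assembles SPARQL for items that are an instance of band (Q215380) and have neither a MusicBrainz artist ID (P434) nor a Discogs artist ID (P1953), with a configurable limit. It matches only the plain “instance of band” statement and ignores subclasses.

```python
def build_missing_ids_query(limit: int = 200) -> str:
    """Build SPARQL for bands with neither an MBID nor a Discogs ID (a sketch)."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    # Doubled braces are literal braces inside an f-string.
    return f"""
SELECT ?band ?bandLabel WHERE {{
  ?band wdt:P31 wd:Q215380 .
  FILTER NOT EXISTS {{ ?band wdt:P434 ?mbid . }}
  FILTER NOT EXISTS {{ ?band wdt:P1953 ?discogs . }}
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" . }}
}}
LIMIT {limit}
""".strip()


print(build_missing_ids_query(200))
```

The printed text can be pasted into the Wikidata Query Service editor.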


#2

Thanks for providing this.
I’ve tried to use the Wikidata Query Service, but I still don’t know how to use SPARQL well enough.
It might be an interesting project to query Wikidata for links that they have and we don’t.
Wikidata can have social media accounts and links to other databases, so there is probably a lot of information they have that we don’t.


#3

Oh sure, there are so many things that can be done. Have you tried @loujin’s Wikidata-to-MB import script? (I only just learned that there’s also an MB-to-Wikidata bot.)


#4

My name starts with J, so I did the listings starting with J.


#5

There is just a little problem with the “instance of” filter:
A band is Q215380, but e.g. a rock band is Q5741069, so rock bands are not included in this list.


#6

Oops, you’re right. Do you know of an efficient way to include all subclasses? When I tried the WHERE clause p:P31/ps:P31/wdt:P279* wd:Q215380, the query was too expensive and it timed out.
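
To make the intent of such a path concrete, here is a small Python sketch of what “one instance-of step, then zero or more subclass-of steps” means, run on in-memory dictionaries rather than on Wikidata. The function name and the toy data are illustrative. In particular, the subclass link from rock band (Q5741069) to band (Q215380) is an assumption made for the example.

```python
from collections import deque


def is_instance_of(item, target, instance_of, subclass_of):
    """Mimic `item P31/P279* target` on small in-memory dicts."""
    queue = deque(instance_of.get(item, ()))  # exactly one "instance of" step
    seen = set(queue)
    while queue:
        cls = queue.popleft()
        if cls == target:
            return True
        for parent in subclass_of.get(cls, ()):  # zero or more "subclass of" steps
            if parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return False


# Toy data: one item typed as "rock band", and an assumed subclass link.
instance_of = {"Q2090628": ["Q5741069"]}
subclass_of = {"Q5741069": ["Q215380"]}

print("Q215380" in instance_of["Q2090628"])                               # direct match only
print(is_instance_of("Q2090628", "Q215380", instance_of, subclass_of))   # with subclasses
```

The first check corresponds to the plain “instance of band” filter and misses the item. The second walks up the class hierarchy and finds it.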

To be clear, such a query would also not be 100% comprehensive since it doesn’t catch bands that have no “instance of” predicate. We could catch some of those by cross-referencing DBpedia. Maybe then the only bands missing would be those that have no infobox and no band-related Wikipedia categories?


#7

No, I didn’t know anything about Wikidata queries before I saw yours here.

It’s still better to execute this query for each category separately than to have no query at all :wink:

I was just a bit confused when I saw that the Norwegian metal band Virus
https://www.wikidata.org/wiki/Q4014731 (an instance of “band” from your query)
has no Wikipedia page linked to its Wikidata item, yet I found a Wikipedia page for it that is linked to
https://www.wikidata.org/wiki/Q2090628 (an instance of “rock band”).
So I tried your query with “rock band” and found some artists that were not included in the other list.


#8

Nice find. It is possible to merge duplicates, but for that you need an account (Virus is already merged now).

Also, thanks (to you and everyone else) for adding MBIDs! :metal:


#9

Very cool, thanks for this!
I’ve also noted a whole bunch of entries hiding behind Q43229 “Organization”, probably because of a botched batch job :wink:
Some more stuff is hiding behind these instances (nowhere near complete):








#10

wdt:P31/wdt:P279* is the standard way to do that, so this query


#11

Just a reminder for editors who add MBIDs to Wikidata:
Please check carefully whether the artist is really the same.
Just because only one artist with this name exists in MusicBrainz
doesn’t necessarily mean it is the correct one.

(Somebody had linked a Dutch metal band in Wikidata to a Danish metal band
with the same name in MusicBrainz.)


#12

Cool, thanks. I updated the queries and stats in the original post. Fortunately, the expanded result sets are only slightly bigger.


#13

I have a modified query that only returns bands with an English Wikipedia page.

SELECT ?band ?bandLabel ?Discogs_artist_ID WHERE {
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
  ?band wdt:P31/wdt:P279* wd:Q215380.                    # band, or any subclass of band
  ?band wdt:P1953 ?Discogs_artist_ID.                    # has a Discogs artist ID
  ?article schema:about ?band .
  ?article schema:isPartOf <https://en.wikipedia.org/>.  # has an English Wikipedia article
  FILTER ( !EXISTS { ?band wdt:P434 ?MusicBrainz_artist_ID. })  # no MusicBrainz artist ID
}
LIMIT 50
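
To run a query like the one above outside the web interface, something along these lines might work. It is a sketch using only the Python standard library against the public endpoint at `https://query.wikidata.org/sparql`. The function names and the User-Agent string are placeholders of my own, and the result handling assumes the standard SPARQL JSON results layout.

```python
import json
import urllib.parse
import urllib.request

ENDPOINT = "https://query.wikidata.org/sparql"


def run_query(sparql: str) -> dict:
    """Send a SPARQL query to the Wikidata Query Service and return parsed JSON."""
    url = ENDPOINT + "?" + urllib.parse.urlencode({"query": sparql, "format": "json"})
    # Placeholder User-Agent; replace it with something that identifies you.
    request = urllib.request.Request(url, headers={"User-Agent": "mbid-gap-example/0.1"})
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)


def rows(result: dict, *variables: str) -> list:
    """Flatten SPARQL JSON results into tuples; unbound variables become None."""
    return [
        tuple(binding.get(name, {}).get("value") for name in variables)
        for binding in result["results"]["bindings"]
    ]
```

For example, `rows(run_query(query_text), "band", "bandLabel", "Discogs_artist_ID")` would give one tuple per band, where `query_text` holds the SPARQL as a string.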

#14

I’ve gotten the list of Wikidata entries that have a Discogs URL down to a small list, so I’ve started to look at other identifiers:

The below are entries that have an ISNI number but no MusicBrainz ID.

The below have a VIAF ID.


#15

You rock, @dns_server! Thanks to you and all the other contributors, more than 300 records from the Discogs list have been updated!


#16

So anyway, I guess the Finns have never heard of MusicBrainz. :grin: