What can be done with super polluted AcoustIDs?

I am not super familiar with AcoustIDs but so many under Two Steps From Hell are very messed up and I would like to fix them if possible.

For example Track "b733bff4-bfd4-49ac-bddc-03eb0d0a2706" | AcoustID I think it should be Freedom Fighters (no choir) or Freedom Fighters (they are the same anyway and I am trying to merge them) but it is linked to multiple other recordings and has 4 IDs. I know the wrong track Identity Crime is in there because for some reason one album recording of “Freedom Fighters” was merged with it for whatever reason, which I am currently fixing.

Is there any way to delete a busted AcoustID and start over? That or at least permanently purge the garbage data like dead recordings and wrong tracks? What can be done when one gets super messed up?

4 Likes

Yes, you can delete the obvious busted up AcoustIDs. Just login to the AcoustID server and you’ll get a Delete button.

Top right of that AcoustID page it says “Sign In”. Just click that and login with your Musicbrainz credentials.

You can’t MOVE AcoustIDs, but you can delete. A good clue as to bad acoustIDs is if you see a low count of samples.

It is technically possible to get two tracks submitting the same AcoustID, but I think I have only seen this twice after looking at many thousands of samples.

You will also find that some editors will add a reference to an acoustID at the top of the recording page and a link to some releases it is found on. (Example) Sometimes some heavy remastering can create a new AcoustID. (Example of remastering changes)

Remember also people with vinyl copies submit their acoustIDs. These can come out slightly different

There can be an awful mess with AcoustIDs on some artists. Much of this can come from the noobs who use Picard with 10,000 files… get a messy inaccurate match up that is not checked… and then hit that Submit AcoustID button :scream:

4 Likes

I see progress.

https://community.metabrainz.org/t/large-number-of-disc-ids-toc-for-one-album/694621

There were 18 DiscIDs, now there are 13.

1
|
|
|
2

How can this be technically possible? The fingerprints are absolutely specific to a particular sound recording. Two recordings with different length which are are based on the same recording (a full length and a shortened version) may submit identical fingerprints (if compared) but they go to different acoustIDs because of their different length.

If you have verified an acoustID for your recording, you can safely disable all other recordings linked to it.

@Echelon666 - we are talking AcoustIDs, not DiscIDs here…

Oh I totally agree… but spotted it. And added them to a thread you were in. Just totally forgotten what they were now. :grin:

I could verify them as I owned both tracks. Freaky maths.

It is rare enough that you are pretty safe 99.9999% of the time disabling other matches. Especially when they have a low sample count. But sometimes need to be wary of a badly credited bootleg with an incorrect track list.

1 Like

Popular recordings often have more than one valid acoustID, especially if there are vinyls involved. Different (individual) splits can cause separate acoustIDs and sometimes very worn records can submit fingerprints that are not similar enough for the same acoustID.
You can compare fingerprints of all attached acoustIDs to your verified fingerprint. All not similar can be disabled. Sometimes after such a clean up there are more than 10 remaining that are similar enough (according to your judgment).

2 Likes

Whoever invented the AcoustID audio fingerprint should get a Nobel Prize.

Laliński?

1 Like

It would have to be the Maths Prize.

If you can name two such tracks, please let me know. I would love to listen to them.

If I could remember any of the details I would post them. Long gone from the memory and no idea which thread they went in…

1 Like

I don’t quite trust your memory. It is because of the way fingerprints are calculated. Only the peaks of the frequencies at a time are used - the loudest sounds. If the pattern is similar, it must have come from recordings that sound very similar. Only very quiet sounds can be different.

So in the case of the example I gave what is the better option?

  1. Attempt to narrow down the correct recording and disable all the others.
  2. Disable everything and generate a completely new one.

Also, do AcoustID that match to 0 recordings (from them being disabled) eventually delete themselves, like empty release groups, or do they sit there forever clogging up the space?

Do you mean disable? That’s the only option I get.

No, AcoustIDs exist on their own. They don’t need to be linked to a MB recording.

2 Likes

FWIW and unrelatedly to anything else in here - I seem to remember back in the day the same song performed with the same backing track by two different jpop singers got given the same acoustid. The code might have been made more discerning in the years since though.

1 Like

1 and 2 will end up doing the same thing. Generate a new AcoustID and read the value in Picard. Then look that up and disable the rest.

And yes, when I said “Delete” I meant “Disable”.

Worst case, if you disable something that was correct then it will soon get re-enabled again. Especially if you are doing a heap of work on a specific artist.

My experience has been that even different mixes that sound very similar produce clearly distinguishable fingerprints.

You can’t disable all active acoustIDs of a recording. Yours is only one specimen and there are most often other valid acoustIDs. But if your acoustID is shared with other (different) recordings, you can of course disable them.

My workflow is as follows:

  1. I determine the AcoustID of my version: I usually look for the specific fingerprint by re-submitting it, but all available fingerprints would be similar enough to compare it to others (you can use the acoustIDs returned from a scan).
  2. I compare this fingerprint to the most often submitted fingerprint of other acoustIDs attached to the recording:
    • at those similar to mine, I disable all recordings that are not this recording
    • at those not similar to mine, I disable this recording (and leave others, if present)

During this process I look at other linked recordings that are possibly duplicates. I look at their fingerprints and if they are (mostly) also similar, I initiate a merge. If it’s unclear, I leave it and do not disable it from the acoustID. When I’m finished with the clean up, the recording will only be represented by similar = verified acoustIDs and no clearly different recording will remain at these acoustIDs. :slight_smile:

2 Likes

I’ve seen same acoustID when clean vs. explicit.

3 Likes