[2021-07-14] Current data dumps incomplete?

Tags: #<Tag:0x00007fe855070ea0>

Can it be that the actual data dump in
http://ftp.musicbrainz.org/pub/musicbrainz/data/search-indexes/20210714-041021/
and
https://mirrors.dotsrc.org/MusicBrainz/data/search-indexes/20210714-041021/
is incomplete?

There are no files for

md5sum: recording.tar.zst: No such file or directory
recording.tar.zst: FAILED open or read
md5sum: release-group.tar.zst: No such file or directory
release-group.tar.zst: FAILED open or read
md5sum: release.tar.zst: No such file or directory
release.tar.zst: FAILED open or read
md5sum: series.tar.zst: No such file or directory
series.tar.zst: FAILED open or read
md5sum: tag.tar.zst: No such file or directory
tag.tar.zst: FAILED open or read
md5sum: url.tar.zst: No such file or directory
url.tar.zst: FAILED open or read
md5sum: work.tar.zst: No such file or directory
work.tar.zst: FAILED open or read
md5sum: WARNING: 7 listed files could not be read

This one seems to be complete
https://data.metabrainz.org/pub/musicbrainz/data/search-indexes/20210714-041021/
but not (yet?) replicated?

Yes, replication was not complete at that time.
Search indexes are dumped every Wednesday and Saturday.
Replicating 30GB takes some time depending on the mirror.

1 Like

Thanks for that information @yvanzo

Maybe the files LATEST and latest-is-yyyymmdd-hhmmss should be synchronized at the end of the process (to ensure that the files inside the directory with the same name are completely replicated) :face_with_monocle:

Next time, I wait until Thursday :innocent:

1 Like