The Internet Archive, namely Jason Scott, has been given a “dump” of MP3.com’s digital audio files before their CNET take-over in late 2004. This “dump” contains a large amount of early MP3 files from various independent musicians.
The dump has been “de-duplicated” and roughly sorted by filename into various chunks.
I am of course downloading a chunk to see what results turn up in Picard!
… As a result, descriptions and information are scant about the tracks and artist information was not wrapped into the MP3s themselves. While they are all playable, very little else is known about them. They have been put into a set of relatively large multi-track sets, but none of the tracks are necessarily related to each other.
Please let us also know how AcoustID and/or Picard recognize this tracks.
Well so far in the Internet Archive Discord, Jason’s done a few packs and we’re getting less than maybe 1% detection rate… these tracks are SUPER RARE - so looks like I’ve got some more research and adding to the database ahead of me
I have downloaded the part “A-2” (18 GB) with 4’996 songs.
These mp3 only have a quality of 128 kbit/s CBR and only contains an ID3v1 tag.
Embedded are the Artist name and Title and in the comment an URL to the old mp3.com website.
There is no information about Album, release Year, Genre and no Cover is embedded.
The only thing I currently see, is to search for additional information in the wayback machine:
Go for it! I’m diverting at the moment and filling in the database with MP3.com bands / artists that aren’t already here, in the hopes that we can match some of these thousands of digital tracks to some one.
Ultimately no; it’s a case of going extremely hard with searches, using other databases like Discogs, last.fm, AllMusic etc. to see if they have any records. It’s all fun, but extremely time consuming.
Hi - there are still a lot of us around who worked on the mp3.com code, so if there are particular questions you need answered, ask away.
As far as I remember we never stored anything other than 128kbit tracks; storage was expensive so it wasn’t like there were archives of better bit rate masters somewhere.
Very cool - I’m still working on this here and there when I get a few moments to spare.
What IA have are the final resultant/rendered HTML pages, and not all of them.
Anything overly dynamic (like genre pages) are gone, but artist pages, artist-info and song-info pages seem to be around.
We also seem to have a lot of the cover art (albeit in classic early internet low resolution) for the tracks.
My next goal is to gather a list of the artist images that are missing, and “touch” the Internet Archive’s WayBackMachine to see if they saved any of them. Get them downloaded and uploaded.
After that, the next thing would be to look to seeing if we can link the tracks in the MP3 Rescue Barge back to the website that I’ve got… this should allow people to find the MP3’s a bit more “naturally”.
Ultimately, my goal is to have this up and running in such a way that I can then start to add the songs in the rescue barge into the MBz DB.
First of all a giant thanks for everyone chipping in, especially sound.and.vision the 2003 project. Coincidentally, I found this trying to piece together more information on the mp3.com rescue barge and thanks to the 2003 project I was able to find out more information on some of the mp3s I’ve been pulling aside and tagging. Exciting! My question is (and maybe no one knows) … “F”, “O”, “R”, seems to be missing from the dump, does anyone happen to know if more is coming or did Jason contribute all that he had. Again, thank you everyone, I’m really glad I found this thread!