I need a fairly large sample (or the whole dataset, if possible) for some academic research and analysis.
I first thought I should download the database dumps. However, I do not have much knowledge of server and database management, and the wiki says you need to use Docker to set up a server, most of which is fairly confusing to me.
I chose to go down the JSON dumps route instead. Now I need a way to convert that JSON into a tabular format, preferably using sqlite3, since I am slightly familiar with it.
However, I have no idea how to do that, since there seems to be little to no repeating structure in the JSON dumps.
Can anyone suggest how I can go about this? Or is using the database dump a better idea for my use case?
That will not be an easy task.
Just have a look at the database schema that you would need to recreate in order to rebuild the complete database in something like SQLite.
This is not something that can be done in a few minutes.
I suggest going the Docker/container route, which already includes all the necessary parts, such as MusicBrainz Server with replication, the search server, and the development setup.
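
If you only need a sample and still want to try the JSON route, here is a rough sketch of one way to get the data into sqlite3 without rebuilding the full schema. It assumes each line of the extracted dump file is one complete JSON document, and that each document has top-level "id" and "name" keys; please check both against your actual files. The function name `load_json_lines` and the table name `entity` are made up for this example. It is not the full relational database, just a flat table with the raw JSON kept alongside.

```python
import json
import sqlite3


def load_json_lines(lines, conn, table="entity"):
    """Store one row per JSON document: id, name, and the raw JSON text.

    Assumption: every non-empty line is a complete JSON object.
    Assumption: "id" and "name" are top-level keys (missing keys become NULL).
    """
    conn.execute(
        f"CREATE TABLE IF NOT EXISTS {table} "
        "(id TEXT PRIMARY KEY, name TEXT, doc TEXT NOT NULL)"
    )
    count = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        doc = json.loads(line)
        conn.execute(
            f"INSERT OR REPLACE INTO {table} (id, name, doc) VALUES (?, ?, ?)",
            (doc.get("id"), doc.get("name"), json.dumps(doc)),
        )
        count += 1
    conn.commit()
    return count


# Example usage (the file names here are placeholders):
# conn = sqlite3.connect("sample.db")
# with open("artist", encoding="utf-8") as f:
#     print(load_json_lines(f, conn, table="artist"))
```

Keeping the whole document in the `doc` column means you can pull out nested fields later, either in Python with `json.loads` or with SQLite's JSON functions if your SQLite build includes them. For anything that needs proper joins across entities, the Docker setup is probably still the better choice.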