With the LinkedBrainz D2R fork running under the latest revision of mappings we have produced an dump of the current RDF translation of the MusicBrainz NGS dataset. This is now 23.8GB of NTriples, ~180M assertions, which are available as a 2.7GB gzip compressed download.
In order to load this dump it is necessary to chunk it into manageable pieces. For instance, to load into OWLIM execute the following:
$split -l 1000000 musicbrainz_ngs_dump.rdf.ttl.gz $owlim/preload
To load into OWLIM, the GettingStarted app can be used by adapting the example.sh script to read:
. ../setvars.sh
java -Xmx3433m -cp "bin:$CP_TESTS" GettingStarted flush=true $* 1>mb.out 2>mb.err
(This assumes 4GB of available memory - adjust according to the configuration spreadsheet.)






Post new comment