user warning: Unknown column 'captcha_type' in 'field list' query: SELECT module, captcha_type FROM captcha_points WHERE form_id = 'user_login_block' in /var/www/drupal/sites/all/modules/captcha/captcha.inc on line 64.

LinkedBrainz Summary

Title of Primary Project Output: LinkedBrainz Live!

The type of user this product is for: LinkedBrainz Live is for application developers and institutions who require or use information about recorded music. Especially if thes require to interlink datasets or benefit from inference via automated reasoning.

RDF Dump

With the LinkedBrainz D2R fork running under the latest revision of mappings we have produced an dump of the current RDF translation of the MusicBrainz NGS dataset. This is now 23.8GB of NTriples, ~180M assertions, which are available as a 2.7GB gzip compressed download.

In order to load this dump it is necessary to chunk it into manageable pieces. For instance, to load into OWLIM execute the following:
$split -l 1000000 musicbrainz_ngs_dump.rdf.ttl.gz $owlim/preload

Pagination in RDF

The HTML pages on MusicBrainz use lots of pagination. This not only prevents the end user from being overwhelmed by a listing of thousands of tracks, it reduces the size of HTML pages to something manageable and also ensures a smaller and predictable load on the database server.

In RDF, it would be preferable to avoid pagination. However, this is not particularly practical for artists that are associated with many, many releases and recordings. The problem becomes acute for classical composers like Bach who are credited with tens of thousands recordings. Arguably a more appropriate utilization of NGS would have composers such as Bach credited to works rather than recordings. However, the general opinion of MB editors seems to favor crediting composers as well as performers on recordings. Therefore, the complete RDF resource description for Bach would be immense. This would cause an unacceptable load on the database server and long wait times for dereferenced URIs. One solution is to use pagination in RDF. Terms to support this exist in the XHTML Vocabulary which is generally intended for use with RDFa on paginated websites.

Syndicate content