A subset of documents from the catalogue of the DNB (Deutsche Nationalbibliothek, the German National Library), with subject predictions generated by an arbitrary indexing method. The document ids match those in the dnb_gold_standard dataset.

dnb_test_predictions

Format

dnb_test_predictions

A data frame with 100,000 rows and 3 columns:

doc_id

DNB identifier of a document in the catalogue.

label_id

DNB identifier of a concept in the GND subject vocabulary.

score

A confidence score in \([0, 1]\) generated by the indexing method.
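
The three columns form a long table with one row per predicted (document, concept) pair, so a typical first step is to filter by score or to keep the best few labels per document. The following is a minimal base R sketch of both operations. It builds a small toy data frame with the same column names so that it runs without the package; the ids, the scores, the threshold of 0.5 and the cutoff `k = 2` are illustrative assumptions, not values taken from the dataset.

```r
# Toy stand-in for dnb_test_predictions: same three columns, hypothetical values.
preds <- data.frame(
  doc_id   = c("d1", "d1", "d1", "d2", "d2"),
  label_id = c("g1", "g2", "g3", "g1", "g4"),
  score    = c(0.91, 0.40, 0.05, 0.77, 0.62),
  stringsAsFactors = FALSE
)

# Keep only predictions at or above an illustrative confidence threshold.
threshold <- 0.5
confident <- preds[preds$score >= threshold, ]

# Rank predictions within each document by descending score,
# then keep the k highest-scoring labels per document.
k <- 2
ranked <- preds[order(preds$doc_id, -preds$score), ]
ranked$rank <- ave(ranked$score, ranked$doc_id, FUN = seq_along)
top_k <- ranked[ranked$rank <= k, ]
```

To apply the same steps to the packaged data, replace `preds` with `dnb_test_predictions` once the package providing it is loaded.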