Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
Koraljka Golub, Johan Hagelbäck, Anders Ardö
Table 1. The different datasets generated from the raw LIBRIS data.
Dataset                              ID       Records  Classes
Titles                               T        143,838  816
Titles and keywords                  T_KW     121,505  802
Keywords only                        KW       121,505  802
Titles, major classes                T_MC      72,937   29
Titles and keywords, major classes   T_KW_MC   60,641   29
Keywords only, major classes         KW_MC     60,641   29
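As a reading aid, the figures in Table 1 can be held in a small data structure to compare how many records are available per class in each dataset. The sketch below is illustrative only: the `Dataset` class, the `records_per_class` property, and the `BY_ID` lookup are hypothetical names introduced here, and the mean records per class is a derived quantity rather than one reported in the table. Only the names, IDs, and counts come from Table 1.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dataset:
    """One row of Table 1 (hypothetical container, not from the paper)."""
    name: str
    id: str
    records: int
    classes: int

    @property
    def records_per_class(self) -> float:
        # Derived statistic: mean number of records per class.
        return self.records / self.classes


# Values copied from Table 1.
DATASETS = [
    Dataset("Titles", "T", 143_838, 816),
    Dataset("Titles and keywords", "T_KW", 121_505, 802),
    Dataset("Keywords only", "KW", 121_505, 802),
    Dataset("Titles, major classes", "T_MC", 72_937, 29),
    Dataset("Titles and keywords, major classes", "T_KW_MC", 60_641, 29),
    Dataset("Keywords only, major classes", "KW_MC", 60_641, 29),
]

# Lookup by dataset ID for convenience.
BY_ID = {d.id: d for d in DATASETS}

if __name__ == "__main__":
    # Print ID, records, classes, and mean records per class.
    for d in DATASETS:
        print(f"{d.id:8s} {d.records:>8,d} {d.classes:>4d} "
              f"{d.records_per_class:>9.1f}")
```

Restricting to the 29 major classes roughly halves the number of records but leaves far more records per class, which is worth bearing in mind when comparing results across the datasets.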