Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
Koraljka Golub, Johan Hagelbäck, Anders Ardö
Table 2 Accuracy of the Multinomial Naïve Bayes classifier on the different datasets.
Dataset Accuracy, unigrams Accuracy, unigrams + 2-grams
Training set Test set Training set Test set
T 83.54% 34.89% 95.82% 34.15%
T_KW 90.01% 55.33% 98.14% 55.45%
KW 75.28% 59.15% 84.95% 58.11%
T_MC 90.83% 54.21% 98.63% 50.51%
T_KW_MC 95.42% 76.52% 99.66% 75.96%
KW_MC 86.94% 77.25% 94.24% 77.09%