Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
Koraljka Golub, Johan Hagelbäck, Anders Ardö
Table 3 Accuracy of the Support Vector Machine classifier on the different datasets.
Dataset    Accuracy, unigrams          Accuracy, unigrams + 2-grams
           Training set   Test set     Training set   Test set
T          93.74%         40.91%       99.59%         40.45%
T_KW       97.50%         65.25%       99.90%         66.13%
KW         83.09%         64.02%       92.38%         64.09%
T_MC       93.95%         57.99%       99.62%         57.80%
T_KW_MC    97.89%         80.75%       99.93%         81.37%
KW_MC      90.58%         79.56%       96.30%         80.38%
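
The difference between training-set and test-set accuracy, and the effect of adding 2-grams on test accuracy, can be read off Table 3 with simple arithmetic. The following is a minimal sketch that recomputes these figures from the reported values; the names TABLE_3 and summarize are illustrative assumptions and are not part of the original study.

```python
# Accuracy values (%) copied from Table 3, in the order:
# (unigrams train, unigrams test, unigrams+2-grams train, unigrams+2-grams test)
TABLE_3 = {
    "T":       (93.74, 40.91, 99.59, 40.45),
    "T_KW":    (97.50, 65.25, 99.90, 66.13),
    "KW":      (83.09, 64.02, 92.38, 64.09),
    "T_MC":    (93.95, 57.99, 99.62, 57.80),
    "T_KW_MC": (97.89, 80.75, 99.93, 81.37),
    "KW_MC":   (90.58, 79.56, 96.30, 80.38),
}


def summarize(table):
    """Return, per dataset, the train/test gaps and the test-accuracy
    change from adding 2-grams, all in percentage points."""
    rows = {}
    for name, (uni_train, uni_test, bi_train, bi_test) in table.items():
        rows[name] = {
            "gap_unigrams": round(uni_train - uni_test, 2),
            "gap_with_2grams": round(bi_train - bi_test, 2),
            "test_change": round(bi_test - uni_test, 2),
        }
    return rows


summary = summarize(TABLE_3)

# Dataset with the highest test accuracy using unigrams + 2-grams
best = max(TABLE_3, key=lambda name: TABLE_3[name][3])

if __name__ == "__main__":
    for name, row in summary.items():
        print(name, row)
    print("Highest test accuracy (unigrams + 2-grams):", best)
```

On these numbers, adding 2-grams raises training-set accuracy for every dataset, while test-set accuracy changes by less than one percentage point in either direction.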