Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
Koraljka Golub, Johan Hagelbäck, Anders Ardö
Table 5 Accuracy of the Supper Vector Machine classifier using different pre-processing.
Support Vector Machine
Dataset Accuracy, unigrams Accuracy, unigrams + 2-grams
Training set Test set Training set Test set
T_KW_MC 97.89% 80.75% 99.93% 81.37%
T_KW_MC_rem 92.51% 80.94% 95.02% 81.83%
T_KW_MC_stm 97.21% 81.07% 99.91% 81.80%
T_KW_MC_stm_rem 92.18% 81.34% 94.89% 82.20%
T_KW_MC_sw 95.44% 80.98% 98.48% 81.24%
T_KW_MC_sw_rem 92.46% 81.04% 94.30% 82.13%
T_KW_MC_sw_stm 94.87% 81.40% 98.72% 81.24%
T_KW_MC_sw_stm_rem 92.17% 81.54% 94.16% 81.90%