|
|
Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
|
Koraljka Golub, Johan Hagelbäck, Anders Ardö
|
|
|
Table 5 Accuracy of the Support Vector Machine classifier using different pre-processing. |
|
Support Vector Machine

| Dataset | Accuracy, unigrams (training set) | Accuracy, unigrams (test set) | Accuracy, unigrams + 2-grams (training set) | Accuracy, unigrams + 2-grams (test set) |
| --- | --- | --- | --- | --- |
| T_KW_MC | 97.89% | 80.75% | 99.93% | 81.37% |
| T_KW_MC_rem | 92.51% | 80.94% | 95.02% | 81.83% |
| T_KW_MC_stm | 97.21% | 81.07% | 99.91% | 81.80% |
| T_KW_MC_stm_rem | 92.18% | 81.34% | 94.89% | 82.20% |
| T_KW_MC_sw | 95.44% | 80.98% | 98.48% | 81.24% |
| T_KW_MC_sw_rem | 92.46% | 81.04% | 94.30% | 82.13% |
| T_KW_MC_sw_stm | 94.87% | 81.40% | 98.72% | 81.24% |
| T_KW_MC_sw_stm_rem | 92.17% | 81.54% | 94.16% | 81.90% |
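As a reading aid, the figures in Table 5 can be compared programmatically. The following is a minimal sketch and not part of the original study: the accuracy values are copied from the table, while the names `TABLE_5` and `summarize` and the derived quantities (the test-set change from adding 2-grams, and the gap between training and test accuracy) are illustrative assumptions introduced here.

```python
# Accuracy values (percent) copied from Table 5.
# Tuple order: (unigrams train, unigrams test,
#               unigrams + 2-grams train, unigrams + 2-grams test)
# TABLE_5 and summarize are hypothetical names used only for this sketch.
TABLE_5 = {
    "T_KW_MC": (97.89, 80.75, 99.93, 81.37),
    "T_KW_MC_rem": (92.51, 80.94, 95.02, 81.83),
    "T_KW_MC_stm": (97.21, 81.07, 99.91, 81.80),
    "T_KW_MC_stm_rem": (92.18, 81.34, 94.89, 82.20),
    "T_KW_MC_sw": (95.44, 80.98, 98.48, 81.24),
    "T_KW_MC_sw_rem": (92.46, 81.04, 94.30, 82.13),
    "T_KW_MC_sw_stm": (94.87, 81.40, 98.72, 81.24),
    "T_KW_MC_sw_stm_rem": (92.17, 81.54, 94.16, 81.90),
}


def summarize(table):
    """Derive per-dataset differences, in percentage points."""
    rows = {}
    for name, (uni_train, uni_test, bi_train, bi_test) in table.items():
        rows[name] = {
            # Change in test accuracy when 2-grams are added to unigrams.
            "bigram_gain": round(bi_test - uni_test, 2),
            # Training accuracy minus test accuracy for each feature set.
            "unigram_gap": round(uni_train - uni_test, 2),
            "bigram_gap": round(bi_train - bi_test, 2),
        }
    return rows


summary = summarize(TABLE_5)

# Datasets with the highest test-set accuracy for each feature set.
best_unigram = max(TABLE_5, key=lambda name: TABLE_5[name][1])
best_bigram = max(TABLE_5, key=lambda name: TABLE_5[name][3])

for name, row in summary.items():
    print(f"{name:20s} gain={row['bigram_gain']:+.2f} "
          f"gap(uni)={row['unigram_gap']:.2f} gap(uni+2)={row['bigram_gap']:.2f}")
print("Best test accuracy, unigrams:", best_unigram)
print("Best test accuracy, unigrams + 2-grams:", best_bigram)
```

The differences are simple subtractions of the reported percentages, so they describe only this table and say nothing about statistical significance.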
|
|
|