Two New Feature Extraction Methods for Text Classification: TESDF and SADF

Kılıç E., Ateş N., Karakaya A., Şahin D. Ö.

23nd Signal Processing and Communications Applications Conference (SIU), Malatya, Turkey, 16 - 19 May 2015, pp.475-478 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/siu.2015.7129862
  • City: Malatya
  • Country: Turkey
  • Page Numbers: pp.475-478
  • Keywords: text classification, term weighting, inverse document frequency
  • Ondokuz Mayıs University Affiliated: Yes


In this study, two new document weighting methods are proposed based on term frequency-inverse document frequency (TF-IDF) generally used in text mining methods. Also, insignificance of the verb in text classification which will be a new method in pre-processing have been put forward and tested. The better results were observed through using these methods when these methods compare with other method, It was observed that the performance rate hardly change and the data size which was processed decreased by omitting verbs of texts.