Authors
Susan Dumais, John Platt, David Heckerman, Mehran Sahami
Publication date
1998/11/1
Book
Proceedings of the seventh international conference on Information and knowledge management
Pages
148-155
Description
Text categorization–the assignment of natural language texts to one or more predefined categories based on their content–is an important component in many information organization and management tasks. We compare the effectiveness of five different automatic learning algorithms for text categorization in terms of learning speed, realtime classification speed, and classification accuracy. We also examine training set size, and alternative document representations. Very accurate text classifiers can be learned automatically from training examples. Linear Support Vector Machines (SVMs) are particularly promising because they are very accurate, quick to train, and quick to evaluate. 1.1 Keywords
Total citations
19992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024305375881531351581621311521381201021051041119474897574724333295
Scholar articles
S Dumais, J Platt, D Heckerman, M Sahami - Proceedings of the seventh international conference on …, 1998