Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleOctober 2001
Learning probabilistic datalog rules for information classification and transformation
CIKM '01: Proceedings of the tenth international conference on Information and knowledge managementOctober 2001, Pages 387–394https://doi.org/10.1145/502585.502651Probabilistic Datalog is a combination of classical Datalog (i.e., function-free Horn clause predicate logic) with probability theory. Therefore, probabilistic weights may be attached to both facts and rules. But it is often impossible to assign exact ...
- ArticleOctober 2001
SQL database primitives for decision tree classifiers
CIKM '01: Proceedings of the tenth international conference on Information and knowledge managementOctober 2001, Pages 379–386https://doi.org/10.1145/502585.502650Scalable data mining in large databases is one of today's challenges to database technologies. Thus, substantial effort is dedicated to a tight coupling of database and data mining systems leading to database primitives supporting data mining tasks. In ...
- ArticleOctober 2001
Summarization as feature selection for text categorization
CIKM '01: Proceedings of the tenth international conference on Information and knowledge managementOctober 2001, Pages 365–370https://doi.org/10.1145/502585.502647We address the problem of evaluating the effectiveness of summarization techniques for the task of document categorization. It is argued that for a large class of automatic categorization algorithms, extraction-based document categorization can be ...
- ArticleOctober 2001
Using LSI for text classification in the presence of background text
CIKM '01: Proceedings of the tenth international conference on Information and knowledge managementOctober 2001, Pages 113–118https://doi.org/10.1145/502585.502605This paper presents work that uses Latent Semantic Indexing (LSI) for text classification. However, in addition to relying on labeled training data, we improve classification accuracy by also using unlabeled data and other forms of available "background"...
- ArticleOctober 2001
Text classification in a hierarchical mixture model for small training sets
CIKM '01: Proceedings of the tenth international conference on Information and knowledge managementOctober 2001, Pages 105–113https://doi.org/10.1145/502585.502604Documents are commonly categorized into hierarchies of topics, such as the ones maintained by Yahoo! and the Open Directory project, in order to facilitate browsing and other interactive forms of information retrieval. In addition, topic hierarchies can ...
- ArticleOctober 2001
Combining multiple classifiers for text categorization
CIKM '01: Proceedings of the tenth international conference on Information and knowledge managementOctober 2001, Pages 97–104https://doi.org/10.1145/502585.502603A major problem facing online information services is how to index and supplement large document collections with respect to a rich set of categories. We focus upon the routing of case law summaries to various secondary law volumes in which they should ...