skip to main content
10.1109/ICDM.2005.38guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

CanTree: A Tree Structure for Efficient Incremental Mining of Frequent Patterns

Published: 27 November 2005 Publication History

Abstract

Since its introduction, frequent-pattern mining has been the subject of numerous studies, including incremental updating. Many existing incremental mining algorithms are Apriori-based, which are not easily adoptable to FP-tree based frequent-pattern mining. In this paper, we propose a novel tree structure, called CanTree (Canonical-order Tree), that captures the content of the transaction database and orders tree nodes according to some canonical order. By exploiting its nice properties, the CanTree can be easily maintained when database transactions are inserted, deleted, and/or modified. For example, the CanTree does not require adjustment, merging, and/or splitting of tree nodes during maintenance. No rescan of the entire updated database or reconstruction of a new tree is needed for incremental updating. Experimental results show the effectiveness of our CanTree.

References

[1]
R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proc. SIGMOD 1993, pp. 207-216.
[2]
R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proc. VLDB 1994, pp. 487-499.
[3]
N.F. Ayan, A.U. Tansel, and E. Arkun. An efficient algorithm to update large itemsets with early pruning. In Proc. SIGKDD 1999, pp. 287-291.
[4]
R.J. Bayardo. Efficiently mining long patterns from databases. In Proc. SIGMOD 1998, pp. 85-93.
[5]
F. Bonchi and C. Lucchese. On closed constrained frequent pattern mining. In Proc. ICDM 2004, pp. 35-42.
[6]
S. Brin, R. Motwani, and C. Silverstein. Beyond market baskets: generalizing association rules to correlations. In Proc. SIGMOD 1997, pp. 265-276.
[7]
C. Bucila, J. Gehrke, D. Kifer, and W.M. White. DualMiner: a dual-pruning algorithm for itemsets with constraints. In Proc. SIGKDD 2002, pp. 42-51.
[8]
D.W. Cheung, J. Han, V.T. Ng, and C.Y. Wong. Maintenance of discovered association rules in large databases: an incremental updating technique. In Proc. ICDE 1996, pp. 106-114.
[9]
D.W. Cheung, S. D. Lee, and B. Kao. A general incremental technique for maintaining discovered association rules. In Proc. DASFAA 1997, pp. 185-194.
[10]
W. Cheung and O.R. Zaïane. Incremental mining of frequent patterns without candidate generation or support constraint. In Proc. IDEAS 2003, pp. 111-116.
[11]
T. Fukuda, Y. Morimoto, S. Morishita, and T. Tokuyama. Data mining using two-dimensional optimized association rules: scheme, algorithms, and visualization. In Proc. SIGMOD 1996, pp. 13-23.
[12]
K. Gade, J. Wang, and G. Karypis. Efficient closed pattern mining in the presence of tough block constraints. In Proc. SIGKDD 2004, pp. 138-147.
[13]
J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In Proc. SIGMOD 2000, pp. 1-12.
[14]
J. Han, J. Pei, Y. Yin, and R. Mao. Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Mining and Knowledge Discovery, 8(1), pp. 53-87, Jan. 2004.
[15]
C. Hidber. Online association rule mining. In Proc. SIGMOD 1999, pp. 145-156.
[16]
H. Huang, X. Wu, and R. Relue. Association analysis with one scan of databases. In Proc. ICDM 2002, pp. 629-632.
[17]
J.-L. Koh and S.-F. Shieh. An efficient approach for maintaining association rules based on adjusting FP-tree structures. In Proc. DASFAA 2004, pp. 417-424.
[18]
L.V.S. Lakshmanan, C.K.-S. Leung, and R.T. Ng. Efficient dynamic mining of constrained frequent sets. ACM TODS, 28(4), pp. 337-389, Dec. 2003.
[19]
C.K.-S. Leung. Interactive constrained frequent-pattern mining system. In Proc. IDEAS 2004, pp. 49-58.
[20]
C.K.-S. Leung, L.V.S. Lakshmanan, and R.T. Ng. Exploiting succinct constraints using FP-trees. SIGKDD Explorations, 4(1), pp. 40- 49, June 2002.
[21]
C.K.-S. Leung, R.T. Ng, and H. Mannila. OSSM: a segmentation approach to optimize frequency counting. In Proc. ICDE 2002, pp. 583-592.
[22]
R.T. Ng, L.V.S. Lakshmanan, J. Han, and A. Pang. Exploratory mining and pruning optimizations of constrained associations Rules. In Proc. SIGMOD 1998, pp. 13-24.
[23]
K.-L. Ong, W.K. Ng, and E.-P. Lim. FSSM: fast construction of the optimized segment support map. In Proc. DaWaK 2003, pp. 257- 266.
[24]
J.S. Park, M.-S. Chen, and P.S. Yu. Using a hash-based method with transaction trimming for mining association rules. IEEE TKDE, 9(5), pp. 813-825, Sept./Oct. 1997.
[25]
J. Pei, J. Han, and L.V.S. Lakshmanan. Mining frequent itemsets with convertible constraints. In Proc. ICDE 2001, pp. 433-442.
[26]
J. Pei, J. Han, and R. Mao. CLOSET: an efficient algorithm for mining frequent closed itemsets. In Proc. DMKD 2000, pp. 21-30.
[27]
S. Sarawagi, S. Thomas, and R. Agrawal. Integrating association rule mining with relational database systems: alternatives and implications. In Proc. SIGMOD 1998, pp. 343-354.
[28]
D. Tsur, J.D. Ullman, S. Abiteboul, C. Clifton, R. Motwani, S. Nestorov, and A. Rosenthal. Query flocks: a generalization of association-rule mining. In Proc. SIGMOD 1998, pp. 1-12.
[29]
M.J. Zaki and C.-J. Hsiao. CHARM: an efficient algorithm for closed itemset mining. In Proc. SDM 2002.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
ICDM '05: Proceedings of the Fifth IEEE International Conference on Data Mining
November 2005
837 pages
ISBN:0769522785

Publisher

IEEE Computer Society

United States

Publication History

Published: 27 November 2005

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media