skip to main content
10.1007/978-3-319-46128-1_17guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

M-Zoom: Fast Dense-Block Detection in Tensors with Quality Guarantees

Published: 19 September 2016 Publication History

Abstract

Given a large-scale and high-order tensor, how can we find dense blocks in it__ __ Can we find them in near-linear time but with a quality guarantee__ __ Extensive previous work has shown that dense blocks in tensors as well as graphs indicate anomalous or fraudulent behavior e.g., lockstep behavior in social networks. However, available methods for detecting such dense blocks are not satisfactory in terms of speed, accuracy, or flexibility. In this work, we propose M-Zoom, a flexible framework for finding dense blocks in tensors, which works with a broad class of density measures. M-Zoom has the following properties: 1 Scalable: M-Zoom scales linearly with all aspects of tensors and is upï źto 114$$\times $$faster than state-of-the-art methods with similar accuracy. 2 Provably accurate: M-Zoom provides a guarantee on the lowest density of the blocks it finds. 3 Flexible: M-Zoom supports multi-block detection and size bounds as well as diverse density measures. 4 Effective: M-Zoom successfully detected edit wars and bot activities in Wikipedia, and spotted network attacks from a TCP dump with near-perfect accuracy AUCï ź=ï ź0.98. The data and software related to this paper are available at http://www.cs.cmu.edu/~kijungs/codes/mzoom/.

References

[1]
Supplementary document examples, proofs, and additional experiments. http://www.cs.cmu.edu/~kijungs/codes/mzoom/supple.pdf
[2]
Akoglu, L., McGlohon, M., Faloutsos, C.: Oddball: spotting anomalies in weighted graphs. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. eds. PAKDD 2010. LNCS LNAI, vol. 6119, pp. 410---421. Springer, Heidelberg 2010.
[3]
Andersen, R., Chellapilla, K.: Finding dense subgraphs with size bounds. In: Kumar, R., Sivakumar, D. eds. WAW 2010. LNCS, vol. 6516, pp. 25---37. Springer, Heidelberg 2009.
[4]
Bader, B.W., Kolda, T.G., et al.: Matlab tensor toolbox version 2.6. http://www.sandia.gov/~tgkolda/TensorToolbox/
[5]
Beutel, A., Xu, W., Guruswami, V., Palow, C., Faloutsos, C.: Copycatch: stopping group attacks by spotting lockstep behavior in social networks. In: WWW 2013
[6]
Cerf, L., Besson, J., Robardet, C., Boulicaut, J.F.: Data peeler: constraint-based closed pattern mining in n-ary relations. In: SDM 2008
[7]
Charikar, M.: Greedy approximation algorithms for finding dense components in a graph. In: Jansen, K., Leonardi, S., Vazirani, V. eds. APPROX 2002. LNCS, vol. 2462, pp. 84---95. Springer, Heidelberg 2000.
[8]
Gibson, D., Kumar, R., Tomkins, A.: Discovering large dense subgraphs in massive graphs. In: VLDB 2005
[9]
Goldberg, A.V.: Finding a maximum density subgraph. Technical report 1984
[10]
Hooi, B., Song, H.A., Beutel, A., Shah, N., Shin, K., Faloutsos, C.: Fraudar: bounding graph fraud in the face of camouflage. In: KDD 2016
[11]
Ignatov, D.I., Kuznetsov, S.O., Poelmans, J., Zhukov, L.E.: Can triconcepts become triclusters? Int. J. Gen. Syst. 426, 572---593 2013
[12]
Jiang, M., Beutel, A., Cui, P., Hooi, B., Yang, S., Faloutsos, C.: A general suspiciousness metric for dense blocks in multimodal data. In: ICDM 2015
[13]
Jiang, M., Cui, P., Beutel, A., Faloutsos, C., Yang, S.: Catchsync: catching synchronized behavior in large directed graphs. In: KDD 2014
[14]
Jiang, M., Cui, P., Beutel, A., Faloutsos, C., Yang, S.: Inferring strange behavior from connectivity pattern in social networks. In: Tseng, V.S., Ho, T.B., Zhou, Z.-H., Chen, A.L.P., Kao, H.-Y. eds. PAKDD 2014. LNCS LNAI, vol. 8443, pp. 126---138. Springer, Heidelberg 2014.
[15]
Kannan, R., Vinay, V.: Analyzing the structure of large graphs. Technical report 1999
[16]
Khuller, S., Saha, B.: On finding dense subgraphs. In: Loeckx, J. ed. ICALP 1974. LNCS, vol. 14, pp. 597---608. Springer, Heidelberg 2009.
[17]
Kolda, T.G., Bader, B.W.: Tensor decompositions and applications. SIAM Rev. 513, 455---500 2009
[18]
Lee, V.E., Ruan, N., Jin, R., Aggarwal, C.: A survey of algorithms for dense subgraph discovery. In: Aggarwal, C.C., Wang, H. eds. Managing and Mining Graph Data, vol. 40, pp. 303---336. Springer, Heidelberg 2010
[19]
Maruhashi, K., Guo, F., Faloutsos, C.: Multiaspectforensics: pattern mining on large-scale heterogeneous networks with tensor analysis. In: ASONAM 2011
[20]
Pandit, S., Chau, D.H., Wang, S., Faloutsos, C.: Netprobe: a fast and scalable system for fraud detection in online auction networks. In: WWW 2007
[21]
Prakash, B.A., Sridharan, A., Seshadri, M., Machiraju, S., Faloutsos, C.: EigenSpokes: surprising patterns and scalable community chipping in large graphs. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. eds. PAKDD 2010. LNCS LNAI, vol. 6119, pp. 435---448. Springer, Heidelberg 2010.
[22]
Seppänen, J.K., Mannila, H.: Dense itemsets. In: KDD 2004
[23]
Shah, N., Beutel, A., Gallagher, B., Faloutsos, C.: Spotting suspicious link behavior with fbox: an adversarial perspective. In: ICDM 2014

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
ECML PKDD 2016: European Conference on Machine Learning and Knowledge Discovery in Databases - Volume 9851
September 2016
811 pages
ISBN:9783319461274
  • Editors:
  • Paolo Frasconi,
  • Niels Landwehr,
  • Giuseppe Manco,
  • Jilles Vreeken

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 19 September 2016

Author Tags

  1. Anomaly/Fraud detection
  2. Dense-block detection
  3. Tensor

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media