skip to main content
research-article

An Efficient Multidimensional Big Data Fusion Approach in Machine-to-Machine Communication

Published: 07 June 2016 Publication History

Abstract

Machine-to-Machine communication (M2M) is nowadays increasingly becoming a world-wide network of interconnected devices uniquely addressable, via standard communication protocols. The prevalence of M2M is bound to generate a massive volume of heterogeneous, multisource, dynamic, and sparse data, which leads a system towards major computational challenges, such as, analysis, aggregation, and storage. Moreover, a critical problem arises to extract the useful information in an efficient manner from the massive volume of data. Hence, to govern an adequate quality of the analysis, diverse and capacious data needs to be aggregated and fused. Therefore, it is imperative to enhance the computational efficiency for fusing and analyzing the massive volume of data. Therefore, to address these issues, this article proposes an efficient, multidimensional, big data analytical architecture based on the fusion model. The basic concept implicates the division of magnitudes (attributes), i.e., big datasets with complex magnitudes can be altered into smaller data subsets using five levels of the fusion model that can be easily processed by the Hadoop Processing Server, resulting in formalizing the problem of feature extraction applications using earth observatory system, social networking, or networking applications. Moreover, a four-layered network architecture is also proposed that fulfills the basic requirements of the analytical architecture. The feasibility and efficiency of the proposed algorithms used in the fusion model are implemented on Hadoop single-node setup on UBUNTU 14.04 LTS core i5 machine with 3.2GHz processor and 4GB memory. The results show that the proposed system architecture efficiently extracts various features (such as land and sea) from the massive volume of satellite data.

References

[1]
Divyakant Agrawal, Sudipto Das, and Amr El Abbadi. 2011. Big data and cloud computing: current state and future opportunities. In Proceedings of the 14th International Conference on Extending Database Technology. ACM, 530--533.
[2]
Awais Ahmad, Alfred Daniel, and Anand Paul. 2014. Optimized data transmission using cooperative devices in clustered D2D communication. In Proceedings of the 2014 Conference on Research in Adaptive and Convergent Systems. ACM, 209--214.
[3]
Awais Ahmad, Anand Paul, and M. Mazhar Rathore. 2016. An efficient divide-and-conquer approach for big data analytics in machine-to-machine communication. Neurocomputing 174, Part A, 439--453.
[4]
Awais Ahmad, Anand Paul, M. Mazhar Rathore, and Seungmin Rho. 2015. Power aware mobility management of M2M for IoT communications. Mobile Information Systems 501, 521093. Volume 2015, pages 14.
[5]
Oresti Banos, Rafael Garcia, Juan A. Holgado-Terriza, Miguel Damas, Hector Pomares, Ignacio Rojas, Alejandro Saez, and Claudia Villalonga. 2014. mHealthDroid: A novel framework for agile development of mobile health applications. In Ambient Assisted Living and Daily Activities. Springer, 91--98.
[6]
Kelli A. C. Baumgartner, Silvia Ferrari, and Anil V. Rao. 2009. Optimal control of an underwater sensor network for cooperative target tracking. IEEE Journal of Oceanic Engineering 34, 4, 678--697.
[7]
Cyril Cecchinel, Matthieu Jimenez, Sébastien Mosser, and Michel Riveill. 2014. An architecture to support the collection of big data in the Internet of Things. In Proceedings of the IEEE World Congress on Services (SERVICES). IEEE, 442--449.
[8]
Turgay Celik and Kai-Kuang Ma. 2010. Unsupervised change detection for satellite images using dual-tree complex wavelet transform. IEEE Transactions on Geoscience and Remote Sensing 48, 3, 1199--1210.
[9]
Jeffrey Cohen, Brian Dolan, Mark Dunlap, Joseph M. Hellerstein, and Caleb Welton. MAD skills: new analysis practices for big data. Proceedings of the VLDB Endowment 2, 2, 1481--1492.
[10]
Graham Cormode and Minos Garofalakis. 2010. Histograms and wavelets on probabilistic data., IEEE Transactions on Knowledge and Data Engineering 22, 8, 1142--1157.
[11]
Matthew S. Crouse, Robert D. Nowak, and Richard G. Baraniuk. 1998. Wavelet-based statistical signal processing using hidden Markov models. IEEE Transactions on Signal Processing 46, 4, 886--902.
[12]
Alfredo Cuzzocrea, Domenico Saccà, and Jeffrey D. Ullman. 2013. Big data: A research agenda. In Proceedings of the 17th International Database Engineering & Applications Symposium. ACM, 198--203.
[13]
Belur V. Dasarathy. Sensor fusion potential exploitation-innovative architectures and illustrative applications. Proceedings of the IEEE 85, 1, 24--38.
[14]
Jeffrey Dean and Sanjay Ghemawat. 2008. MapReduce: simplified data processing on large clusters. Communications of the ACM 51, 1, 107--113.
[15]
C. Eaton, D. Deroos, T. Deutsch, G. Lapis and P. C. Zikopoulos. 2012. Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data. McGraw-Hill.
[16]
EnviSat, A. S. A. R. 2007. Product Handbook. European Space Agency 2.2.
[17]
European Space Agency (ESA). 2014a. Earth Online. Retrieved from https://earth.esa.int/.
[18]
European Space Agency (ESA). 2014b. The Beam Project. Retrieved from http://www.brockmann-consult.de/cms/web/beam/.
[19]
Maria González-Audícana, José Luis Saleta, Raquel García Catalán, and Rafael García. 2004. Fusion of multispectral and panchromatic images using improved IHS and PCA mergers based on wavelet decomposition. IEEE Transactions on Geoscience and Remote Sensing 42, 6, 1291--1299.
[20]
Nicolas Haderer, Romain Rouvoy, and Lionel Seinturier. 2013. Dynamic deployment of sensing experiments in the wild using smartphones. In Distributed Applications and Interoperable Systems. Springer, Berlin, 43--56.
[21]
John Heidemann, Fabio Silva, Chalermek Intanagonwiwat, Ramesh Govindan, Deborah Estrin, and Deepak Ganesan. 2001. Building efficient wireless sensor networks with low-level naming. ACM SIGOPS Operating Systems Review 35, 5, 146--159.
[22]
Herodotos Herodotou, Harold Lim, Gang Luo, Nedyalko Borisov, Liang Dong, Fatma Bilgen Cetin, and Shivnath Babu. 2011. Starfish: A self-tuning system for big data analytics. In CIDR. vol. 11, 261--272.
[23]
Chalermek Intanagonwiwat, Deborah Estrin, Ramesh Govindan, and John Heidemann. 2002. Impact of network density on data aggregation in wireless sensor networks. In Proceedings. 22nd International Conference on Distributed Computing Systems. IEEE, 457--458.
[24]
Tzu-Chuan Juan, Shih-En Wei, and Hung-Yun Hsieh. 2013. Data-centric clustering for data gathering in machine-to-machine wireless networks. In Proceedings of the 2013 IEEE International Conference on Communications Workshops (ICC). IEEE, 89--94.
[25]
A. Kleiner, A. Talwalkar, P. Sarkar, and M. I. Jordan. 2012. The big data bootstrap. In Proceedings of the 29th Internatinoal Conference on Machine Learning. 1--8.
[26]
Alexandros Labrinidis and H. V. Jagadish. 2012. Challenges and opportunities with big data. Proceedings of the VLDB Endowment 5, 12, 2032--2033.
[27]
Runze Li, Dennis K. J. Lin, and Bing Li. 2013. Statistical inference in massive data sets. Applied Stochastic Models in Business and Industry 29, 5, 399--409.
[28]
Xiaoquan Li, Fujiang Zhang, and Yongliang Wang. 2013. Research on big data architecture, key technologies and its measures. In Proceedings of the 2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing (DASC). IEEE, 1--4.
[29]
Chung-Chih Liu, Ming-Jang Chiu, Chun-Chieh Hsiao, Ren-Guey Lee, and Yuh-Show Tsai. 2006. Wireless health care service system for elderly with dementia. IEEE Transactions on Information Technology in Biomedicine 10, 4, 696--704.
[30]
Juan Liu and Pierre Moulin. 2001. Information-theoretic analysis of interscale and intrascale dependencies between image wavelet coefficients. IEEE Transactions on Image Processing 10, 11, 1647--1658.
[31]
Peng Liu and Kie B. Eom. 2013. Restoration of multispectral images by total variation with auxiliary image. Optics and Lasers in Engineering 51, 7, 873--882.
[32]
Peng Liu, Fang Huang, Guoqing Li, and Zhiwen Liu. 2012. Remote-sensing image denoising using partial differential equations and auxiliary images as priors. IEEE Geoscience and Remote Sensing Letters 9, 3, 358--362.
[33]
Jianguo Lu and Dingding Li. 2013. Bias correction in a small sample from big data. IEEE Transactions on Knowledge and Data Engineering 25, 11, 2658--2663.
[34]
Wald Lucien. 1999. Some terms of reference in data fusion. IEEE Transactions on Geosciences and Remote Sensing 37, 3, 1190--1193.
[35]
Stephane G. Mallat. 1989. A theory for multiresolution signal decomposition: the wavelet representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 11, 7, 674--693.
[36]
Lori Mann Bruce, Cliff H. Koger, and Jiang Li. 2002. Dimensionality reduction of hyperspectral data using discrete wavelet transform feature extraction. IEEE Transactions on Geoscience and Remote Sensing 40, 10, 2331--2338.
[37]
Samuel Marchal, Xiuyan Jiang, Radu State, and Thomas Engel. 2014. A big data architecture for large scale security monitoring. In Proceedings of the 2014 IEEE International Congress on Big Data. IEEE, 56--63.
[38]
Samuel Marchal, Xiuyan Jiang, Radu State, and Thomas Engel. 2014. A big data architecture for large scale security monitoring. In Proceedings of the 2014 IEEE International Congress on Big Data. IEEE, 56--63.
[39]
M. Mayilvaganan and M. Sabitha. 2013. A cloud-based architecture for Big-Data analytics in smart grid: A proposal. In Proceedings of the 2013 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC). IEEE, 1--4.
[40]
Katina Michael and Keith Miller. 2013. Big data: New opportunities and new challenges [guest editors' introduction]. Computer 46, 6, 22--24.
[41]
Pooja Mishra and Dharmendra Singh. 2014. A statistical-measure-based adaptive land cover classification algorithm by efficient utilization of polarimetric SAR observables. IEEE Transactions on Geoscience and Remote Sensing 52, 5, 2889--2900.
[42]
Sébastien Mosser, Ivan Logre, Nicolas Ferry, and Philippe Collet. 2013. From sensors to visualization dashboards: Need for language composition. In Proceedings of the 2nd International Workshop on Globalization of Modeling Languages at MODELS. 6.
[43]
Sebastien Mosser, Franck Fleurey, Brice Morin, Franck Chauvel, Arnor Solberg, and Iokanaan Goutier. 2012. Sensapp as a reference platform to support cloud experiments: From the internet of things to the internet of services. In Proceedings of the 2012 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC). 400--406.
[44]
F. Nakamura, A. F. Loureiro, and C. Frery. 2007. Data fusion for wireless sensor networks: Methods, models, and classifications. ACM Computing Surveys 39, 3, Article 9.
[45]
Eduardo F. Nakamura, Antonio A. F. Loureiro, and Alejandro C. Frery. 2007. Information fusion for wireless sensor networks: Methods, models, and classifications. ACM Computing Surveys (CSUR) 39, 3, 9.
[46]
Anand Paul and Seungmin Rho. 2015. Probabilistic model for M2M in IoT networking and communication. Telecommunication Systems 62, 1, 59--66.
[47]
Anand Paul, Seungmin Rho, and K. Bharnitharan. 2014. Interactive scheduling for mobile multimedia service in M2M environment. Multimedia Tools and Applications 71, 1, 235--246.
[48]
Anand Paul. 2013. Graph based M2M optimization in an IoT environment. In Proceedings of the 2013 Research in Adaptive and Convergent Systems. ACM, 45--46.
[49]
Anand Paul. 2014. Real-time power management for embedded M2M using intelligent learning methods. ACM Transactions on Embedded Computing Systems (TECS) 13, 5s, 148.
[50]
Zdzisław Pawlak. 2002. Rough set theory and its applications. Journal of Telecommunications and Information Technology 3, 3, 7--10.
[51]
Antonio Plaza, Pablo Martínez, Rosa Pérez, and Javier Plaza. 2004. A quantitative and comparative analysis of endmember extraction algorithms from hyperspectral data. IEEE Transactions on Geoscience and Remote Sensing 42, 3, 650--663.
[52]
Javier Portilla and Eero P. Simoncelli. 2000. A parametric texture model based on joint statistics of complex wavelet coefficients. International Journal of Computer Vision 40, 1, 49--70.
[53]
Lakshmish Ramaswamy, Victor Lawson, and Siva Venkat Gogineni. 2013. Towards a quality-centric big data architecture for federated sensor services. In Proceedings of the 2013 IEEE International Congress on Big Data (BigData Congress). IEEE, 86--93.
[54]
Attila Reiss and Didier Stricker. 2012a. Creating and benchmarking a new dataset for physical activity monitoring. In Proceedings of the 5th International Conference on PErvasive Technologies Related to Assistive Environments. ACM, 40.
[55]
Attila Reiss, and Didier Stricker. 2012b. Introducing a new benchmarked dataset for activity monitoring. In Proceedings of the 2012 16th International Symposium on Wearable Computers (ISWC’12). IEEE, 108--109.
[56]
Mikko Rinne, Seppo Törmä, and Esko Nuutila. 2012. SPARQL-based applications for RDF-encoded sensor data. In Proceedings of the 5th International Conference on Semantic Sensor Networks, vol. 904. 81--96.
[57]
Philip E. Ross. 2004. Managing care through the air [remote health monitoring]. IEEE Spectrum 41, 12, 26--31.
[58]
Vijay P. Shah, Nicolas H. Younan, Surya S. Durbha, and Roger L. King. 2010. Feature identification via a combined ICA--wavelet method for image information mining. IEEE Geoscience and Remote Sensing Letters 7, 1, 18--22.
[59]
R.D. Schneider. 2012. Hadoop for Dummies Special Edition. John Wiley & Sons.
[60]
Wen-Zhan Song, Renjie Huang, Mingsen Xu, Behrooz A. Shirazi, and Richard LaHusen. 2010. Design and deployment of sensor network for real-time high-fidelity volcano monitoring. IEEE Transactions on Parallel and Distributed Systems 21, 11, 1658--1674.
[61]
David R. Thompson, Lukas Mandrake, Martha S. Gilmore, and Rebecca Castaño. 2010. Superpixel endmember detection. IEEE Transactions on Geoscience and Remote Sensing 48, 11, 4023--4033.
[62]
Lizhe Wang, Hui Zhong, Rajiv Ranjan, Albert Zomaya, and Peng Liu. 2014. Estimating the statistical characteristics of remote sensing big data in the wavelet transform domain. IEEE Transactions on Emerging Topics in Computing 2, 3, 324--337.
[63]
Xindong Wu, Xingquan Zhu, Gong-Qing Wu, and Wei Ding. 2014. Data mining with big data. IEEE Transactions on Knowledge and Data Engineering 26, 1, 97--107.
[64]
Chen Yang, Lorenzo Bruzzone, Fengyue Sun, Laijun Lu, Renchu Guan, and Yanchun Liang. 2010. A fuzzy-statistics-based affinity propagation technique for clustering in multispectral images. IEEE Transactions on Geoscience and Remote Sensing 48, 6, 2647--2659.
[65]
Qiang Yang, Yuqiang Chen, Gui-Rong Xue, Wenyuan Dai, and Yong Yu. 2009. Heterogeneous transfer learning for image clustering via the social web. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. Association for Computational Linguistics, 1--9.
[66]
Xiaomeng Yi, Fangming Liu, Jiangchuan Liu, and Hai Jin. 2014. Building a network highway for big data: architecture and challenges. IEEE Network 28, 4, 5--13.
[67]
Arkady Zaslavsky, Charith Perera, and Dimitrios Georgakopoulos. 2013. Sensing as a service and big data. arXiv preprint arXiv: 1301.0159.
[68]
Jia Zhang, Bob Iannucci, Mark Hennessy, Kartik Gopal, Sean Xiao, Sudhakar Kumar, David Pfeffer, et al. 2013. Sensor data as a service—a federated platform for mobile data-centric service development and sharing. In Proceedings of the 2013 IEEE International Conference on Services Computing (SCC). IEEE, 446--453.
[69]
Jin Zhou, Lei Hu, Feng Wang, Hai-Han Lu, and Kai Zhao. 2013. An efficient multidimensional fusion algorithm for IoT data based on partitioning. Tsinghua Science and Technology 18, 4, 369--378.

Cited By

View all

Index Terms

  1. An Efficient Multidimensional Big Data Fusion Approach in Machine-to-Machine Communication

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Embedded Computing Systems
    ACM Transactions on Embedded Computing Systems  Volume 15, Issue 2
    Special Issue on Innovative Design, Special Issue on MEMOCODE 2014 and Special Issue on M2M/IOT
    May 2016
    421 pages
    ISSN:1539-9087
    EISSN:1558-3465
    DOI:10.1145/2888407
    Issue’s Table of Contents
    Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Journal Family

    Publication History

    Published: 07 June 2016
    Accepted: 01 October 2015
    Revised: 01 July 2015
    Received: 01 April 2015
    Published in TECS Volume 15, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Big Data
    2. Hadoop processing server
    3. M2M
    4. data fusion

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • Institute for Information & communications Technology Promotion, Korea

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 02 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media