research-article

On Using Linear Diophantine Equations for Efficient Hiding of Decision Tree Rules

Authors:

Georgios Feretzakis,

Dimitris Kalles,

Vassilios S. VerykiosAuthors Info & Claims

SETN '18: Proceedings of the 10th Hellenic Conference on Artificial Intelligence

Article No.: 12, Pages 1 - 8

https://doi.org/10.1145/3200947.3201030

Published: 09 July 2018 Publication History

Abstract

Data sharing among organizations has become an increasingly common procedure in several areas like advertising, marketing, e-commerce and banking, but any organization will probably attempt to keep some patterns as hidden as possible when it shares its datasets with others. This paper focuses on preserving the privacy of sensitive patterns when inducing decision trees. We adopt a record augmentation approach for hiding sensitive classification rules in binary datasets. Such a hiding methodology is preferred over other heuristic solutions like output perturbation or crypto-graphic techniques - which restrict the usability of the data - since the raw data itself is readily available for public use. We propose a look-ahead approach using linear Diophantine equations in order to add the appropriate number of instances while maintaining the initial entropy of the nodes. This technique can be used to hide one or more decision tree rules in an optimal way.

References

[1]

Verykios, V.S., Bertino, E., Fovino, I.N., Provenza, L.P., Saygin, Y. and Theodoridis, Y. 2004. State of the Art Privacy Preserving Data Mining. SIGMOD Record 33(1), 50--57

Digital Library

[2]

Agrawal, R. and Srikant, R. 2000. Privacy-Preserving Data Mining. In: ACM SIGMOD Conference of Management of Data, pp. 439--450

Digital Library

[3]

Gkoulalas-Divanis, A., and Verykios, V.S. 2009. Privacy Preserving Data Mining: How far can we go? In Handbook of Research on Data Mining in Public and Private Sectors: Organizational and Government Applications. Eds. A. Syvajarvi and J. Stenvall. IGI Global.

[4]

Estivill-Castro, V., and Brankovic, L. 1999. Data swapping: Balancing privacy against precision in mining for logic rules. In: First International Conference on Data Warehousing and Knowledge Discovery.

Digital Library

[5]

Chang, L.W., and Moskowitz, I.S. 1998. Parsimonious Downgrading and Decision Trees applied to the Inference Problem. In: New Security Paradigms Workshop, pp. 82--89

Digital Library

[6]

Natwichai, J., Li, X., and Orlowska, M. 2005. Hiding Classification Rules for Data Sharing with Privacy Preservation. In: 7th International Conference on Data Warehousing and Knowledge Discovery, pp. 468--467.

Digital Library

[7]

Natwichai, J., Li, X., and Orlowska, M. 2006. A Reconstruction-based Algorithm for Classification Rules Hiding. In: 17th Australasian Database Conference, pp. 49--58.

Digital Library

[8]

Quinlan, J.R. C4.5. 1993. Programs for Machine Learning. Morgan Kaufmann.

Digital Library

[9]

Cohen, W.W. 1995. Fast effective rule induction. In: Machine Learning: the 12th International Conference.

Digital Library

[10]

Katsarou, A., Gkouvalas-Divanis, A., and Verykios, V. S. 2009. Reconstruction-based Classification Rule Hiding through Controlled Data Modification. In: IFIP International Federation for Information Processing 296, 449--458.

[11]

Natwichai, J., Sun, X., and Li, X.: Data Reduction Approach for Sensitive Associative Classification Rule Hiding. In: 19th Australian Database Conference (2008).

Digital Library

[12]

Wang, K., Fung, B.C.M., and Yu, P.S. 2005. Template-Based Privacy Preservation in Classification Problems. In:5th IEEE International Conference on Data Mining, pp. 466--473.

Digital Library

[13]

Delis, A., Verykios, V.S., and Tsitsonis, A. 2010. A Data Perturbation Approach to Sensitive Classification Rule Hiding. In:25th Symposium On Applied Computing.

Digital Library

[14]

Kalles, D., Verykios, V.S., Feretzakis, G., and Papagelis, A. 2016. Data set operations to hide decision tree rules. In: Twenty-second European Conference on Artificial Intelligence.

Digital Library

[15]

Kalles, D., Verykios, V.S., Feretzakis, G., and Papagelis, A. 2016. Data set operations to hide decision tree rules. In: 1st International Workshop on AI for Privacy and Security. Article No. 10.

Digital Library

[16]

Li, R., de Vries D., and Roddick J. 2011. Bands of Privacy Preserving Objectives: Classification of PPDM Strategies. In: 9th Australasian Data Mining Conference, pp. 137--151.

Digital Library

[17]

Kalles, D., and Morris, D.T. 1996. Efficient Incremental Induction of Decision Trees. Machine Learning 24(3), 231--242.

Digital Library

[18]

Kalles, D., and Papagelis, A. 2000. Stable decision trees: Using local anarchy for efficient incremental learning. International Journal on Artificial Intelligence Tools 9(1), 79--95.

[19]

Kalles, D., and Papagelis, A. 2010. Lossless fitness inheritance in genetic algorithms for decision trees. Soft Computing 14(9), 973--993.

Digital Library

[20]

Zantema, H., and Bodlaender, H.L. 2000. Finding Small Equivalent Decision Trees is Hard. International Journal of Foundations of Computer Science 11(2), 343--354.

Cited By

Feretzakis GVerykios V(2024)Trustworthy AI: Securing Sensitive Data in Large Language ModelsAI10.3390/ai50401345:4(2773-2800)Online publication date: 6-Dec-2024
https://doi.org/10.3390/ai5040134
Feretzakis GMitropoulos KKalles DS. Verykios V(2020)Local Distortion Hiding (LDH) Algorithm: a Java-based prototype11th Hellenic Conference on Artificial Intelligence10.1145/3411408.3411419(144-149)Online publication date: 2-Sep-2020
https://dl.acm.org/doi/10.1145/3411408.3411419
Feretzakis GKalles DVerykios V(2020)Knowledge Hiding in Decision Trees for Learning Analytics ApplicationsAdvances in Core Computer Science-Based Technologies10.1007/978-3-030-41196-1_3(37-54)Online publication date: 19-Jun-2020
https://doi.org/10.1007/978-3-030-41196-1_3
Show More Cited By

Recommendations

Balanced Approach for Hiding Sensitive Association Rules in Data Sharing Environment

Privacy preserving association rule mining protects the sensitive association rules specified by the owner of the data by sanitizing the original database so that the sensitive rules are hidden. In this paper, the authors study a problem of hiding ...
Hiding collaborative recommendation association rules on horizontally partitioned data

The study of privacy preserving data mining has become more important in recent years due to the increasing amount of personal data in public, the increasing sophistication of data mining algorithms to leverage this information, and the increasing ...
Decision Rules Induced From Sets of Decision Trees
Abstract
Decision rules belong to known forms of knowledge representation. Among popular measures of their quality length and support can be distinguished. Shorter rules are easier to understand and interpret. Support allows to present patterns hidden in ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

SETN '18: Proceedings of the 10th Hellenic Conference on Artificial Intelligence

July 2018

339 pages

ISBN:9781450364331

DOI:10.1145/3200947

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

EETN: Hellenic Artificial Intelligence Society
UOP: University of Patras
University of Thessaly: University of Thessaly, Volos, Greece

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 July 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

SETN '18

SETN '18: 10th Hellenic Conference on Artificial Intelligence

July 9 - 12, 2018

Patras, Greece

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
65
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 07 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Feretzakis GVerykios V(2024)Trustworthy AI: Securing Sensitive Data in Large Language ModelsAI10.3390/ai50401345:4(2773-2800)Online publication date: 6-Dec-2024
https://doi.org/10.3390/ai5040134
Feretzakis GMitropoulos KKalles DS. Verykios V(2020)Local Distortion Hiding (LDH) Algorithm: a Java-based prototype11th Hellenic Conference on Artificial Intelligence10.1145/3411408.3411419(144-149)Online publication date: 2-Sep-2020
https://dl.acm.org/doi/10.1145/3411408.3411419
Feretzakis GKalles DVerykios V(2020)Knowledge Hiding in Decision Trees for Learning Analytics ApplicationsAdvances in Core Computer Science-Based Technologies10.1007/978-3-030-41196-1_3(37-54)Online publication date: 19-Jun-2020
https://doi.org/10.1007/978-3-030-41196-1_3
Feretzakis GKalles DVerykios V(2019)Using Minimum Local Distortion to Hide Decision Tree RulesEntropy10.3390/e2104033421:4(334)Online publication date: 28-Mar-2019
https://doi.org/10.3390/e21040334
Feretzakis GKalles DVerykios V(2019)Local Distortion Hiding in Financial Technology application: a case study with a benchmark data set2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)10.1109/IISA.2019.8900733(1-4)Online publication date: Jul-2019
https://doi.org/10.1109/IISA.2019.8900733

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents