GraphMoco:a Graph Momentum Contrast Model that Using Multimodel Structure Information for Large-scale Binary Function Representation Learning

RuiJin, Sun; ShiZe, Guo; Xi, Guo; ZhiSong, Pan

Computer Science > Cryptography and Security

arXiv:2305.10826v1 (cs)

[Submitted on 18 May 2023 (this version), latest version 18 Jul 2023 (v2)]

Title:GraphMoco:a Graph Momentum Contrast Model that Using Multimodel Structure Information for Large-scale Binary Function Representation Learning

Authors:Sun RuiJin, Guo ShiZe, Guo Xi, Pan ZhiSong

View PDF

Abstract:The ability to compute similarity scores of binary code at the function level is essential for cyber security. A single binary file can contain tens of thousands of functions. A deployable learning framework for cybersecurity applications needs to work not only accurately but also efficiently with large amounts of data. Traditional methods suffer from two drawbacks. First, it is very difficult to annotate different pairs of functions with accurate labels. These supervised learning methods can easily be overtrained with inaccurate labels. The second is that they either use the pre-trained encoder or use the fine-grained graph comparison. However, these methods have shortcomings in terms of time or memory consumption. We focus on large-scale Binary Code Similarity Detection (BCSD) and to mitigate the traditional problems, we propose GraphMoco: a graph momentum contrast model that uses multimodal structure information for large-scale binary function representation learning. We take an unsupervised learning approach and make full use of the structural information in the binary code. It does not require manually labelled similar or dissimilar information. Our models perform efficiently on large amounts of training data. Our experimental results show that our method outperforms the state-of-the-art in terms of accuracy.

Comments:	34 pages,5 figures
Subjects:	Cryptography and Security (cs.CR)
Cite as:	arXiv:2305.10826 [cs.CR]
	(or arXiv:2305.10826v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2305.10826

Submission history

From: Ruijin Sun [view email]
[v1] Thu, 18 May 2023 09:07:40 UTC (1,204 KB)
[v2] Tue, 18 Jul 2023 16:05:16 UTC (1,996 KB)

Computer Science > Cryptography and Security

Title:GraphMoco:a Graph Momentum Contrast Model that Using Multimodel Structure Information for Large-scale Binary Function Representation Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:GraphMoco:a Graph Momentum Contrast Model that Using Multimodel Structure Information for Large-scale Binary Function Representation Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators