Catalogo dei prodotti della ricerca

In this article we propose the first multi-task benchmark for evaluating the performances of machine learning models that work on low level assembly functions. While the use of multi-task benchmark is a standard in the natural language processing (NLP) field, such practice is unknown in the field of assembly language processing. However, in the latest years there has been a strong push in the use of deep neural networks architectures borrowed from NLP to solve problems on assembly code. A first advantage of having a standard benchmark is the one of making different works comparable without effort of reproducing third part solutions. The second advantage is the one of being able to test the generality of a machine learning model on several tasks. For these reasons, we propose BinBench, a benchmark for binary function models. The benchmark includes various binary analysis tasks, as well as a dataset of binary functions on which tasks should be solved. The dataset is publicly available and it has been evaluated using baseline models.

BinBench: a benchmark for x64 portable operating system interface binary function representations / Console, F., D'Aquanno, G., Di Luna, G.A., Querzoni, L.. - In: PEERJ. COMPUTER SCIENCE. - ISSN 2376-5992. - 9:(2023). [10.7717/peerj-cs.1286]

BinBench: a benchmark for x64 portable operating system interface binary function representations

Console F.;D'Aquanno G.;Di Luna G. A.;Querzoni L.

2023

Abstract

In this article we propose the first multi-task benchmark for evaluating the performances of machine learning models that work on low level assembly functions. While the use of multi-task benchmark is a standard in the natural language processing (NLP) field, such practice is unknown in the field of assembly language processing. However, in the latest years there has been a strong push in the use of deep neural networks architectures borrowed from NLP to solve problems on assembly code. A first advantage of having a standard benchmark is the one of making different works comparable without effort of reproducing third part solutions. The second advantage is the one of being able to test the generality of a machine learning model on several tasks. For these reasons, we propose BinBench, a benchmark for binary function models. The benchmark includes various binary analysis tasks, as well as a dataset of binary functions on which tasks should be solved. The dataset is publicly available and it has been evaluated using baseline models.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2023
			
	Parole chiave
	
				assembly language; benchmark; binary functions; binary functions representation; binary similarity; compiler provenance; dataset; neural networks
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				BinBench: a benchmark for x64 portable operating system interface binary function representations / Console, F., D'Aquanno, G., Di Luna, G.A., Querzoni, L.. - In: PEERJ. COMPUTER SCIENCE. - ISSN 2376-5992. - 9:(2023). [10.7717/peerj-cs.1286]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

File	Dimensione	Formato
Console_BinBench_2023.pdf accesso aperto Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Creative commons Dimensione 1.63 MB Formato Adobe PDF	1.63 MB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1685762

Citazioni

1

1

1

social impact