Learning General Policies for Planning through GPT Models

Rossetti N.; Tummolo M.; Gerevini A. E.; Putelli L.; Serina I.; Chiari M.; Olivato M.

doi:10.1609/icaps.v34i1.31510

Transformer-based architectures, such as T5, BERT and GPT, have demonstrated revolutionary capabilities in Natural Language Processing. Several studies showed that deep learning models using these architectures not only possess remarkable linguistic knowledge, but they also exhibit forms of factual knowledge, common sense, and even programming skills. However, the scientific community still debates about their reasoning capabilities, which have been recently tested in the context of automated AI planning; the literature presents mixed results, and the prevailing view is that current transformer-based models may not be adequate for planning. In this paper, we address this challenge differently. We introduce a GPT-based model customised for planning (\model) to learn a general policy for classical planning by training the model from scratch with a dataset of solved planning instances. Once \model has been trained for a domain, it can be used to generate a solution plan for an input problem instance in that domain. Our training procedure exploits automated planning knowledge to enhance the performance of the trained model. We build and evaluate our GPT model with several planning domains, and we compare its performance w.r.t. other recent deep learning techniques for generalised planning, demonstrating the effectiveness of the proposed approach.

Learning General Policies for Planning through GPT Models / Rossetti, N.; Tummolo, M.; Gerevini, A. E.; Putelli, L.; Serina, I.; Chiari, M.; Olivato, M.. - 34:(2024), pp. 500-508. ( International Conference on Automated Planning and Scheduling, ICAPS, 2024 Banff, Canada ) [10.1609/icaps.v34i1.31510].

Learning General Policies for Planning through GPT Models

Rossetti N.^Primo;Tummolo M.;Gerevini A. E.;Putelli L.;Serina I.;Chiari M.;Olivato M.

2024

Abstract

Transformer-based architectures, such as T5, BERT and GPT, have demonstrated revolutionary capabilities in Natural Language Processing. Several studies showed that deep learning models using these architectures not only possess remarkable linguistic knowledge, but they also exhibit forms of factual knowledge, common sense, and even programming skills. However, the scientific community still debates about their reasoning capabilities, which have been recently tested in the context of automated AI planning; the literature presents mixed results, and the prevailing view is that current transformer-based models may not be adequate for planning. In this paper, we address this challenge differently. We introduce a GPT-based model customised for planning (\model) to learn a general policy for classical planning by training the model from scratch with a dataset of solved planning instances. Once \model has been trained for a domain, it can be used to generate a solution plan for an input problem instance in that domain. Our training procedure exploits automated planning knowledge to enhance the performance of the trained model. We build and evaluate our GPT model with several planning domains, and we compare its performance w.r.t. other recent deep learning techniques for generalised planning, demonstrating the effectiveness of the proposed approach.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2024
			
	Nome convegno
	
				International Conference on Automated Planning and Scheduling, ICAPS, 2024
			
	Parole chiave
	
				learning for planning; generalized planning; gpt; transformers; classical planning; llms
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				Learning General Policies for Planning through GPT Models / Rossetti, N.; Tummolo, M.; Gerevini, A. E.; Putelli, L.; Serina, I.; Chiari, M.; Olivato, M.. - 34:(2024), pp. 500-508. ( International Conference on Automated Planning and Scheduling, ICAPS, 2024 Banff, Canada ) [10.1609/icaps.v34i1.31510].

File allegati a questo prodotto

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1724772

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

13

ND

Catalogo dei prodotti della ricerca