Transformer-based architectures, such as T5, BERT and GPT, have demonstrated revolutionary capabilities in Natural Language Processing. Several studies showed that deep learning models using these architectures not only possess remarkable linguistic knowledge, but they also exhibit forms of factual knowledge, common sense, and even programming skills. However, the scientific community still debates about their reasoning capabilities, which have been recently tested in the context of automated AI planning; the literature presents mixed results, and the prevailing view is that current transformer-based models may not be adequate for planning. In this paper, we address this challenge differently. We introduce a GPT-based model customised for planning (\model) to learn a general policy for classical planning by training the model from scratch with a dataset of solved planning instances. Once \model has been trained for a domain, it can be used to generate a solution plan for an input problem instance in that domain. Our training procedure exploits automated planning knowledge to enhance the performance of the trained model. We build and evaluate our GPT model with several planning domains, and we compare its performance w.r.t. other recent deep learning techniques for generalised planning, demonstrating the effectiveness of the proposed approach.

Learning General Policies for Planning through GPT Models / Rossetti, N.; Tummolo, M.; Gerevini, A. E.; Putelli, L.; Serina, I.; Chiari, M.; Olivato, M.. - 34:(2024), pp. 500-508. (Intervento presentato al convegno International Conference on Automated Planning and Scheduling, ICAPS, 2024 tenutosi a Banff, Canada) [10.1609/icaps.v34i1.31510].

Learning General Policies for Planning through GPT Models

Rossetti N.
Primo
;
Tummolo M.;Gerevini A. E.
;
2024

Abstract

Transformer-based architectures, such as T5, BERT and GPT, have demonstrated revolutionary capabilities in Natural Language Processing. Several studies showed that deep learning models using these architectures not only possess remarkable linguistic knowledge, but they also exhibit forms of factual knowledge, common sense, and even programming skills. However, the scientific community still debates about their reasoning capabilities, which have been recently tested in the context of automated AI planning; the literature presents mixed results, and the prevailing view is that current transformer-based models may not be adequate for planning. In this paper, we address this challenge differently. We introduce a GPT-based model customised for planning (\model) to learn a general policy for classical planning by training the model from scratch with a dataset of solved planning instances. Once \model has been trained for a domain, it can be used to generate a solution plan for an input problem instance in that domain. Our training procedure exploits automated planning knowledge to enhance the performance of the trained model. We build and evaluate our GPT model with several planning domains, and we compare its performance w.r.t. other recent deep learning techniques for generalised planning, demonstrating the effectiveness of the proposed approach.
2024
International Conference on Automated Planning and Scheduling, ICAPS, 2024
learning for planning; generalized planning; gpt; transformers; classical planning; llms
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Learning General Policies for Planning through GPT Models / Rossetti, N.; Tummolo, M.; Gerevini, A. E.; Putelli, L.; Serina, I.; Chiari, M.; Olivato, M.. - 34:(2024), pp. 500-508. (Intervento presentato al convegno International Conference on Automated Planning and Scheduling, ICAPS, 2024 tenutosi a Banff, Canada) [10.1609/icaps.v34i1.31510].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1724772
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact