This paper presents Fauno, the first and largest open-source Italian conversational Large Language Model (LLM). Our goal with Fauno is to democratize the study of LLMs in Italian by demonstrating that a conversational bot can be fine-tuned on a single GPU. In addition, we release a collection of datasets for conversational AI in Italian. The datasets on which we fine-tuned Fauno cover a range of topics, including general question answering, computer science, and medical questions. Our code and datasets are available at https://github.com/RSTLess-research/Fauno-Italian-LLM
Fauno: The Italian Large Language Model that will leave you senza parole! / Bacciu, Andrea; Trappolini, Giovanni; Santilli, Andrea; Rodolà, Emanuele; Silvestri, Fabrizio. - (2023). (Paper presented at the Italian Information Retrieval (IIR) 2023 conference, held in Pisa).
Fauno: The Italian Large Language Model that will leave you senza parole!
Andrea Bacciu (first author); Giovanni Trappolini (second author); Emanuele Rodolà (penultimate author); Fabrizio Silvestri (last author)
2023