This paper presents Fauno, the first and largest open-source Italian conversational Large Language Model (LLM). Our goal with Fauno is to democratize the study of LLMs in Italian, demonstrating that obtaining a fine-tuned conversational bot with a single GPU is possible. In addition, we release a collection of datasets for conversational AI in Italian. The datasets on which we fine-tuned Fauno include various topics such as general question answering, computer science, and medical questions. We release our code and datasets on https://github.com/RSTLess-research/Fauno-Italian-LLM
Fauno: The Italian Large Language Model that will leave you senza parole! / Bacciu, Andrea; Trappolini, Giovanni; Santilli, Andrea; Rodolà, Emanuele; Silvestri, Fabrizio. - 3448:(2023), pp. 9-17. (Intervento presentato al convegno IIR2023: 13th Italian Information Retrieval Workshop tenutosi a Pisa; Italy).
Fauno: The Italian Large Language Model that will leave you senza parole!
Andrea Bacciu
Primo
;Giovanni TrappoliniSecondo
;Emanuele RodolàPenultimo
;Fabrizio SilvestriUltimo
2023
Abstract
This paper presents Fauno, the first and largest open-source Italian conversational Large Language Model (LLM). Our goal with Fauno is to democratize the study of LLMs in Italian, demonstrating that obtaining a fine-tuned conversational bot with a single GPU is possible. In addition, we release a collection of datasets for conversational AI in Italian. The datasets on which we fine-tuned Fauno include various topics such as general question answering, computer science, and medical questions. We release our code and datasets on https://github.com/RSTLess-research/Fauno-Italian-LLMFile | Dimensione | Formato | |
---|---|---|---|
Bacciu_Fauno_2023.pdf
accesso aperto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Creative commons
Dimensione
1 MB
Formato
Adobe PDF
|
1 MB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.