Text-to-image systems based on generative pre-trained transformers have become pervasive in recent years. This result is mainly due to the generation tools' ease of use, application availability, and increasingly refined graphical and algorithmic capabilities. On the other hand, the non-controllability given by a random process and the dependency on existing information databases highlight some limitations of these automatic image-generation methods. The rapid construction of increasingly realistic digital images draws new boundaries between real and unreal, highlighting a strict relation between text and image regarding semantics and logic descriptions. Therefore, a critical look into the use of these applications as tools for a reliable representation of architecture becomes cogent. We started from representation established by the treatises, as in the case of orders. Most of these drawings, proportions, and rules are derived from descriptive parts in Vitruvius' text. However, the graphic interpretations result from the architect's experience and culture, which has become the basic grammar of architecture. This research stems precisely from the connection between Vitruvius' text, the new text-to-image contents and the established representations of the treatise writers. The comparison will consider both the image reliability and the geometric rules, testing the current potential of GPT systems for image creation and reliability.

GPT for Treatise Image Creation. A critical overview / Russo, Michele; Senatore, Luca James; Flenghi, Giulia. - (2026), pp. 197-214. - DIGITAL INNOVATIONS IN ARCHITECTURE, ENGINEERING AND CONSTRUCTION. [10.1007/978-3-032-04711-3_12].

GPT for Treatise Image Creation. A critical overview

Michele Russo;Luca James Senatore;Giulia Flenghi
2026

Abstract

Text-to-image systems based on generative pre-trained transformers have become pervasive in recent years. This result is mainly due to the generation tools' ease of use, application availability, and increasingly refined graphical and algorithmic capabilities. On the other hand, the non-controllability given by a random process and the dependency on existing information databases highlight some limitations of these automatic image-generation methods. The rapid construction of increasingly realistic digital images draws new boundaries between real and unreal, highlighting a strict relation between text and image regarding semantics and logic descriptions. Therefore, a critical look into the use of these applications as tools for a reliable representation of architecture becomes cogent. We started from representation established by the treatises, as in the case of orders. Most of these drawings, proportions, and rules are derived from descriptive parts in Vitruvius' text. However, the graphic interpretations result from the architect's experience and culture, which has become the basic grammar of architecture. This research stems precisely from the connection between Vitruvius' text, the new text-to-image contents and the established representations of the treatise writers. The comparison will consider both the image reliability and the geometric rules, testing the current potential of GPT systems for image creation and reliability.
2026
Representation Across Boundaries. New Links with AI, AI-GEN, and XR Tools for Cultural Heritage and Innovative Design
9783032047113
AI image uses; text-to-image; treatise; architecture orders; prompt analysis
02 Pubblicazione su volume::02a Capitolo o Articolo
GPT for Treatise Image Creation. A critical overview / Russo, Michele; Senatore, Luca James; Flenghi, Giulia. - (2026), pp. 197-214. - DIGITAL INNOVATIONS IN ARCHITECTURE, ENGINEERING AND CONSTRUCTION. [10.1007/978-3-032-04711-3_12].
File allegati a questo prodotto
File Dimensione Formato  
Russo_GPT-Treatise-Image-Creation_2026.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 7.61 MB
Formato Adobe PDF
7.61 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1762095
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact