Testing audio compression autoencoders for seismology. Moving toward foundation models

Laurenti, Laura; Johnson, Christopher W.; Trappolini, Daniele; Tinti, Elisa; Galasso, Fabio; Marone, Chris

doi:10.1029/2025jh000787

Efforts to develop foundation models (FMs) for seismic waveform analysis are beginning to advance with progress still needed for geophysics applications. FMs learn a generalized representation of the data using a self-supervised approach, thus allowing several downstream tasks to be performed in a unified framework. We test a seismic waveform autoencoder as a FM, that is, pretrained for audio data compression and transfer this to seismic data. The pre-trained audio model is fine-tuned using seismic waveforms and evaluated with a waveform reconstruction task. The compression and reconstruction encourage the model to learn a generalizable representation of the input signal. The fine-tuned model is tested for generalization using waveforms from different data distributions. The embedding signal is applied to downstream tasks for foreshock and aftershock classification, early warning ground motion estimates, and earthquake phase detection. We systematically test different fine-tuning procedures to evaluate the effectiveness of the pre-trained model encoder for producing a generalized hidden space. The results show improvement when compared to task-specific models, even when applying few-shot learning. The performance of this model highlights the strengths and limitations as a FM and provides insight for future efforts to develop FMs for seismology.

Testing audio compression autoencoders for seismology. Moving toward foundation models / Laurenti, L., Johnson, C.W., Trappolini, D., Tinti, E., Galasso, F., Marone, C.. - In: JOURNAL OF GEOPHYSICAL RESEARCH. MACHINE LEARNING AND COMPUTATION. - ISSN 2993-5210. - 3:1(2026). [10.1029/2025jh000787]

Testing audio compression autoencoders for seismology. Moving toward foundation models

Laurenti, Laura^Primo;Johnson, Christopher W.;Trappolini, Daniele;Tinti, Elisa;Galasso, Fabio^Penultimo;Marone, Chris^Ultimo

2026

Abstract

Efforts to develop foundation models (FMs) for seismic waveform analysis are beginning to advance with progress still needed for geophysics applications. FMs learn a generalized representation of the data using a self-supervised approach, thus allowing several downstream tasks to be performed in a unified framework. We test a seismic waveform autoencoder as a FM, that is, pretrained for audio data compression and transfer this to seismic data. The pre-trained audio model is fine-tuned using seismic waveforms and evaluated with a waveform reconstruction task. The compression and reconstruction encourage the model to learn a generalizable representation of the input signal. The fine-tuned model is tested for generalization using waveforms from different data distributions. The embedding signal is applied to downstream tasks for foreshock and aftershock classification, early warning ground motion estimates, and earthquake phase detection. We systematically test different fine-tuning procedures to evaluate the effectiveness of the pre-trained model encoder for producing a generalized hidden space. The results show improvement when compared to task-specific models, even when applying few-shot learning. The performance of this model highlights the strengths and limitations as a FM and provides insight for future efforts to develop FMs for seismology.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2026
			
	Parole chiave
	
				foundation models; seismic data; autoencoder
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				Testing audio compression autoencoders for seismology. Moving toward foundation models / Laurenti, L., Johnson, C.W., Trappolini, D., Tinti, E., Galasso, F., Marone, C.. - In: JOURNAL OF GEOPHYSICAL RESEARCH. MACHINE LEARNING AND COMPUTATION. - ISSN 2993-5210. - 3:1(2026). [10.1029/2025jh000787]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

File	Dimensione	Formato
Laurenti_Testing_2026.pdf accesso aperto Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Creative commons Dimensione 2.22 MB Formato Adobe PDF	2.22 MB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1768369

Citazioni

ND

ND

ND

Catalogo dei prodotti della ricerca