This dataset documents microfluidic production runs of liposome formulations generated across two independent laboratories using standardized lipid compositions and controlled flow conditions. The data include formulation parameters, microfluidic operating settings, and dynamic light scattering measurements of vesicle size and polydispersity, providing structured input–output relationships suitable for data-driven analysis. Raw spreadsheets from each laboratory were harmonized using a reproducible preprocessing workflow implemented in Python, which performs column standardization, fuzzy-matching corrections, physical-range validation, chip-type filtering, and dataset consolidation. The cleaned dataset comprises 304 micromixer-produced liposome formulations, while a separate file contains 12 independent wet-lab validation formulations. Two additional dataset extensions were generated via Gaussian-noise perturbation and SMOTENC-based oversampling to support machine-learning benchmarking and algorithm comparison. Crosslaboratory records enable evaluation of operator variability, equipment reproducibility, and robustness of predictive modeling workflows. Metadata files documenting feature descriptions, naming conventions, and physical bounds facilitate reuse in automated pipelines and FAIR-compliant repositories. All datasets, raw files, preprocessing scripts, and cleaning logs are publicly available on Zenodo (https://doi.org/10.5281/zenodo.17867478), enabling regression benchmarking, inversedesign studies, and comparative evaluation of modeling approaches for formulation–property relationships in microfluidic liposome synthesis.
Smart microfluidics. A curated dataset of microfluidic liposome formulations with cross-laboratory validation for machine-learning applications / Lavagna, Leonardo; Buttitta, Giorgio; Bonacorsi, Simone; Barbarito, Chiara; Moliterno, Mauro; Saito, Gabriele; Oddone, Irene; Verdone, Giuliana; Raimondi, Sergio; Panella, Massimo. - In: DATA IN BRIEF. - ISSN 2352-3409. - 66:(2026), pp. 1-12. [10.1016/j.dib.2026.112667]
Smart microfluidics. A curated dataset of microfluidic liposome formulations with cross-laboratory validation for machine-learning applications
Leonardo Lavagna;Giorgio Buttitta;Simone Bonacorsi;Mauro Moliterno;Massimo Panella
2026
Abstract
This dataset documents microfluidic production runs of liposome formulations generated across two independent laboratories using standardized lipid compositions and controlled flow conditions. The data include formulation parameters, microfluidic operating settings, and dynamic light scattering measurements of vesicle size and polydispersity, providing structured input–output relationships suitable for data-driven analysis. Raw spreadsheets from each laboratory were harmonized using a reproducible preprocessing workflow implemented in Python, which performs column standardization, fuzzy-matching corrections, physical-range validation, chip-type filtering, and dataset consolidation. The cleaned dataset comprises 304 micromixer-produced liposome formulations, while a separate file contains 12 independent wet-lab validation formulations. Two additional dataset extensions were generated via Gaussian-noise perturbation and SMOTENC-based oversampling to support machine-learning benchmarking and algorithm comparison. Crosslaboratory records enable evaluation of operator variability, equipment reproducibility, and robustness of predictive modeling workflows. Metadata files documenting feature descriptions, naming conventions, and physical bounds facilitate reuse in automated pipelines and FAIR-compliant repositories. All datasets, raw files, preprocessing scripts, and cleaning logs are publicly available on Zenodo (https://doi.org/10.5281/zenodo.17867478), enabling regression benchmarking, inversedesign studies, and comparative evaluation of modeling approaches for formulation–property relationships in microfluidic liposome synthesis.| File | Dimensione | Formato | |
|---|---|---|---|
|
Lavagna_Smart-microfluidics_2026.pdf
accesso aperto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Creative commons
Dimensione
1.8 MB
Formato
Adobe PDF
|
1.8 MB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


