The performance of machine learning models depends on several factors, including data normalization, which can significantly improve its accuracy. There are many standardization techniques, and none is universally suitable; the choice depends on the characteristics of the problem, the predictive task, and the needs of the model used. This study analyzes how normalization techniques influence the outcomes of real estate price regression models using machine learning to uncover complex relationships between urban and economic factors. Six normalization techniques are employed to assess how they affect the estimation of relationships between property value and factors like social degradation, resident population, per capita income, green spaces, building conditions, and degraded neighborhood presence. The study’s findings underscore the pivotal role of normalization in shaping the perception of variables, accentuating critical thresholds, or distorting anticipated functional relationships. The work is the first application of a methodological approach to define the best technique on the basis of two criteria: statistical reliability and empirical evidence of the functional relationships obtainable with each standardization technique. Notably, the study underscores the potential of machine-learning-based regression to circumvent the limitations of conventional models, thereby yielding more robust and interpretable results.
The Interpretative Effects of Normalization Techniques on Complex Regression Modeling: An Application to Real Estate Values Using Machine Learning / Anelli, Debora; Morano, Pierluigi; Tajani, Francesco; Guarini, Maria Rosaria. - In: INFORMATION. - ISSN 2078-2489. - 6:16(2025). [10.3390/info16060486]
The Interpretative Effects of Normalization Techniques on Complex Regression Modeling: An Application to Real Estate Values Using Machine Learning
Debora Anelli
Primo
;Pierluigi MoranoSecondo
;Francesco TajaniPenultimo
;Maria Rosaria GuariniUltimo
2025
Abstract
The performance of machine learning models depends on several factors, including data normalization, which can significantly improve its accuracy. There are many standardization techniques, and none is universally suitable; the choice depends on the characteristics of the problem, the predictive task, and the needs of the model used. This study analyzes how normalization techniques influence the outcomes of real estate price regression models using machine learning to uncover complex relationships between urban and economic factors. Six normalization techniques are employed to assess how they affect the estimation of relationships between property value and factors like social degradation, resident population, per capita income, green spaces, building conditions, and degraded neighborhood presence. The study’s findings underscore the pivotal role of normalization in shaping the perception of variables, accentuating critical thresholds, or distorting anticipated functional relationships. The work is the first application of a methodological approach to define the best technique on the basis of two criteria: statistical reliability and empirical evidence of the functional relationships obtainable with each standardization technique. Notably, the study underscores the potential of machine-learning-based regression to circumvent the limitations of conventional models, thereby yielding more robust and interpretable results.| File | Dimensione | Formato | |
|---|---|---|---|
|
Anelli_Interpretative-Effects-Normalization_2025.pdf
accesso aperto
Note: articolo
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Creative commons
Dimensione
4.28 MB
Formato
Adobe PDF
|
4.28 MB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


