Effect of Training Data Ratio and Normalizing on Fatigue Lifetime Prediction of Aluminum Alloys with Machine Learning

Matin, M.; Azadi, M.

doi:10.5829/ije.2024.37.07a.09

Document Type : Original Article

Authors

Faculty of Mechanical Engineering, Semnan University, Semnan, Iran

Abstract

Estimating the fatigue lifetime of piston aluminum alloys is critical, particularly in the automotive industry. This paper investigates how different normalization methods affect the performance of fatigue lifetime estimation with Extreme Gradient Boosting (XGBoost), a supervised machine learning method. The dataset includes various physical and experimental inputs for an aluminum alloy, together with the corresponding fatigue lifetime outputs. Before fitting the XGBoost model, several preprocessing methods were applied to the fatigue lifetime values, and each was evaluated using the Root Mean Square Error (RMSE), the coefficient of determination (R²), and the Scatter Band (SB). The results indicate that modeling the logarithm of the fatigue lifetime performs best when XGBoost is trained on 100% of the data. However, other normalization methods estimate the test data more accurately when the data are split into an 80% training set and a 20% test set.
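The preprocessing and evaluation steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. All function names and the sample lifetimes are hypothetical, and fitting the XGBoost model is left out. The scatter band is assumed here to be the largest ratio between predicted and experimental lifetime, taken in whichever direction is greater than one; the paper may define it differently.

```python
import math
import random


def log10_transform(values):
    """Logarithmic preprocessing of fatigue lifetimes (cycles must be > 0)."""
    return [math.log10(v) for v in values]


def min_max(values):
    """Min-max normalization to [0, 1]; assumes the values are not all equal."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def rmse(y_true, y_pred):
    """Root mean square error."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def r2(y_true, y_pred):
    """Coefficient of determination."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def scatter_band(n_true, n_pred):
    """Assumed definition: worst-case lifetime ratio, always >= 1."""
    return max(max(t / p, p / t) for t, p in zip(n_true, n_pred))


def train_test_split(rows, test_ratio=0.2, seed=0):
    """Shuffle a copy of the rows and split it, e.g. 80% train / 20% test."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


# Made-up experimental and predicted lifetimes (cycles), for illustration only.
lifetimes = [1.0e4, 5.0e4, 2.0e5, 1.0e6, 4.0e6]
predicted = [2.0e4, 4.0e4, 2.5e5, 8.0e5, 2.0e6]

log_true = log10_transform(lifetimes)
log_pred = log10_transform(predicted)
print("RMSE (log10):", rmse(log_true, log_pred))
print("R2 (log10):", r2(log_true, log_pred))
print("Scatter band:", scatter_band(lifetimes, predicted))
```

In this sketch the error metrics are computed on the log-transformed lifetimes, while the scatter band is computed on the raw cycle counts, since it is a ratio of lifetimes. Whether the paper evaluates its metrics in the transformed or the original scale is not stated in the abstract.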

Main Subjects

Fatigues, friction

Volume 37, Issue 7
TRANSACTIONS A: Basics
July 2024
Pages 1296-1305
