Interpretable Machine Learning for High-Strength High-Entropy Alloy Design
High-entropy alloys (HEAs) are metallic materials with solid solutions stabilized by high mixing entropy. Some exhibit excellent strength, often accompanied by additional properties such as magnetic, invar, corrosion, or cryogenic response. This has spurred efforts to discover new HEAs, but the vast compositional search space has made these efforts challenging. Here we present a framework to predict and optimize the yield strength of face-centered cubic (FCC) HEAs, using CoCrFeMnNi-based alloys as a case study due to abundant available data. Our novel Residual Hybrid Learning Model (RELM) integrates Random Forest and Gradient Boosting, enhanced by material attribute data, to handle sparse, skewed datasets for real-world alloys. A hybrid Generative Adversarial Network-Variational Autoencoder model explores new alloy compositions beyond existing datasets. By incorporating processing parameters, which determine the microstructure and thus strength, RELM achieves an R$^2$ score of 0.915, surpassing traditional models. SHapley Additive Explanations (SHAP) and Partial Dependencies enhance interpretability, revealing composition-processing-property relationships, as validated by experiments, including X-ray diffraction, SEM analysis, and tensile testing. The model discovered two novel Co$_{20}$Cr$_{16}$Fe$_{20}$Mn$_{16}$Ni$_{24}$Al$_4$ and Co$_{24}$Cr$_{12}$Fe$_{12}$Mn$_{16}$Ni$_{28}$Al$_4$Si$_4$ HEAs with a maximum possible yield strength of 842 and 937 MPa, significantly exceeding previously reported values for these alloy systems. This study pioneers interpretable machine learning in alloy design, providing a rigorous, data-driven approach to discovering, processing, and optimizing real-world materials. The findings highlight the critical role of both compositional and post-fabrication processing parameters in advancing the understanding of composition-processing-property relationships.
PDF Abstract