APP下载

Accurate Deep Potential model for the Al–Cu–Mg alloy in the full concentration space*

2021-05-24WanrunJiang姜万润YuzhiZhang张与之LinfengZhang张林峰andHanWang王涵

Chinese Physics B 2021年5期

Wanrun Jiang(姜万润), Yuzhi Zhang(张与之), Linfeng Zhang(张林峰), and Han Wang(王涵)

1Songshan Lake Materials Laboratory,Dongguan 523808,China

2Institute of Physics,Chinese Academy of Sciences,Beijing 100190,China

3Beijing Institute of Big Data Research,Beijing 100871,China

4Yuanpei College of Peking University,Beijing 100871,China

5Program in Applied and Computational Mathematics,Princeton University,Princeton,NJ 08544,USA

6Laboratory of Computational Physics,Institute of Applied Physics and Computational Mathematics,Beijing 100088,China

Keywords: potential energy surface,deep learning,Al–Cu–Mg alloy,materials simulation

1. Introduction

Aluminum, copper, and magnesium (Al–Cu–Mg) based alloys are among the most versatile metallic materials that meet all the requirements of lightweight, high strength, and good fatigue resistance. They are intensively used in automotive, aviation, aerospace, and engineering machinery industries, and expend the scope of activity and the production efficiency of human civilization.[1]The industrial application of Al–Cu–Mg alloys could be traced back to 1906,when age hardening, as the key phenomenon enhancing their specific strength, was first discovered in this material.[2]It led to the well known wrought alloy, duralumin, and soon enabled structural members of Zeppelin airships and the world first passenger airplane without frame or bracing wire to support its wings. After a century, derived commercial aluminum alloy 2000 series are still one of the major material families used for aircraft manufacture.[3]Decades of explorations have linked up the conceptual road-map of age hardening in Al–Cu–Mg alloys, giving a mainline from the super saturated solid solution to a sequence of precipitations that finally impede the movement of dislocations.[4–6]The presence, morphology, and competition of micro-structures were observed by experiments,[6–9]while their chemical formulas,atomic configurations, and relative stability were informed by theoretical calculations.[5,10,11]Phenomenological experiences have been accumulated along the road-map, revealing further complexity of the phenomenon including, but being not limited to,the involvement of distinguishing processes like dislocation–solute interaction and solute clustering in different ageing stages,[12,13]the evolution of micro-structures under various ageing conditions,[14,15]the ageing improvements on desired performance like dimensional stability,[16]and variants of related processes in more complex scenario like Al–Cu–Mg nanocrystals.[17]

A better understanding of the fundamental principles underlying the experiments needs the atom-resolved tracking of the dynamical processes, which is usually beyond the capability of current experiment equipment. Therefore, molecular simulation techniques like molecular dynamics (MD) are a promising alternative, which is in principle capable to produce atom-scale information of the physical quantities of interest in the investigated processes. MD simulations have provided insights into the precipitation strengthening of Al–Cu alloys, which mechanisms might also be inspiring for understanding related processes in Al–Cu–Mg systems. For example, the dislocations–precipitate interactions, that determine the strengthening effect, are dominated by a looping process for most edge dislocation cases[18]and are significantly influenced by the temperature-dependant cross-slip for screw dislocations.[19]The transition among meta-stable precipitates could be adjusted by the vacancy segregation, while the vacancy densities could further be controlled by heating the surface of the designed nanoscale specimen or exposing bulk counterparts to intense irradiation.[20]Besides,possible common factors for precipitation strengthening in different alloys could also be perceived by MD simulations,such as the importance of local interface curvature in driving the dislocation–precipitate interactions.[21]

Ideally,ab initio molecular dynamics(AIMD),[22]which evolves atomic systems under the Newton’s law of motion with forces generated on-the-fly by first-principles-based methods, holds the promise to provide an accurate description of Al–Cu–Mg alloy systems. However, due to the high computational cost, first-principles-based methods like density functional theory(DFT)[23]can only be used to simulate materials within the spacial scale of a few hundreds of atoms and the temporal scale of a few dozens of picoseconds, using desktop workstations or ordinary computer clusters. Despite successes in computing basic quantities like structural factors, heat of formation, and cohesive energy for important precipitation,[10,11,24]they are in general not computationally feasible to reveal the atomistic mechanism in large-scale dynamic processes like phase transition, precipitation, and interface migration under working conditions, whose spacialtemporal scales are far beyond the current capability of AIMD.

In these cases, one has to resort to empirical force fields(EFFs). Representative examples include the Lennard–Jones potential,[25]the Stillinger–Weber potential,[26]the embedded-atom method (EAM) potential,[27]and the modified embedded atom method (MEAM) potential.[28]By approximating the potential energy surface(PES)with relatively simple and analytical functions and optimizing the parameters with a group of target properties,EFF-based MD is orders of magnitude more efficient than AIMD.However,it has been challenging to construct EFF models for multi-component systems like the Al–Cu–Mg alloy. The difficulties are two-folds:(1) The relatively simple function form makes it difficult to fit all target properties in the full concentration space, especially in the case of ternary alloy. (2)The number of tunable parameters in the EFF grows quadratically with respect to the number of components of the system. The number of parameters in an M-component system is M(M+1)/2 times larger than that of a single-component system,so the optimization of the parameters becomes more difficult, if not impossible, for multi-component systems.

For the Al–Cu–Mg system, most of the existing EFF models are restricted to pure metals[29–36]and binary components.[37–42]Although some of them have been demonstrated to provide decent descriptions of the Al end in the Al–Cu phase diagram[39]and the solid–liquid phase equilibria in the target component range of Al–Mg alloy,[42]the generalization of these single or binary component EFFs to ternary Al–Cu–Mg alloy is not straightforward. One of the few examples of multi-component EFF[43]that covers Al, Cu, and Mg is constructed within the formalism of MEAM potential,[28]and is explicitly optimized for single-element systems of Al,Si, Mg, Cu, and Fe, their binary pairs in a hypothetical NaCl reference structure,as well as some stable binary intermetallic components. The fitting targets include a group of selected properties obtained from ab initio calculations, such as the generalized stacking fault energy for single elements, heat of formation, equilibrium volume, elastic moduli of hypothetical binary pairs,and heat of formation for some stable binary components. The transferability of this MEAM potential to the full concentration space was implicitly suggested,but was found not satisfactory for fundamental properties partially beyond the fitted sets for developing the potential.[44]Therefore,there is a demand for a PES model that is substantially less expensive than ab initio methods,and,in the meanwhile,provides remarkably higher accuracy and wider applicable range than the state-of-the-art EFFs. Ideally, it should describe the static, mechanical, and dynamical properties with a satisfactory accuracy in the full concentration space of the ternary Al–Cu–Mg system.

Recent advances in the machine learning (ML)techniques[45–50]provide a promising route towards an accurate and efficient PES model for ternary Al–Cu–Mg alloys. Here we consider the smooth edition of Deep Potential(DP),[50]a deep learning based PES representation. It has been demonstrated that, when trained with ab initio data, DP agrees well with an ab initio PES and has an efficiency comparable to that of EFFs. The representation ability of DP stems from the outstanding ability of the deep neural networks to approximate high-dimensional functions,[51,52]and from a group of automatically generated symmetry-preserving descriptors which faithfully capture the information lying in the local environment of atoms. Compared with most ML models, the flexible and trainable descriptors of DP make it more suitable for complex tasks, e.g., multi-component systems like TiO2–H2O interface,[53]strengthening precipitates from aluminum alloys,[20]as well as a (Zr0.2Hf0.2Ti0.2Nb0.2Ta0.2)C high entropy alloy system with six chemical species.[54]To construct the DP model with an optimal set of data,a concurrent learning strategy,the deep potential generator(DP-GEN),[44]is further established. Implemented as a close-loop iterative workflow,DP-GEN could generate the most compact and adequate data set that guarantees the uniform accuracy of DP in the explored configuration space. The effectiveness and efficiency of DPGEN have been validated in the cases of pure Al, pure Mg,and the Al–Mg alloy,[44]as well as pure Cu,[55]where the DP models outperform the state-of-the-art MEAM potentials in almost all examined properties.

In this paper,we construct the DP model for the Al–Cu–Mg alloys using the DP-GEN scheme for the whole concentration space,i.e.,AlxCuyMgzwith 0 ≤x,y,z ≤1,x+y+z=1, and the configuration space covering a temperature range around 50.0 K to 2579.8 K and a pressure range around 1 bar to 50000 bar. The resulting model gives generally better consistency with DFT results than the state-of-the-art MEAM model[43]when predicting basic energetic and mechanical properties for systems distributed in the full concentration space of Al–Cu–Mg alloy.The tested systems are taken from the Materials Project (MP) database, and most of them are not explicitly covered by the training set. The tested properties include formation energy and volume for equilibrium state,equation of state(EOS),elastic modulus,unrelaxed surface formation energy for both binary and ternary alloys, as well as the relaxed vacancy formation energy and interstitial energy for ternary alloys.

In the following,we will first give a brief introduction of the DP model and DP-GEN scheme for the Al–Cu–Mg system in Section 2. Then in Section 3,we will focus on the comparison of various properties predicted by DFT,DP,and MEAM.Finally,a conclusion will be given in Section 4.

2. Method

The DP-GEN scheme works in an iterative manner. Each iteration is composed of three steps,exploration,labeling,and training. With an initial guess of an ensemble of DP models generated from a relatively simple initial data set, DP-GEN starts to explore the configurational and chemical space by a sampler driven by DP-based simulation. The explored configurations are then classified as accurate, candidate, and failed,according to the model deviation defined as the standard deviation of the atomic force predictions of an ensemble of models.Models in the ensemble use the same neural network architecture and are trained on the same data set. The only difference among them is the random seeds used to initialize parameters of the neural network. Thus,their predictions would converge if the configuration has been well covered by current data set and deviate from each other otherwise. The accurate set consists of the configurations with the model deviation lower than a user-provide lower bound. The failed configurations are those with the model deviation higher than a userprovided upper bound. The failed configurations are typically generated by the DP models with a relatively poor quality in early DP-GEN iterations,and are likely to be unphysical,e.g.,with overlapping atoms. A subset of the candidate configurations with errors between the lower and upper bounds are then sent to the labeling step, at which the labels of a configuration, i.e., the energy, the virial tensor, and the atomic forces,are computed by DFT. Finally, the labeled configurations are added to the existing data set. From the updated data set a new ensemble of DP models are trained,and a new iteration starts from the exploration step. The DP-GEN iterations converge when almost all the explored configurations are classified as accurate. More details of the DP-GEN scheme are referred to previous works.[44,55]

It should be remarked that(1)the quality of the final DP model is insensitive to the data set that constructs the initial DP model to start the DP-GEN scheme. However,with a poor initial guess of DP,more labeled configurations from later iterations would be needed to improve the model. (2)The conditions under which the DP model is used should be determined by the user beforehand,and the exploration strategy is accordingly designed to ergodically explore the relevant configuration space. In this context,the sampler can be,but is not restricted to, molecular dynamics, Monte–Carlo simulation,structure prediction schemes,or enhanced sampling methods.(3) By classifying and sub-sampling the explored configurations, only the configurations of low accuracy are sent to the labeling step, so the most compact data set, i.e., the minimal data set that helps DP achieve a certain accuracy,can be generated during the DP-GEN iterations.

The DP-GEN iterations are conducted by the open-source package DP-GEN.[55]More details on initial data set, exploration,labeling,and training steps are provided as follows.

Initial data set The initial data set used to kick-off the training and exploration can be generated through an automatic workflow integrated in the DP-GEN scheme, which allows the whole scheme to start from little prior knowledge.For alloys, four procedures are sequentially conducted. First,for each user-specific crystalline structure and each possible concentration,the atomic species is randomly assigned on the lattice points. Then, random perturbations are performed on the coordinates by adding values drawn from a uniform distribution in the range[−pa,pa].Perturbations are also performed on the cell vectors by a symmetric deformation matrix that is constructed by adding random noise drawn from a uniform distribution in the range[−pc,pc]to an identity matrix. Next,starting from the perturbed structures, AIMD simulations are conducted to produce labeled data with DFT calculated energy, force, and virial tensor. Finally, the labeled data are used to create the initial data sets. The preparation for singleelement systems follows the same procedure without the need to consider possible concentrations. Further details for preparing initial data sets can be found in previous reports.[44,55]It is worth noting that,users can generate and collect initial data sets through other appropriate manners and tools. The procedure mentioned above just provides a convenient protocol and the quality of the finial model is not sensitive to the initial data sets.

In this work, for binary and ternary alloys, we consider body-centered-cube (BCC), hexagonal-close-packed (HCP),and face-centered-cube (FCC) structures, within 2×2×2 super-cells. For perturbations, pais set to 0.01 ˚A and pcis set to 3%. For single-element cases, training data sets for copper are inherited from previous work,[55]while those for aluminium and magnesium are prepared through individual single-element DP-GEN procedure in similar strategies. Besides, scaling factors compressing the cell are used in the above procedure for binary and ternary alloys. Generated labeled data are added to the final data sets to further cover the high-pressure regime of the configuration space. Specific values are from 0.84 to 0.98 with the interval of 0.02 for binary alloys and from 0.85 to 0.97 with the interval of 0.03 for ternary alloys.

Exploration In this work, the configuration and chemical space are explored by DP-based molecular dynamics(DeePMD) using the LAMMPS package.[56]Two types of structures, bulk and surface, are considered. For bulk systems,the initial configurations are constructed from randomly perturbed FCC, BCC, and HCP, in 2×2×2 super-cells with lattice positions randomly occupied by Al, Mg, or Cu atoms.It is noted that the 2×2×2 super-cells are large enough to capture the local atomic environments needed to construct the accurate DP model for the Al–Cu–Mg alloys(see the supplementary text, Figs. S1 and S2 for the accuracy of the model when predicting alloy structures with larger super-cells up to 12×12×12). Isothermo-isobaric(NpT)MD simulations are conducted at thermodynamic states ranging from 50.0 K to 2579.8 K (twice the experimental melting point of Cu) and from 1 bar to 50000 bar. In the first 40 iterations (0–39), binary alloys(Al–Cu,Al–Mg,and Cu–Mg)are explored. Then from iterations 40 to 90,configurations of ternary alloys(Al–Cu–Mg)are sampled(see supporting information Table S1 for more details). It is noted that in each iteration, NpT simulations of 2 temperatures and 8 pressures covering the range 1–50000 bar are performed, thus configurations are sampled from 16 different thermodynamic conditions. The temperature increases as the iterations proceed, and the pressures are set to fixed values (1, 10, 100, 1000, 5000, 10000, 20000,50000 bar).The surface initial configurations are generated by displacing two halves of FCC and HCP structures along certain crystallographic directions. The structures are prepared in two ways:(1)Conventional cells are copied along the displacing direction so that the thickness of the surface structures is at least 10 ˚A.(2)Conventional cells are replicated by 2×1×1 or 2×2×1 times (see supporting information Table S1 for the assignment of replications to crystals),and then are copied along the displacing direction so that the thickness of surface is at least three layers of atoms. Such sample sizes allow a balance between the rationality in simulating surface environments and the efficiency in successive labeling. We choose{100,110,111}and{001,100,110}as directions of displacing for FCC and HCP, respectively. The atomic positions of these displaced structures are randomly occupied by Al, Mg,or Cu atoms. Simulation cells prepared by(1)are scaled with scaling factors from 0.96 to 1.06 with an interval of 0.02,and those prepared by (2) use scaling factors of 0.96, 1.00, and 1.06. Finally,the atomic positions and the cell shapes are randomly perturbed.Canonical(NVT)simulations are conducted to sample the surface configurations in the temperature range of 50.0–1222.0 K.Binary and ternary alloys are explored from iteration 91 to 94,while Al–Cu and selected ternary alloys are explored from iteration 95 to 103(see supporting information Table S1 for more information). The explored configurations are classified as accurate when the model deviation is smaller than 0.05 eV/˚A,as candidate if the model deviation is between 0.05 eV/˚A and 0.20 eV/˚A,and as failed otherwise.

Labeling The Vienna ab initio simulation package(VASP)[57,58]is used to conduct the DFT calculation with the generalized gradient approximation(GGA)adopting Perdew–Burke–Ernzerhof (PBE)[59,60]exchange–correlation functional. The projector-augmented-wave (PAW) method[61,62]is used with an energy cut-off of 650 eV for the plane wave basis sets. The K-points in the Brillouin zone are sampled by the Monkhorst–Pack mesh[63]with grid spacing of hk=0.1 ˚A−1. The order 1 Methfessel–Paxton smearing method[64]is adopted with σ=0.22 eV.The convergence criterion for the self consistent field (SCF) calculation is 1×10−6eV for the changes of both the total energy and the band structure energy.

Training The smooth edition of the deep potential model is adopted to construct the PES model.[50]The DeePMD-kit package[65]is used for training. The sizes of the embedding and fitting nets are set to(25,50,100)and(240,240,240),respectively. During the DP-GEN iterations, the cut-off radius is set to 6 ˚A. The learning rate starts from 1×10−3and exponentially decays to 3.5×10−8after 1×106training steps.Four models are trained to construct the model ensemble. Using the same architecture and the same training data set,they only differ in the random seeds for initializing model parameters. After the DP-GEN iterations being converged,the production models are trained with the cut-off radius set to 9 ˚A and training steps set to 1.6×107. In all training tasks, the Adam stochastic gradient descent method[66]is used with default hyper-parameter settings provided by the TensorFlow package.[67]

3. Results and discussion

After the convergence of the DP-GEN process,about 2.73 billion alloy configurations are explored and only a small portion of them(~0.0038%)are selected for labeling. Data sets produced by this work have been uploaded to the online open data repository.[68]

In the following we systematically benchmark the DP model by testing its accuracy in terms of formation energy,equilibrium volume,equation of state(EOS),elastic modulus,surface formation energy,vacancy formation energy,and interstitial energies. We consider the 58 alloy structures from the Materials Project(MP)database,including 15 Al–Cu,30 Al–Mg, and 7 Cu–Mg binary structures, as well as 6 Al–Cu–Mg ternary structures.The properties are calculated from the alloy structures relaxed at 0 bar.A state-of-the-art MEAM[43]model is used for comparison. DFT calculations of these properties adopt the same numerical setup as those used in the labeling step. When computing the formation energy, equilibrium volume, interstitial energy as well as vacancy defect and surface formation energies,the ionic relaxation stops when all the forces in a configuration are smaller than 1×10−2eV/˚A.The elastic moduli are calculated by the strain–stress relation using the finite difference method,so it demands a tighter break criterion for both equilibrium and deformed structures. Here we terminate the ionic step when the energy change in two successive steps is smaller than 1×10−5eV.DP and MEAM ionic relaxations break when either one of the following two criteria is satisfied. (1) The energy change between two successive steps is smaller than one part in 1×1012of the total energy,or(2)the norm of the global force vector(could be approximately considered as the upper bound of any force component on any atom) is smaller than 1×10−6eV/˚A. All the computations and analyses of the properties mentioned above are facilitated by the automatic workflow implemented in DPGEN,[55]which invokes VASP[57,58]for DFT calculations and LAMMPS[56]for DP and MEAM calculations. The vacancy and interstitial structures are prepared by exhaustively generating all non-equivalent defect structures from the MP structures by the the python package pymatgen[69](interstital structures are generated by the interstitialcy finding tool(InFit),[70]which is integrated in the pymatgen package).

3.1. Formation energy and equilibrium volume

The formation energy of binary alloys,taking Al–Cu for example,is defined as

and that of the ternary Al–Cu–Mg alloy is defined as

where E0AlCuand E0AlCuMgdenote the equilibrium energies per atom of the tested Al–Cu and Al–Cu–Mg alloys,respectively.cAland cCudenote the component concentrations of Al and Cu,respectively,in an alloy structure. E0Al, E0Cu,and E0Mgdenote the equilibrium energies per atom of the three pure metals in their stable crystalline structures. Here Al and Cu take the FCC structure,and Mg takes the HCP structure.

Fig.1.Comparison of the formation energy and the equilibrium volume predicted by DP and MEAM for alloys, with respect to DFT results.(a) Formation energy with dashed lines representing y=x±0.03 and the inserted histogram showing the distribution of absolute errors with respect to the DFT reference. (b)Equilibrium volume with dished lines representing y=x±0.2 and the inserted histogram showing the distribution of errors. 58 structures,including both binary and ternary alloy structures,are adopted from Materials Projects(MP)database.

The DP predictions of formation energy and equilibrium volume for all tested structures are in satisfactory consistency with the DFT results. In comparison with the MEAM results, the relative errors with respect to DFT references are generally smaller. As shown by Fig. 1(a), the absolute differences between DP and DFT formation energies are below 10 meV·atom−1for 39 MP structures (67% in total 58 MP structures), and no tested case has an error larger than 100 meV·atom−1. As shown in Fig. 1(b), 54 MP structures,or 93% of the tested structures, have absolute errors in equilibrium volume of <0.2 ˚A3·atom−1. Compared with MEAM results, the errors in the formation energy (in eV·atom−1) of most of the testing cases are reduced by at least one order of magnitude, and the relative errors in equilibrium volume are generally reduced from several percents to beneath 1%.About 53%of the MEAM results give errors over 100 meV·atom−1.About 17%in total give errors larger than 200 meV·atom−1,so they are beyond the plotted range of the distribution of errors.

3.2. Equation of state

Next, we investigate the energy dependence on the volume region around the equilibrium value (80%–120% of the equilibrium volume). The performance of DP on the equation of state (EOS) for both binary and ternary alloys is shown in Fig. 2. The absolute DP root-mean-squared errors(RMSEs) with respect to DFT references are smaller than 10 meV·atom−1for 34 out of 57 tested alloys (MP-1200279 is not included due to the convergence difficulty in the DFT calculation), which is close to the performance on equilibrium energy (39 out of 58 structures with errors smaller than 10 meV·atom−1). This indicates that the DP model generally holds its accuracy for the equilibrium state in this volume range. As a comparison,for more than half of the tested cases, the RMSEs of DP predictions are two orders of magnitudes smaller than MEAM results,thus DP ensures the generally more reliable energy–volume dependence than MEAM for crystals in the whole Al–Cu–Mg concentration space.

Fig.2. Comparison of DP and MEAM root-mean-squared error(RMSE)of EOS for alloys, with respect to DFT results. Structures are downloaded from the MP database. Volume range for computing RMSE is approximately from 80% to 120% of the DFT equilibrium volume. (a) 5 Al–Cu–Mg structures, 2 of them give MEAM errors larger than the plotting range: (a1) 0.600 eV·atom−1; (a2) 0.663 eV·atom−1. MP-1200279 is not included due to the convergence difficulty in the DFT calculation;(b)7 Cu–Mg structures; (c) 15 Al–Cu structures; (d) 30 Al–Mg structures. The DP and MEAM EOS of MP-1039192 are estimated by single-point energy calculations on converged structures from the DFT EOS computation,because the MP-1039192 structure is not stable in neither DP nor MEAM energy minimization.

We note that,for all the properties introduced above,despite an overall good agreement with DFT values, DP gives unsatisfactory predictions on a small number of structures. A detailed investigation suggests that this may be attributed to the exploration strategy adopted by the DP-GEN process. The MD simulation for exploring the conformation space starts from structures with FCC,HCP,and BCC lattices,and should include the corresponding crystalline, molten, and surface structures. However, there are local chemical environments that can be hardly sampled using this strategy,so that DP will be in the extrapolation regime when tested on the associated structures.

Examples include MP-1038818 and MP-1039010 in Fig.2,which are in diamond and simple cubic lattices,respectively. It should be mentioned that, however, if we continue with the DP-GEN process and add related structures into the exploration step, the DP model should be systematically improved. As an evidence, we refer to Ref. [44] for an Al–Mg model generated by a DP-GEN process that takes into considerations the diamond and simple cubic structures. Such a model shows a much better prediction accuracy on MP-1038818 and MP-1039010.

3.3. Elastic modulus and surface formation energy

Now we consider the bulk and shear moduli as well as the unrelaxed surface formation energy. The unrelaxed surface formation energy,Esf((lmn)),refers to the energy needed to create a surface with the miller index(lmn)for a given crystal,and is defined as

Here Es((lmn)) and Nsdenotes the energy and number of atoms of the unrelaxed surface structure with the Miller index (lmn). A denotes the surface area. For each MP case,E0refers to the per-atom potential energy of the relaxed bulk structure (under 0 bar) of this MP crystal. For each surface,energy evaluations of Es((lmn))using three methods are conducted on the same surface geometry,i.e.,the one generated by displacing two halves of the DFT equilibrium bulk structure.This ensures the rationality of the comparison, since a small difference in the equilibrium bulk structure relaxed by different methods and codes could bring distinct surface geometries under the same Miller index(lmn)(surface generation method is similar to that described in Section 2).

As shown in Fig.3(a),DP generally gives more accurate bulk and shear moduli than MEAM. It presents a more concentrated distribution of data points along the diagonal line denoting the DFT reference. This advantage is more directly revealed by comparing the absolute errors with respect to the DFT reference. Within the leftmost interval of the inserted histogram that allows errors beneath 5 GPa, about 20%more results in total are presented by DP than MEAM.Meanwhile,the tail of the distribution of error is also shorter for DP.Furthermore, both bulk and shear moduli of all 5 tested ternary cases are more accurately predicted by DP.

DP also outperforms MEAM on predictions of unrelaxed surface formation energy, as shown by Fig. 3(b). Taking DFT results as references and compared with MEAM, there are 50% more data points with absolute errors smaller than 0.05 J/m2when using DP for predictions. However, DP slightly underestimates Esffor some ternary cases and a small portion of binary cases.

Fig.3. Comparison of the elastic moduli and the unrelaxed surface formation energy predicted by DP and MEAM for alloys, with respect to DFT results. (a) The bulk modulus Bv and the shear modulus Gv of 57 MP structures, with dashed lines representing y=x±10 and the inserted histogram showing the distribution of absolute errors with respect to the DFT reference. MP-1200279 is not included due to the convergence difficulty in the DFT calculation.(b)911 unrelaxed surface formation energies of 56 MP structures,with dashed lines representing y=x±0.1 and the inserted histogram showing distribution of errors.MP-1200279 and MP-1185596 are not included due to the convergence difficulty in the DFT calculation.

3.4. Vacancy and interstitial defects

Finally,for ternary alloys,we examine the vacancy defect formation energy and the energy along interstitial relaxation paths. The vacancy formation energy Evfis defined as

where Ev(Nv) denotes the energy of a relaxed structure with one vacancy defect and Nvatoms. E0is the equilibrium energy per atom of the corresponding pristine crystal.

For the interstitial defect, due to the generally high energetic instability of the generated interstitial structures, the relaxation may not end up with structures with locally relaxed interstitial point defects. Meanwhile,the relaxation path would be highly sensitive to the practical implementation of the relaxation algorithm.[44]Therefore, instead of comparing the interstitial formation energies of DFT,DP,and MEAM calculated by independent relaxations,we investigate the DP and the MEAM prediction errors along the DFT relaxation path(excluding early high energy geometries).

Fig.4. Comparison of the vacancy formation energy and the unrelaxed surface formation energy predicted by DP and MEAM for ternary alloys,with respect to DFT results. (a)44 relaxed vacancy formation energies of 6 ternary alloys,with dashed lines representing y=x±0.5 and the inserted panel showing the error distribution.(b)23168 energies per atom along DFT relaxation path of 5 ternary alloys, with dashed lines representing y=x±0.02 and the inserted panel showing the error distribution.MP-1200279 is not included due to the convergence difficulty in the DFT calculation.

DP is in satisfactory consistency with DFT and shows relative advantages compared with MEAM for both properties.As shown by Fig. 4(a), almost all DP predicted Evfvalues have absolute errors beneath 0.5 eV with respect to DFT values. It can be found that, DP tends to slightly underestimate Evfcompared with the DFT reference. In terms of the energy along DFT interstitial relaxation paths, nearly all DP predictions give absolute errors beneath 1 meV·atom−1. In contrast,MEAM gives about 30% less data points in total within the same error range. Meanwhile,some MEAM predictions have errors of tens of meV·atom−1,as demonstrated by those points far from the DFT reference line in Fig.4(b).

It is noted that the vacancy and interstitial formation energies calculated by the tested structures may not be well converged due to the finite size effect.However,for the purpose of investigating the accuracy of the DP and the MEAM in terms of the defect formation energy,it is reasonable to compare the results of DP and MEAM with that of the DFT,computed from the defect structures with the same number of atoms.

4. Conclusion

Using the deep potential representation and the concurrent-learning scheme, DP-GEN, we develop an accurate PES model for the Al–Cu–Mg alloys in the full concentration range. Systematic benchmarks with DFT results on fundamental energetic, mechanical, and defect properties of 58 MP structures suggest a good accuracy in the full concentration space, and an overall better performance than a stateof-the-art MEAM potential. The training does not rely on any existed first-principles data set. Instead,data is generated on-the-fly through DP-GEN and finally forms a compact set supporting reliable DP model after the scheme is converged.The resulting DP model and data sets are expected to help investigate the physical mechanisms behind complex phenomena of Al–Cu–Mg systems and contribute to the methodology development for materials simulation.