Chinese Journal of Catalysis ›› 2023, Vol. 50: 284-296.DOI: 10.1016/S1872-2067(23)64467-5
• Articles • Previous Articles Next Articles
Yuzhuo Chena, Hao Wanga, Bing Lua, Ni Yib, Liang Caob, Yong Wanga, Shanjun Maoa,*()
Received:
2023-03-20
Accepted:
2023-06-01
Online:
2023-07-18
Published:
2023-07-25
Contact:
*E-mail: Supported by:
Yuzhuo Chen, Hao Wang, Bing Lu, Ni Yi, Liang Cao, Yong Wang, Shanjun Mao. Fine-structure sensitive deep learning framework for predicting catalytic properties with high precision[J]. Chinese Journal of Catalysis, 2023, 50: 284-296.
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.cjcatal.com/EN/10.1016/S1872-2067(23)64467-5
Fig. 1. Overview of the descriptor (a), grid (b), graph (c), and the "global + local" (d) representations of the chemical structure used for ML, where CNN and MPNN denote convolution neural network and message passing neural network [38], respectively. The white, red, blue and gray squares illustrate the feature vectors corresponding to the different representations.
DA times a | Total * | Grid * | Descriptor * |
---|---|---|---|
0 | 0.134b (NA)c | 0.159 (NA) | 0.361 (NA) |
5 | 0.126 (0.120) | 0.139 (0.134) | NA (NA) |
10 | 0.113 (0.106) | 0.135 (0.130) | NA (NA) |
15 | 0.105 (0.100) | 0.131 (0.126) | NA (NA) |
18 | 0.104 (0.100) | 0.130 (0.126) | NA (NA) |
20 | 0.103 (0.099) | 0.129 (0.125) | NA (NA) |
Table 1 MAE* and MAE for predicting OH adsorption energies (unit: eV).
DA times a | Total * | Grid * | Descriptor * |
---|---|---|---|
0 | 0.134b (NA)c | 0.159 (NA) | 0.361 (NA) |
5 | 0.126 (0.120) | 0.139 (0.134) | NA (NA) |
10 | 0.113 (0.106) | 0.135 (0.130) | NA (NA) |
15 | 0.105 (0.100) | 0.131 (0.126) | NA (NA) |
18 | 0.104 (0.100) | 0.130 (0.126) | NA (NA) |
20 | 0.103 (0.099) | 0.129 (0.125) | NA (NA) |
Algorithm | #Sample | Range (eV) | MAE (eV) | RMSE (eV) | Ref. |
---|---|---|---|---|---|
ANN b | 635 | 1.8 | NA | 0.240 | [ |
ANN | 748 | 4.8 | 0.152 | 0.222 | [ |
GPR c | 1235 | 4.6 | 0.170 | 0.240 | [ |
Bayesian | 512 | 2.2 | 0.160 | 0.209 | [ |
Bayesian | 748 | 4.8 | 0.270 | 0.435 | [ |
DOSnet | 1103 | 5.4 | 0.156 | 0.221 | [ |
CGCNN | 748 | 4.8 | 0.114 | 0.189 | [ |
TinNet | 748 | 4.8 | 0.118 | 0.188 | [ |
LR d | 112 | 5.6 | 0.210 | NA | [ |
GLCNN | 1564e | 5.6 | 0.114 (0.110)* | 0.188 (0.182)* | This work a |
GLCNN | 3128 | 5.6 | 0.103 (0.099)* | 0.173 (0.167)* | This work a |
Table 2 Benchmark comparison of models for predicting OH chemisorption.
Algorithm | #Sample | Range (eV) | MAE (eV) | RMSE (eV) | Ref. |
---|---|---|---|---|---|
ANN b | 635 | 1.8 | NA | 0.240 | [ |
ANN | 748 | 4.8 | 0.152 | 0.222 | [ |
GPR c | 1235 | 4.6 | 0.170 | 0.240 | [ |
Bayesian | 512 | 2.2 | 0.160 | 0.209 | [ |
Bayesian | 748 | 4.8 | 0.270 | 0.435 | [ |
DOSnet | 1103 | 5.4 | 0.156 | 0.221 | [ |
CGCNN | 748 | 4.8 | 0.114 | 0.189 | [ |
TinNet | 748 | 4.8 | 0.118 | 0.188 | [ |
LR d | 112 | 5.6 | 0.210 | NA | [ |
GLCNN | 1564e | 5.6 | 0.114 (0.110)* | 0.188 (0.182)* | This work a |
GLCNN | 3128 | 5.6 | 0.103 (0.099)* | 0.173 (0.167)* | This work a |
Fig. 2. Symmetry elements contained in SV (a), DV (b) and HV (c) defects with different N contents, where Cn, i, σ and E represent n-fold rotation axis, symmetry center, vertical mirror, and identity element, respectively. Numbers in parentheses distinguish structures containing different symmetry elements. TM atom protrudes from the plane in SV, while in DV and HV, it is in the same plane as other atoms owing to the large defect space in DV and HV.
Fig. 3. Workflow of the double-input single-output GLCNN framework based on the "global + local" strategy, where the gray and blue spheres represent C and Pd atoms, respectively. First, the original chemical structure is expanded and gridded, and multiple channels are added. Subsequently, all channels are cropped, flipped, and resized to generate the input of the CNN module. In contrast, the 1D descriptor related to the adsorption site is fed as the second input and concatenated with the output from the CNN module to form a new input, which is finally transmitted to the fully connected (FC) module to obtain the predicted adsorption energy.
Fig. 4. Structure of original sample (a) and a collection of (b) cropped and flipped data (c) derived from it, taking the channel of atomic number as example. Electronegativity channels of catalysts with (d) cis and (e) trans configurations. The original grid with a size of 128×128 is reduced to 32×32 after the processing of a 64×64 sliding window and Gaussian reconstruction. Gray and blue spheres represent C and Pd atoms, respectively. All augmented data represents the same chemical structure. The dark blue background and fluorescent green/blue highlights represent the vacuum area and atoms respectively.
Fig. 5. Architecture of the CNN model. Grids and descriptors are the two inputs of GLCNN, where 6×(32×32) represents grids with the size of 32×32 containing 6 channels, and 15×1 denotes 15 features. Conv2d represents a 2D convolution layer, where 6×(5×5) represents 6 kernels with the size of 5×5. MaxPool2d is the maximum pooling layer with the size of the pooling window set at (2×2). FC is the fully connected layer, where 101×2000 represents the shape of the input and output of the FC layer. The blue and red boxes belong to “global” and “local” sections, respectively. Layers in the green box integrate the outputs of the "global" and "local" sections and produce the final prediction.
Fig. 6. (a) Learning curve of the GLCNN in predicting OH adsorption energies, in which MAE*s are estimated by the five-fold cross-validation approach. Comparison between DFT-calculated and model-predicted values of OH adsorption energies using GLCNN (b), linear regression13 (c) and CGCNN35 (d), where a histogram of data distribution is displayed on the top of (b). Insets show the comparison between DFT-calculated and model-predicted values of data with cis/trans (red/green) configurations in the test set. Points of the same shape denote paired cis/trans configurations with the same cell expansion coefficient, defect type, and N content.
Fig. 7. (a) PFI analysis of the GLCNN model. The black vertical dashed line represents the MAE* of the model without permutation (0.099 eV). Distribution of the difference of OH adsorption energies for DV (b) and HV (c) defects with cis/trans configurations. Energy subtraction occurs between cis and trans configurations with the same cell expansion coefficient, defect type, and N content (i.e., Fig. S1(a) and Fig. S1(b), or Fig. S1(c) and Fig. S1(d)). A difference greater than 0.6 eV is rare and not shown.
Fig. 8. Intermediate output of the first convolution (a) and activation (b) layer and the second convolution (c) and activation layer (d). The red circle enclosed the obvious feature extraction. Thermal diagrams obtained from Grad-CAM analysis of the first (e) and second (f) convolution layer.
Fig. 9. t-SNE analysis using the adsorption energy of OH (a,d,g), group number of TM (b,e,h), defect type (c,f,i) as fitting or classification criteria of the first (a?c), second (d?f), and third (g?i) FC layers, respectively. The numbers 3-12 in (b) represent the group of the TMs in the periodic table from left to right, which is consistent with the element attribute setting in the pymatgen python package (Fig. S3).
|
[1] | Shuaiqi Meng, Zhongyu Li, Peng Zhang, Francisca Contreras, Yu Ji, Ulrich Schwaneberg. Deep learning guided enzyme engineering of Thermobifida fusca cutinase for increased PET depolymerization [J]. Chinese Journal of Catalysis, 2023, 50(7): 229-238. |
[2] | Lijing Wang, Tianyi Yang, Bo Feng, Xiangyu Xu, Yuying Shen, Zihan Li, Arramel , Jizhou Jiang. Constructing dual electron transfer channels to accelerate CO2 photoreduction guided by machine learning and first-principles calculation [J]. Chinese Journal of Catalysis, 2023, 54(11): 265-277. |
[3] | Wanjun Sun, Jiayu Zhu, Meiyu Zhang, Xiangyu Meng, Mengxue Chen, Yu Feng, Xinlong Chen, Yong Ding. Recent advances and perspectives in cobalt-based heterogeneous catalysts for photocatalytic water splitting, CO2 reduction, and N2 fixation [J]. Chinese Journal of Catalysis, 2022, 43(9): 2273-2300. |
[4] | Xiao-Tian Li, Lin Chen, Cheng Shang, Zhi-Pan Liu. Selectivity control in alkyne semihydrogenation: Recent experimental and theoretical progress [J]. Chinese Journal of Catalysis, 2022, 43(8): 1991-2000. |
[5] | Ya-Fei Jiang, Jin-Cheng Liu, Cong-Qiao Xu, Jun Li, Hai Xiao. Breaking the scaling relations for efficient N2-to-NH3 conversion by a bowl active site design: Insight from LaRuSi and isostructural electrides [J]. Chinese Journal of Catalysis, 2022, 43(8): 2183-2192. |
[6] | Letian Chen, Xu Zhang, An Chen, Sai Yao, Xu Hu, Zhen Zhou. Targeted design of advanced electrocatalysts by machine learning [J]. Chinese Journal of Catalysis, 2022, 43(1): 11-32. |
[7] | Xiaoyan Liu, Guojun Lan, Zhenqing Li, Lihua Qian, Jian Liu, Ying Li. Stabilization of heterogeneous hydrogenation catalysts for the aqueous-phase reactions of renewable feedstocks [J]. Chinese Journal of Catalysis, 2021, 42(5): 694-709. |
[8] | Wenjuan Yan, Dongpei Zhang, Yu Sun, Ziqi Zhou, Yihang Du, Yiyao Du, Yushan Li, Mengyuan Liu, Yuming Zhang, Jian Shen, Xin Jin. Structural sensitivity of heterogeneous catalysts for sustainable chemical synthesis of gluconic acid from glucose [J]. Chinese Journal of Catalysis, 2020, 41(9): 1320-1336. |
[9] | Yan Zhang, Hui Hu, Jia Ju, Qianqian Yan, Vasanthakumar Arumugam, Xuechao Jing, Huaqiang Cai, Yanan Gao. Ionization of a covalent organic framework for catalyzing the cycloaddition reaction between epoxides and carbon dioxide [J]. Chinese Journal of Catalysis, 2020, 41(3): 485-493. |
[10] | Zhong-Pan Hu, Dandan Yang, Zheng Wang, Zhong-Yong Yuan. State-of-the-art catalysts for direct dehydrogenation of propane to propylene [J]. Chinese Journal of Catalysis, 2019, 40(9): 1233-1254. |
[11] | Zhongzhe Wei, Fangjun Shao, Jianguo Wang. Recent advances in heterogeneous catalytic hydrogenation and dehydrogenation of N-heterocycles [J]. Chinese Journal of Catalysis, 2019, 40(7): 980-1002. |
[12] | Pingping Zha, Yunyun Zhan, Daokuan L, Hongyou Cu, Lipeng Zhan. Mesoporous polyoxometalate-based ionic hybrid as a highly effective heterogeneous catalyst for direct hydroxylation of benzene to phenol [J]. Chinese Journal of Catalysis, 2018, 39(2): 334-341. |
[13] | Ye Lu, Yoshinori Yamamoto, Abdulrahman I. Almansour, Natarajan Arumugam, Raju Suresh Kumar, Ming Bao. Unsupported nanoporous palladium-catalyzed chemoselective hydrogenation of quinolines:Heterolytic cleavage of H2 molecule [J]. Chinese Journal of Catalysis, 2018, 39(11): 1746-1752. |
[14] | Xiaoming Yan, Xuan Ding, Yu Pan, Xiaowei Xu, Ce Hao, Wenji Zheng, Gaohong He. Quaternary-ammonium-immobilized polystyrenes as efficient and reusable heterogeneous catalysts for synthesis of cyclic carbonate: Effects of linking chains and pendent hydroxyl group [J]. Chinese Journal of Catalysis, 2017, 38(5): 862-871. |
[15] | Haipeng Fan, Zhengliang Qi, Dejun Sui, Fei Mao, Rizhi Chen, Jun Huang. Palladium nanoparticles in cross-linked polyaniline as highly efficient catalysts for Suzuki-Miyaura reactions [J]. Chinese Journal of Catalysis, 2017, 38(3): 589-596. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||