|
Hokkaido University Collection of Scholarly and Academic Papers >
Institute for Catalysis >
Peer-reviewed Journal Articles, etc >
Statistical Analysis and Discovery of Heterogeneous Catalysts Based on Machine Learning from Diverse Published Data
Title: | Statistical Analysis and Discovery of Heterogeneous Catalysts Based on Machine Learning from Diverse Published Data |
Authors: | Suzuki, Keisuke Browse this author | Toyao, Takashi Browse this author →KAKEN DB | Maeno, Zen Browse this author →KAKEN DB | Takakusagi, Satoru Browse this author →KAKEN DB | Shimizu, Ken-ichi Browse this author →KAKEN DB | Takigawa, Ichigaku Browse this author →KAKEN DB |
Keywords: | Machine learning | Catalysis informatics | Methane oxidative coupling | Water gas shift | CO oxidation |
Issue Date: | 9-Jul-2019 |
Publisher: | Wiley-Blackwell |
Journal Title: | ChemCatChem |
Volume: | 11 |
Issue: | 18 |
Start Page: | 4537 |
End Page: | 4547 |
Publisher DOI: | 10.1002/cctc.201900971 |
Abstract: | The literature provides insights for catalyst design and discovery. Effective analysis of reported data using machine learning (ML) methods offers the ability to gain valuable information. However, utilizing the literature in this way has obstacles such as lack of compositional overlaps, bias from prior published data, and low sample counts for many elements. The present study describes an ML approach that considers elemental features as input representations instead of inputting catalyst compositions directly. This ML method has the potential for catalyst discovery, including catalytic reactions with limited catalyst composition overlap in the available data. Oxidative coupling of methane (OCM), water gas shift (WGS), and CO oxidation reactions were chosen to confirm the effectiveness of the proposed method by analysis using several state-of-the-art ML methods. Among the ML methods tested, gradient boosting regression with XGBoost (XGB) provided the best results, and prediction accuracy was improved by the proposed approach for all three reaction types. In addition, a quantitative value of "feature importance score" was calculated to evaluate the most influential input variables on catalyst performance. Finally, catalyst optimization was explored using ML as a "surrogate" model, and the top 20 promising candidate catalysts were identified for the OCM reaction based on the optimization. The advantages of ML in catalysis analysis as well as the difficulties and limitations originating from the complexity of heterogeneous catalysis were explored. |
Rights: | This is the peer reviewed version of the following article: K. Suzuki, T. Toyao, Z. Maeno, S. Takakusagi, K. Shimizu, I. Takigawa, Statistical Analysis and Discovery of Heterogeneous Catalysts Based on Machine Learning from Diverse Published Data, ChemCatChem 2019, 11, 4537, which has been published in final form at https://doi.org/10.1002/cctc.201900971 . This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving. |
Type: | article (author version) |
URI: | http://hdl.handle.net/2115/78835 |
Appears in Collections: | 触媒科学研究所 (Institute for Catalysis) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)
|
Submitter: 清水 研一
|