CatEmbed: A Machine-Learned Representation Obtained via Categorical Entity Embedding for Predicting Adsorption and Reaction Energies on Bimetallic Alloy Surfaces

Clara Kirkvold, Brianna A. Collins, Jason D. Goodpaster

Research output: Contribution to journalArticlepeer-review

Abstract

Machine-learning models for predicting adsorption energies on metallic surfaces often rely on basic elemental properties and electronic and geometric descriptors. Here, we apply categorical entity embedding, a featurization method inspired by natural language processing techniques, to predict adsorption energies on bimetallic alloy surfaces using categorical descriptors. Using this method, we develop a machine-learned representation from categorical descriptors (e.g., surface composition, adsorbate type, and site type) of the slab/adsorbate complex. By combining this representation with numerical features (e.g., slab metal stoichiometric ratios), we create the CatEmbed representation. Remarkably, decision tree models trained using CatEmbed, which includes no explicit geometric information, achieve a Mean Absolute Error (MAE) of 0.12 eV. Additionally, we extend this technique to predict reaction energies on bimetallic surfaces, creating the CatEmbed-React representation, which achieves an MAE of 0.08 eV. These findings highlight the effectiveness of categorical entity embedding for predicting adsorption and reaction energies on bimetallic alloy surfaces.

Original languageEnglish (US)
Pages (from-to)6791-6797
Number of pages7
JournalJournal of Physical Chemistry Letters
Volume15
Issue number26
DOIs
StatePublished - Jul 4 2024

Bibliographical note

Publisher Copyright:
© 2024 American Chemical Society.

PubMed: MeSH publication types

  • Journal Article

Fingerprint

Dive into the research topics of 'CatEmbed: A Machine-Learned Representation Obtained via Categorical Entity Embedding for Predicting Adsorption and Reaction Energies on Bimetallic Alloy Surfaces'. Together they form a unique fingerprint.

Cite this