Graphormer-IR: Graph Transformers Predict Experimental IR Spectra Using Highly Specialized Attention

Research output: Contribution to journalArticleResearchpeer review

Authors

  • Cailum M.K. Stienstra
  • Liam Hebert
  • Patrick Thomas
  • Alexander Haack
  • Jason Guo
  • W. Scott Hopkins

External Research Organisations

  • University of Waterloo
  • WaterMine Innovation
  • Centre for Eye and Vision Research
View graph of relations

Details

Original languageEnglish
Pages (from-to)4613-4629
Number of pages17
JournalJournal of Chemical Information and Modeling
Volume64
Issue number12
Early online date7 Jun 2024
Publication statusPublished - 24 Jun 2024
Externally publishedYes

Abstract

Infrared (IR) spectroscopy is an important analytical tool in various chemical and forensic domains and a great deal of effort has gone into developing in silico methods for predicting experimental spectra. A key challenge in this regard is generating highly accurate spectra quickly to enable real-time feedback between computation and experiment. Here, we employ Graphormer, a graph neural network (GNN) transformer, to predict IR spectra using only simplified molecular-input line-entry system (SMILES) strings. Our data set includes 53,528 high-quality spectra, measured in five different experimental media (i.e., phases), for molecules containing the elements H, C, N, O, F, Si, S, P, Cl, Br, and I. When using only atomic numbers for node encodings, Graphormer-IR achieved a mean test spectral information similarity (SISμ) value of 0.8449 ± 0.0012 (n = 5), which surpasses that the current state-of-the-art model Chemprop-IR (SISμ = 0.8409 ± 0.0014, n = 5) with only 36% of the encoded information. Augmenting node embeddings with additional node-level descriptors in learned embeddings generated through a multilayer perceptron improves scores to SISμ = 0.8523 ± 0.0006, a total improvement of 19.7σ (t = 19). These improved scores show how Graphormer-IR excels in capturing long-range interactions like hydrogen bonding, anharmonic peak positions in experimental spectra, and stretching frequencies of uncommon functional groups. Scaling our architecture to 210 attention heads demonstrates specialist-like behavior for distinct IR frequencies that improves model performance. Our model utilizes novel architectures, including a global node for phase encoding, learned node feature embeddings, and a one-dimensional (1D) smoothing convolutional neural network (CNN). Graphormer-IR’s innovations underscore its value over traditional message-passing neural networks (MPNNs) due to its expressive embeddings and ability to capture long-range intramolecular relationships.

ASJC Scopus subject areas

Cite this

Graphormer-IR: Graph Transformers Predict Experimental IR Spectra Using Highly Specialized Attention. / Stienstra, Cailum M.K.; Hebert, Liam; Thomas, Patrick et al.
In: Journal of Chemical Information and Modeling, Vol. 64, No. 12, 24.06.2024, p. 4613-4629.

Research output: Contribution to journalArticleResearchpeer review

Stienstra CMK, Hebert L, Thomas P, Haack A, Guo J, Hopkins WS. Graphormer-IR: Graph Transformers Predict Experimental IR Spectra Using Highly Specialized Attention. Journal of Chemical Information and Modeling. 2024 Jun 24;64(12):4613-4629. Epub 2024 Jun 7. doi: 10.1021/acs.jcim.4c00378
Stienstra, Cailum M.K. ; Hebert, Liam ; Thomas, Patrick et al. / Graphormer-IR : Graph Transformers Predict Experimental IR Spectra Using Highly Specialized Attention. In: Journal of Chemical Information and Modeling. 2024 ; Vol. 64, No. 12. pp. 4613-4629.
Download
@article{0f2846b535024b8fa08ecd9b03a70f68,
title = "Graphormer-IR: Graph Transformers Predict Experimental IR Spectra Using Highly Specialized Attention",
abstract = "Infrared (IR) spectroscopy is an important analytical tool in various chemical and forensic domains and a great deal of effort has gone into developing in silico methods for predicting experimental spectra. A key challenge in this regard is generating highly accurate spectra quickly to enable real-time feedback between computation and experiment. Here, we employ Graphormer, a graph neural network (GNN) transformer, to predict IR spectra using only simplified molecular-input line-entry system (SMILES) strings. Our data set includes 53,528 high-quality spectra, measured in five different experimental media (i.e., phases), for molecules containing the elements H, C, N, O, F, Si, S, P, Cl, Br, and I. When using only atomic numbers for node encodings, Graphormer-IR achieved a mean test spectral information similarity (SISμ) value of 0.8449 ± 0.0012 (n = 5), which surpasses that the current state-of-the-art model Chemprop-IR (SISμ = 0.8409 ± 0.0014, n = 5) with only 36% of the encoded information. Augmenting node embeddings with additional node-level descriptors in learned embeddings generated through a multilayer perceptron improves scores to SISμ = 0.8523 ± 0.0006, a total improvement of 19.7σ (t = 19). These improved scores show how Graphormer-IR excels in capturing long-range interactions like hydrogen bonding, anharmonic peak positions in experimental spectra, and stretching frequencies of uncommon functional groups. Scaling our architecture to 210 attention heads demonstrates specialist-like behavior for distinct IR frequencies that improves model performance. Our model utilizes novel architectures, including a global node for phase encoding, learned node feature embeddings, and a one-dimensional (1D) smoothing convolutional neural network (CNN). Graphormer-IR{\textquoteright}s innovations underscore its value over traditional message-passing neural networks (MPNNs) due to its expressive embeddings and ability to capture long-range intramolecular relationships.",
author = "Stienstra, {Cailum M.K.} and Liam Hebert and Patrick Thomas and Alexander Haack and Jason Guo and Hopkins, {W. Scott}",
note = "Publisher Copyright: {\textcopyright} 2024 American Chemical Society.",
year = "2024",
month = jun,
day = "24",
doi = "10.1021/acs.jcim.4c00378",
language = "English",
volume = "64",
pages = "4613--4629",
journal = "Journal of Chemical Information and Modeling",
issn = "1549-9596",
publisher = "American Chemical Society",
number = "12",

}

Download

TY - JOUR

T1 - Graphormer-IR

T2 - Graph Transformers Predict Experimental IR Spectra Using Highly Specialized Attention

AU - Stienstra, Cailum M.K.

AU - Hebert, Liam

AU - Thomas, Patrick

AU - Haack, Alexander

AU - Guo, Jason

AU - Hopkins, W. Scott

N1 - Publisher Copyright: © 2024 American Chemical Society.

PY - 2024/6/24

Y1 - 2024/6/24

N2 - Infrared (IR) spectroscopy is an important analytical tool in various chemical and forensic domains and a great deal of effort has gone into developing in silico methods for predicting experimental spectra. A key challenge in this regard is generating highly accurate spectra quickly to enable real-time feedback between computation and experiment. Here, we employ Graphormer, a graph neural network (GNN) transformer, to predict IR spectra using only simplified molecular-input line-entry system (SMILES) strings. Our data set includes 53,528 high-quality spectra, measured in five different experimental media (i.e., phases), for molecules containing the elements H, C, N, O, F, Si, S, P, Cl, Br, and I. When using only atomic numbers for node encodings, Graphormer-IR achieved a mean test spectral information similarity (SISμ) value of 0.8449 ± 0.0012 (n = 5), which surpasses that the current state-of-the-art model Chemprop-IR (SISμ = 0.8409 ± 0.0014, n = 5) with only 36% of the encoded information. Augmenting node embeddings with additional node-level descriptors in learned embeddings generated through a multilayer perceptron improves scores to SISμ = 0.8523 ± 0.0006, a total improvement of 19.7σ (t = 19). These improved scores show how Graphormer-IR excels in capturing long-range interactions like hydrogen bonding, anharmonic peak positions in experimental spectra, and stretching frequencies of uncommon functional groups. Scaling our architecture to 210 attention heads demonstrates specialist-like behavior for distinct IR frequencies that improves model performance. Our model utilizes novel architectures, including a global node for phase encoding, learned node feature embeddings, and a one-dimensional (1D) smoothing convolutional neural network (CNN). Graphormer-IR’s innovations underscore its value over traditional message-passing neural networks (MPNNs) due to its expressive embeddings and ability to capture long-range intramolecular relationships.

AB - Infrared (IR) spectroscopy is an important analytical tool in various chemical and forensic domains and a great deal of effort has gone into developing in silico methods for predicting experimental spectra. A key challenge in this regard is generating highly accurate spectra quickly to enable real-time feedback between computation and experiment. Here, we employ Graphormer, a graph neural network (GNN) transformer, to predict IR spectra using only simplified molecular-input line-entry system (SMILES) strings. Our data set includes 53,528 high-quality spectra, measured in five different experimental media (i.e., phases), for molecules containing the elements H, C, N, O, F, Si, S, P, Cl, Br, and I. When using only atomic numbers for node encodings, Graphormer-IR achieved a mean test spectral information similarity (SISμ) value of 0.8449 ± 0.0012 (n = 5), which surpasses that the current state-of-the-art model Chemprop-IR (SISμ = 0.8409 ± 0.0014, n = 5) with only 36% of the encoded information. Augmenting node embeddings with additional node-level descriptors in learned embeddings generated through a multilayer perceptron improves scores to SISμ = 0.8523 ± 0.0006, a total improvement of 19.7σ (t = 19). These improved scores show how Graphormer-IR excels in capturing long-range interactions like hydrogen bonding, anharmonic peak positions in experimental spectra, and stretching frequencies of uncommon functional groups. Scaling our architecture to 210 attention heads demonstrates specialist-like behavior for distinct IR frequencies that improves model performance. Our model utilizes novel architectures, including a global node for phase encoding, learned node feature embeddings, and a one-dimensional (1D) smoothing convolutional neural network (CNN). Graphormer-IR’s innovations underscore its value over traditional message-passing neural networks (MPNNs) due to its expressive embeddings and ability to capture long-range intramolecular relationships.

UR - http://www.scopus.com/inward/record.url?scp=85195536191&partnerID=8YFLogxK

U2 - 10.1021/acs.jcim.4c00378

DO - 10.1021/acs.jcim.4c00378

M3 - Article

C2 - 38845400

AN - SCOPUS:85195536191

VL - 64

SP - 4613

EP - 4629

JO - Journal of Chemical Information and Modeling

JF - Journal of Chemical Information and Modeling

SN - 1549-9596

IS - 12

ER -