Tailored Mass Spectral Data Exploration Using SpecXplore Interactive Dashboard

02.04.2024

Kevin Mildau, Henry Ehlers, Ian Oesterle, Manuel Pristner, Benedikt Warth, Maria Doppler, Christoph Büschl, Jürgen Zanghellini, and Justin J. J. van der Hooft

Published in Analytical Chemistry, 96, 15, 5798–5806.

doi.org/10.1021/acs.analchem.3c04444

Abstract: Untargeted metabolomics promises comprehensive characterization of small molecules in biological samples. However, the field is hampered by low annotation rates and abstract spectral data. Despite recent advances in computational metabolomics, manual annotations and manual confirmation of in-silico annotations remain important in the field. Here, exploratory data analysis methods for mass spectral data provide overviews, prioritization, and structural hypothesis starting points to researchers facing large quantities of spectral data.

In this research, we propose a fluid means of dealing with mass spectral data using specXplore, an interactive Python dashboard providing interactive and complementary visualizations facilitating mass spectral similarity matrix exploration. Specifically, specXplore provides a two-dimensional t-distributed stochastic neighbor embedding embedding as a jumping board for local connectivity exploration using complementary interactive visualizations in the form of partial network drawings, similarity heatmaps, and fragmentation overview maps. SpecXplore makes use of state-of-the-art ms2deepscore pairwise spectral similarities as a quantitative backbone while allowing fast changes of threshold and connectivity limitation settings, providing flexibility in adjusting settings to suit the localized node environment being explored.

We believe that specXplore can become an integral part of mass spectral data exploration efforts and assist users in the generation of structural hypotheses for compounds of interest.

SpecXplore: MS/MS Data Exploration

SpecXplore dashboard overview of visual components using wheat data

Process of exploring the local environment of a known feature node is illustrated.