Skip to content

gerasimoska/mass_spectra

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Mass Spectra

Mass spectral libraries search tool (MSL-ST), used to enhance organic compounds' identification

🔧 The tool:

https://massspectra.dev/

💭 Project aim

  • Identification of new organic compounds through suspect screening (SS) and non-targeted analysis (NTA) became the most challenging task in environmental and metabolomics research in the recent two decades. Identification of thousands of organic compounds is performed using the recent technology advancements in chromatography-mass spectrometry as the core analytical platform, assisted by multitude of cheminformatics-assisted approaches.
  • As many of those approaches rely on mass spectral libraries (MSLs) search, the availability of comprehensive MSLs with engines for batch search and export of MS data and batch search engines for simultaneous search and export of MS data from multiple MSLs is of crucial importance. In lack of such, analysts perform this step in a laborious, time-consuming manual manner, importing significant risk of compound misidentification.
  • Web scraping is used as a method for extracting data from the websites of MoNa and Mass Bank of Europe. The processing of the collected structural identifiers from the metadata (SMILES, InChIKey, molecular mass, MF), as well as data on the chromatographic-spectrometric methods used to generated mass spectra (GCMS, LC-MS and capillary electrophoresis (CE-MS)) and their storage in a tabular structure are done in Python, while the web system is made with the Django framework. By using Vue.js and Bootstrap a user-friendly interface was developed, intended for use by researchers with chemical, but also with computer science background.
  • This is the first tool for automated batch search and storage of MS spectra that uses two of the largest publicly available MSLs as data source, the MassBank of North America (MoNa) and the MassBank of Europe. MSL-ST assembles large amount of MS data in an automated, time- and cost-effective manner in a format which allows its further processing, especially for the purpose of compound identification. The tool, accompanied with user manual, is publicly available on GitHub, and available for usage on https://massspectra.dev/.

💡 Work to be done:

  • Addition of more publicly available Mass Spectral Libraries
  • UI/UX improvements

📚 Resources:

⚖️ License:

This project is licensed under GNU General Public License v3.0