Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Matched ions #106

Open
kevinkovalchik opened this issue Oct 15, 2020 · 10 comments
Open

Matched ions #106

kevinkovalchik opened this issue Oct 15, 2020 · 10 comments

Comments

@kevinkovalchik
Copy link

Hello,

I am interested in getting lists of of matched ions for each PSM as part of the search results. For example, something like:

matched_y_ions="y1+=175.118952913371, y3+=361.18301123237103, etc...", matched_b_ions="..."

I do not see anything in the parameters that looks like this. Is there a switch for this that I am not seeing?

Thanks!

Kevin

@dpolasky
Copy link
Member

Hi Kevin,
Reporting a list of specific ions matched for each PSM is not a feature in MSFragger at this time. You can see the total number of matched ions for each PSM in the pepXML output (the tag to look for is "num_matched_ions"), but not which ions make up the total. I think the pepXML schema does not support reporting a detailed list of matched ions (but I could be wrong).

@fcyu
Copy link
Member

fcyu commented Oct 15, 2020

BTW, if you want to know which peaks could be matched, you can use PDV to plot the annotated spectra.

Best,

Fengchao

@fcyu fcyu closed this as completed Oct 15, 2020
@kevinkovalchik
Copy link
Author

kevinkovalchik commented Oct 16, 2020 via email

@fcyu
Copy link
Member

fcyu commented Oct 16, 2020

Hi Kevin,

Yes, Comet can output matched ions only for .out files. As far as I know, pepxml doesn't have any field for this information.

Best,

Fengchao

@EvgeniyReznik
Copy link

Hello,

Are there any plans to implement this feature in the future?

Thanks,
Evgeniy.

@fcyu
Copy link
Member

fcyu commented Dec 24, 2023

Yes.

Best,

Fengchao

@fcyu fcyu reopened this Dec 27, 2023
@FloBay
Copy link

FloBay commented May 13, 2024

We are also highly interested in this and more features !!

It would also be very nice to get the matched m/z error for each fragment ion too. We are switching from MQ atm, and this is one of the very useful columns in MQ.

It's a helpful qc / optimisation metric for the mass analyser and for creating all sorts of spectral visualisations.

Is there also a way to add percolator score and q value for each psm? At the moment it's not possible from the xml alone to get to the final outcome because the percolator information is missing.

Thanks & Best
Flo

@fcyu
Copy link
Member

fcyu commented May 13, 2024

Is there also a way to add percolator score and q value for each psm? At the moment it's not possible from the xml alone to get to the final outcome because the percolator information is missing.

Which xml file were you referring to? interact-*.pep.xml has the probability which is 1 - Percolator PEP.

As to the q-value, the results are not/should not be filtered with a single PSM-level q-value. FragPipe applied sequential or 2-D filtering combining PSM- and protein-level FDR. I guess you'd probably better to use the psm.tsv file.

Best,

Fengchao

@FloBay
Copy link

FloBay commented May 14, 2024

Thanks for the fast reply...

Which xml file were you referring to? interact-*.pep.xml has the probability which is 1 - Percolator PEP.

Thanks for pointing to the other xml file. I only checked the *.pepXML but not the interact-*.pep.xml.
Do you have a reference file somewhere explaining the columns (e.g. ntt, nmc)? So the 'peptideprophet_probability' is actually 1 - Percolator PEP, when run percolator? I cannot find another probability column in this file.

As to the q-value, the results are not/should not be filtered with a single PSM-level q-value. FragPipe applied sequential or 2-D filtering combining PSM- and protein-level FDR. I guess you'd probably better to use the psm.tsv file.

I totally agree for standard usage. But, we were playing a bit around (e.g. Plot target decoy distribution of scores, mass errors, etc...) while optimising a new MS + search engines atm. And we couldn't find all the values we were interested in the psm.tsv, that's why I started to look at other places too. Yes, one can put things again together from the Percolator output file (if not deleted) but for the not so programming experienced colleagues its not that straight forward. Thus would be cool if the PSM_Ranks, Percolator Score, Percolator PEP that was actually used would appear in the psm.tsv file eventually.

@fcyu
Copy link
Member

fcyu commented May 14, 2024

Do you have a reference file somewhere explaining the columns (e.g. ntt, nmc)?

Here is the tutorial (https://fragpipe.nesvilab.org/docs/tutorial_fragpipe_outputs.html#psmtsv), but it is a little outdated. I will update it in the following weeks.

So the 'peptideprophet_probability' is actually 1 - Percolator PEP, when run percolator? I cannot find another probability column in this file.

That's correct, PeptideProphet Probability is the PSM probability. When using PeptideProphet, it is PeptideProphet probability. When using Percolator, it is 1 - Percolator PEP. We will change the column name to Probability in the next release.

Thus would be cool if the PSM_Ranks, Percolator Score, Percolator PEP that was actually used would appear in the psm.tsv file eventually.

ranks and Percolator PEP are already (kind of) in the psm.tsv, but the Percolator score is not. We will discuss internally to see if it is necessary to propagate this information, since it is not very useful to general users.

Best,

Fengchao

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants