Replies: 25 comments
-
I humbly note #1438 (comment) that you explicitly rejected this idea three weeks ago on the grounds that "derived data go stale". I offer no objection to adding the additional |
-
With regard to ensuring that the (unmerged?) MTZ and mmcif match: these should contain the same data (or, in the merged MTZ case, data directly derived from it), so one should be able to run the same merging code on both (perhaps using the audited dials version) to verify that both give the same stats as reported in the mmcif. Also, I know the mmcif has several auditing fields; we should be able to add detailed auditing to MTZ (in MTZ language I think this is referred to as 'history') so that it's obvious if the two were created at different points. This might require allowing dials.export to export multiple formats at once, but that should be easy to add. |
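To make the "same stats from both files" check concrete, here is a minimal sketch, assuming the merging statistics from each file have already been collected into plain dictionaries. The function name, the statistic names and the tolerance are all hypothetical, and nothing here calls real DIALS code.

```python
import math


def find_mismatched_stats(stats_a, stats_b, rel_tol=1e-3):
    """Return the names of statistics that differ between two result sets.

    Both arguments are plain dicts mapping a statistic name to a float.
    A statistic missing from either side counts as a mismatch.
    (Hypothetical helper for illustration; not part of DIALS.)
    """
    mismatches = []
    for name in sorted(set(stats_a) | set(stats_b)):
        a = stats_a.get(name)
        b = stats_b.get(name)
        if a is None or b is None or not math.isclose(a, b, rel_tol=rel_tol):
            mismatches.append(name)
    return mismatches


# Placeholder values, made up purely to show the call.
from_mtz = {"r_merge": 0.05, "cc_half": 0.99}
from_mmcif = {"r_merge": 0.05, "cc_half": 0.90}
print(find_mismatched_stats(from_mtz, from_mmcif))
```

An empty list would mean the two files agree within the tolerance; anything else names the statistics worth investigating.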
-
Do note that I reopened your issue, which you closed in haste, because my only objection was that you wanted to dump this stuff in the experiments file, not in the "proper place" 🙄 |
-
Agree that adding to xia2 and DUI sounds reasonable. What I meant with |
-
Right - in most cases ${USER} only cares about what |
-
This is currently not doing what I would expect
That is, I ask for mmcif, but get an mtz (and no mmcif gets written) |
-
You also need
Or you can leave out the
I agree that this is potentially confusing, but the default format is |
-
I can vouch that the confusion potential was realised. Thanks. |
-
The default values for (as an aside, I'm not a big fan of the |
-
Ok, that gets me where I wanted to be. I'm not sure about adding this by default to every |
-
Having said that, a bzip2-compressed unmerged reflections mmcif is smaller than the unmerged MTZ (in fact, just smaller than the bzip2-compressed MTZ in the case I looked at). Writing this file compressed makes sense to me, as it is meant to simply exist without change until such time that it is uploaded to the PDB. |
-
Still not convinced that this should be written out by default? Unless there is some specific pressure which means that the output via |
-
Oh wait, I see conversation has moved on - defaults in |
-
I'm keeping this open here at the moment to uncover the issues (point of confusion above, file size etc.) but I think we've agreed that the behaviour change should actually be in xia2 and DUI. In DIALS the user can get mmcif if they want. |
-
I guess if xia2 is going to write a compressed mmcif, it would make sense if DIALS writes the file as at present and xia2 does the compression? |
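For what it's worth, the compression step on the calling side can be small. A minimal sketch using only the Python standard library; the helper name and the choice to delete the uncompressed file by default are assumptions for illustration, not xia2 behaviour.

```python
import bz2
import shutil
from pathlib import Path


def compress_bz2(path, keep_original=False):
    """Compress a file to <name>.bz2 next to it and return the new path.

    Hypothetical helper: the caller (e.g. a pipeline wrapper) would run this
    on the mmcif after the export step has written it uncompressed.
    """
    src = Path(path)
    dst = src.with_name(src.name + ".bz2")
    # Stream the contents so large reflection files are not held in memory.
    with open(src, "rb") as fin, bz2.open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)
    if not keep_original:
        src.unlink()
    return dst
```

Keeping the compression outside the export program means the export itself stays unchanged, and the wrapper decides whether the "file of record" is stored compressed.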
-
Ideally, |
-
Would it be a performance advantage if Lines 77 to 78 in efe8a19 |
-
That would be very useful imo. Do we even need the format= if the user provides output filenames? MTZ could be default if no filename is provided. If mmcif.hklout is set then you get only an mmcif. If mmcif.hklout and mtz.hklout are set then you get both. Would that work? |
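The rule proposed above can be written down as a minimal sketch. The function name and the default filename "scaled.mtz" are hypothetical; only mtz.hklout and mmcif.hklout come from the proposal, and this is not how dials.export is implemented today.

```python
def outputs_from_filenames(mtz_hklout=None, mmcif_hklout=None,
                           default_mtz="scaled.mtz"):
    """Decide which files to write from the filenames the user provided.

    - neither set  -> MTZ only, under a default name (assumed name)
    - one set      -> only that format
    - both set     -> both formats
    """
    outputs = {}
    if mtz_hklout:
        outputs["mtz"] = mtz_hklout
    if mmcif_hklout:
        outputs["mmcif"] = mmcif_hklout
    if not outputs:
        # No filename given at all: fall back to the MTZ default.
        outputs["mtz"] = default_mtz
    return outputs
```

With this rule there is no separate format= parameter to get out of sync with the filenames, which is the confusion reported earlier in the thread.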
-
Or a single output.hklout parameter where the format is recognised by extension. This could be multi, so output.hklout=foo.mmcif output.hklout=foo.mtz gives both types. Providing neither defaults to mtz |
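A minimal sketch of the extension-based variant, assuming a repeated output.hklout arrives as a list of filenames. The extension table (including treating .cif as mmcif), the default filename and the function name are assumptions for illustration only.

```python
from pathlib import Path

# Assumed mapping from file extension to output format.
EXTENSION_TO_FORMAT = {".mtz": "mtz", ".mmcif": "mmcif", ".cif": "mmcif"}


def resolve_outputs(hklout=(), default="scaled.mtz"):
    """Map each requested filename to a format based on its extension.

    Returns a list of (format, filename) pairs. With no filenames given,
    the result is a single MTZ under an assumed default name.
    """
    names = list(hklout) or [default]
    resolved = []
    for name in names:
        suffix = Path(name).suffix.lower()
        if suffix not in EXTENSION_TO_FORMAT:
            raise ValueError(f"Cannot infer output format from {name!r}")
        resolved.append((EXTENSION_TO_FORMAT[suffix], name))
    return resolved
```

For example, resolve_outputs(["foo.mmcif", "foo.mtz"]) requests both types, while an unrecognised extension fails loudly instead of silently writing the default format.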
-
Hi
I guess I should comment on the experience that Phenix had with outputting mmCIF.
They found that they had to output both PDB and mmCIF by default after every refinement run – the PDB was used by users for viewing in Coot / further model building and the mmCIF only used when it came to deposition. They found if they put another barrier in the way of producing mmCIF (i.e. users must select an extra option) then users never used it and then complained that they didn’t have an mmCIF file to deposit to the PDB.
HTH
John
|
-
Thanks @berrisfordjohn, we recognise the problem. We assume that most users (especially those using a CCP4 interface like i2 or the cloud) would be driving xia2 or DUI, so always writing out the mmcif "file of record" at the end of these jobs makes sense. For DIALS command-line use we tend towards getting nothing for free. The user at that level should specify exactly what they want. |
Beta Was this translation helpful? Give feedback.
-
Yes, this makes perfect sense.
|
-
Hi
Did this discussion get concluded?
If so, are there now new / existing options for Dials / Xia2 to output an mmCIF file which contains data processing statistics?
I would like to pass this information onto Eugene for inclusion in CCP4 so that users can use the statistics from Dials for deposition.
I would also like to pass this onto Frank’s group at Diamond as we are working with them to improve automatic deposition for fragment screening.
Thanks
John
|
-
Unless there are objections I sense tacit agreement that DUI and xia2 should run However, the discussion has also identified areas in which An alternative would be to write out merged data only, along with the statistics, to keep file size down, |
-
Hi everyone, I came here from David's comment in project-gemmi/gemmi#116 (comment). IIUC, two separate issues are discussed here:
The first thing is important for the deposition task, and a file with statistics only would be small, so size is not a concern. An mmCIF file with scaled unmerged reflections could be useful if it contained information that's not in the MTZ. |
-
Following #281 we can write out mmCIF formatted reflections files from DIALS programs. As a subset of this, we include data scaling statistics in this file (see 007bea5). The PDB deposition pipeline can now accept the mmCIF-formatted scaling statistics files to merge with the coordinate mmCIF file, as noted by @berrisfordjohn. The developers of CCP4 interfaces would like it if this file was written out by default. In particular, if xia2 and DUI produced these files on every run to completion, then it would make deposition easier down the line.
Should this file be output by default whenever dials.scale is run?
What issues are there? I can see one: how do we ensure that the statistics match with a particular MTZ used for structure determination?