Enable time-of-flight indexing and Laue/ToF refinement #2662

toastisme · 2024-05-01T15:15:14Z

Apologies for this being a large PR - it's difficult to separate out the different components. This adds indexing and refinement for time-of-flight data and Laue data more broadly. The main contributions are Laue specific refinement classes that also minimise degrees of freedom with respect to the calculated and observed wavelength of each reflection.

Usage:
dials.index imported.expt strong.refl reflections.weighting_strategy.override=<constant, statistical> reflections.weighting_strategy.wavelength_weight=<weight>

Background

The diffraction condition can be written

$$\mathbf{p^*_0}\cdot \mathbf{p^*_0} + 2\mathbf{p^*_0}\cdot\mathbf{S_0} = 0,$$

where $\mathbf{p^*_0}$ is the reciprocal lattice vector obtained from the candidate UB matrix for a given Miller index. For a rotation experiment we satisfy the condition by identifying the angle $\mathbf{p^*_0}$ must be rotated by to cross the Ewald sphere. For a Laue experiment, we can instead find the required wavelength by rearranging the diffraction condition to give

$$\lambda = -2\frac{\mathbf{\hat{S_0}} \cdot \mathbf{p^*_0}}{\mathbf{p^*_0}\cdot \mathbf{p^*_0}}.$$

The target function to minimise during refinement is a weighted sum of squared residuals over all $n$ observed reflections

$$L = \frac{1}{2}\sum^n_{i=1}w_{i,X}(X-X_{obs})^2 + w_{i,Y}(Y-Y_{obs})^2 + w_{i,\lambda}(\lambda - \lambda_{obs})^2,$$

where $X, Y, \lambda$ correspond to calculated reflection centroid positions in the x, y panel positions, and a wavelength calculated as above, respectively. Likewise, $X_{obs}, Y_{obs}, \lambda_{obs}$ refer to the centroid positions obtained from spot finding. $w_{i,X}, w_{i,Y}, w_{i,\lambda}$ are weight coefficients based on the strategy selected by the user. $L$ has first derivatives

$$\frac{\partial L}{\partial p} = \sum^n_{i=1}w_{i,X}\frac{\partial X}{\partial p}(X-X_{obs}) + w_{i,Y}\frac{\partial Y}{\partial p}(Y-Y_{obs}) + w_{i,\lambda}\frac{\partial \lambda}{\partial p}(\lambda - \lambda_{obs}),$$

where $p$ refers to a degree of freedom of the model obtained from indexing (e.g. unit cell angles, lengths, panel positions etc.). To refine centroids with respect to their calculated wavelengths we need to calculate $\frac{\partial \lambda}{\partial p}$. To do this we follow the same approach as for rotational data[1] and use the implicit function theorem. This states that if a given function $G(\lambda, p) = c$, $\lambda(p)$ has derivatives

$$\frac{\partial}{\partial p}\lambda(p) = -\frac{\frac{\partial G}{\partial p}}{\frac{\partial G}{\partial \lambda}}.$$

In this case $G$ is the diffraction condition, and hence

$$\frac{\partial \lambda}{\partial p} = -\frac{\frac{\partial}{\partial p}(\mathbf{p^*_0}\cdot \mathbf{p^*_0} + 2\mathbf{p^*_0}\cdot\mathbf{S_0})}{\frac{\partial}{\partial\lambda}(\mathbf{p^*_0}\cdot \mathbf{p^*_0} + 2\mathbf{p^*_0}\cdot\mathbf{S_0})}.$$

Terms in the numerator have already been worked out in DIALS[1], so only the denominator needs calculating.

For the first term in the denominator,

$$\frac{\partial}{\partial \lambda}\mathbf{p^*_0}\cdot\mathbf{p^*_0} = \mathbf{p^*_0}\cdot\frac{\partial \mathbf{p^*_0}}{\partial \lambda} + \frac{\partial \mathbf{p^*_0}}{\partial \lambda}\cdot\mathbf{p^*_0}$$

$$\begin{split} \frac{\partial \mathbf{p^*_0}}{\partial \lambda} &= \frac{\partial}{\partial\lambda}(\mathbf{S'}-\mathbf{S_0})\\\ &= \frac{\partial}{\partial\lambda}(\mathbf{S'}-\mathbf{\hat{S_0}}\lambda^{-1})\\\ &= \frac{1}{\lambda}\mathbf{S_0}, \end{split}$$

and hence

$$\frac{\partial}{\partial \lambda}\mathbf{p^*_0}\cdot\mathbf{p^*_0} = \frac{2}{\lambda}\mathbf{p^*_0}\cdot\mathbf{S_0}.$$

For the second term,

$$\frac{\partial}{\partial\lambda}(\mathbf{p^*_0}\cdot\mathbf{S_0}) = \mathbf{p^*_0}\cdot\frac{\partial \mathbf{S_0}}{\partial \lambda} + \mathbf{S_0}\cdot\frac{\partial \mathbf{p^*_0}}{\partial \lambda}$$

$$\frac{\partial\mathbf{S_0}}{\partial\lambda} = -\frac{\mathbf{S_0}}{\lambda},$$

and so

$$\begin{split} \frac{\partial}{\partial\lambda}(\mathbf{p^*_0}\cdot\mathbf{S_0}) &= \frac{1}{\lambda}(\mathbf{S_0}\cdot\mathbf{S_0}-\mathbf{p^*_0}\cdot\mathbf{S_0}). \end{split}$$

Putting it all together

$$\frac{\partial}{\partial\lambda}(\mathbf{p^*_0}\cdot \mathbf{p^*_0} + 2\mathbf{p^*_0}\cdot\mathbf{S_0}) = \frac{2}{\lambda}\mathbf{S_0}\cdot\mathbf{S_0}$$

$$\begin{split} \frac{\partial \lambda}{\partial p} &= -\frac{\frac{\partial}{\partial p}(\mathbf{p^*_0}\cdot \mathbf{p^*_0} + 2\mathbf{p^*_0}\cdot\mathbf{S_0})}{\frac{2}{\lambda}\mathbf{S_0}\cdot\mathbf{S_0}}\\\ &= -\frac{\mathbf{p^*_0}\cdot\frac{\partial\mathbf{p^*_0}}{\partial p} + \mathbf{S_0}\cdot\frac{\partial\mathbf{p^*_0}}{\partial p} + \mathbf{p^*_0}\frac{\partial\mathbf{S_0}}{\partial p}}{\frac{1}{\lambda}\mathbf{S_0}\cdot\mathbf{S_0}}\\\ &= -\frac{\lambda(\frac{\partial\mathbf{p^*_0}}{\partial p}\cdot\mathbf{S'}+\mathbf{p^*_0}\cdot\frac{\partial\mathbf{S_0}}{\partial p})}{\mathbf{S_0}\cdot\mathbf{S_0}}, \end{split}$$

[1] D. G. Waterman, G. Winter, R. J. Gildea, J. M. Parkhurst, A. S. Brewster, N. K. Sauter, and G. Evans.
Diffraction-geometry refinement in the DIALS framework. Acta Crystallographica Section D, 72(4):558–
575, Apr 2016.

Weighting strategies

I looked at constant and statistical weighting strategies for $w_{\lambda}$. At least for time-of-flight experiments, the variance along the wavelength direction for each spot is less obviously useful, as the spot profile typically has a long tail. Instead I used the variance in the X and Y multiplied by a constant. Results for different values are given below. exp (blue dashed line) refers to expected results for a given unit cell parameter. Looking at the average mean squared error across the datasets, statistical weighting seems generally better, with the best weighting being 1E4.

…r wavelength/s0 in reflection table first.

…ve oscillation.

…n managers.

…ategy and fix typo.

…ypo.

…f. Added wavelength columns to refinement output.

… range.

…thm to mcd. Fix typo when checking wavelength_strategy override.

biochem-fan · 2024-05-02T01:05:06Z

This adds indexing and refinement for time-of-flight data and Laue data more broadly. The main contributions are Laue specific refinement classes that also minimise degrees of freedom with respect to the calculated and observed wavelength of each reflection.

Just to make sure, is this PR for Laue data collected with ToF detectors? In other words, we cannot use this on X-ray Laue or pink beam datasets without ToF information (i.e. unknown per-spot $\lambda_{obs}$), can we?

dagewa · 2024-05-03T08:23:54Z

src/dials/algorithms/refinement/prediction/managed_predictors.py

+                    raise ValueError(
+                        "Cannot find max cell for ToF and non-ToF experiments at the same time"
+                    )


I'm not sure this is the right error message. This class gets called in quite a number of places, not just by the indexer where I assume the max cell issue is coming from. So we might end up here in some other situation. Maybe it should be "Cannot create an ExperimentsPredictor for ToF and non-ToF experiments at the same time"

toastisme · 2024-05-03T09:28:27Z

This adds indexing and refinement for time-of-flight data and Laue data more broadly. The main contributions are Laue specific refinement classes that also minimise degrees of freedom with respect to the calculated and observed wavelength of each reflection.

Just to make sure, is this PR for Laue data collected with ToF detectors? In other words, we cannot use this on X-ray Laue or pink beam datasets without ToF information (i.e. unknown per-spot λobs), can we?

I've written it so you should be able to use this for any data where you have assigned wavelengths to observed reflections, and you have enough confidence in the assignment to minimise your model against them. All your data should need is a wavelength column in your reflection table. The only thing missing is a consistent way for DIALS to recognise you have a Laue experiment in the absence of a reflection table, which is the motivation for #733.

If you have any Laue data I could use a basis of a test that would be useful!

biochem-fan · 2024-05-03T23:32:37Z

@toastisme
Most of public datasets are from pink beam serial crystallography (e.g. https://zenodo.org/records/8354296, https://www.cxidb.org/id-180.html)

The only Laue dataset I found is https://zenodo.org/records/10199220.

dagewa · 2024-05-23T14:00:31Z

There are quite a lot of new classes added here, but no added tests.

LauePredictionParameterisation
LaueReflectionManager
LaueReflectionPredictor
LaueExperimentsPredictor
TOFExperimentsPredictor
LaueReflectionManager
TOFReflectionManager
TOFLeastSquaresResidualWithRmsdCutoff
LaueLeastSquaresResidualWithRmsdCutoff
LaueStatisticalWeightingStrategy
LaueMixedWeightingStrategy

I don't think it is necessary to write individual tests for all of these, as not all of the rotation equivalents of these classes are tested directly in a unit test like manner either. However, some of them are. For example, derivatives are tested versus finite difference approximations in places like test_beam_parameters.py, test_stills_spherical_relp_derivatives.py, etc., and for the full prediction equation parameterisation in test_prediction_parameters.py.

A similar test vs finite-differences might be useful for LauePredictionParameterisation.

There's also a test of the gradients of the target function LeastSquaresPositionalResidualWithRmsdCutoff in test_finite_diffs.py (bad module name). Perhaps something similar for TOFLeastSquaresResidualWithRmsdCutoff / LaueLeastSquaresResidualWithRmsdCutoff?

The whole machinery of rotation method refinement is also tested against generated reflection positions using ideal geometry in test_orientation_refinement.py. Might something like that be useful here?

toastisme · 2024-05-23T21:26:58Z

There are quite a lot of new classes added here, but no added tests.
* `LauePredictionParameterisation`

* `LaueReflectionManager`

* `LaueReflectionPredictor`

* `LaueExperimentsPredictor`

* `TOFExperimentsPredictor`

* `LaueReflectionManager`

* `TOFReflectionManager`

* `TOFLeastSquaresResidualWithRmsdCutoff`

* `LaueLeastSquaresResidualWithRmsdCutoff`

* `LaueStatisticalWeightingStrategy`

* `LaueMixedWeightingStrategy`
I don't think it is necessary to write individual tests for all of these, as not all of the rotation equivalents of these classes are tested directly in a unit test like manner either. However, some of them are. For example, derivatives are tested versus finite difference approximations in places like test_beam_parameters.py, test_stills_spherical_relp_derivatives.py, etc., and for the full prediction equation parameterisation in test_prediction_parameters.py.

A similar test vs finite-differences might be useful for LauePredictionParameterisation.

There's also a test of the gradients of the target function LeastSquaresPositionalResidualWithRmsdCutoff in test_finite_diffs.py (bad module name). Perhaps something similar for TOFLeastSquaresResidualWithRmsdCutoff / LaueLeastSquaresResidualWithRmsdCutoff?

The whole machinery of rotation method refinement is also tested against generated reflection positions using ideal geometry in test_orientation_refinement.py. Might something like that be useful here?

Yes completely agree. Thanks for highlighting the key areas to test. I had some dials data / dials data files prs to add some neutron data to write these kind of tests that are now merged. I was hoping to have some X-ray Laue data too.

toastisme and others added 27 commits March 27, 2024 09:08

mapping centroids to reciprocal space and entering flags now check fo…

022d9cd

…r wavelength/s0 in reflection table first.

Enabled find_max_cell to work with tof data.

417fbe0

Avoid applying z from xyzobs.mm.value as rotation if scan does not ha…

bd36f2f

…ve oscillation.

Add Laue weighting strategies.

cab82ff

Add LaueReflectionManager and separate logic when selecting reflectio…

d1cd998

…n managers.

Add oscillation check when choosing best orientation matrix.

9cd335c

Add additional columns to refinement reflections.

dcfe117

Add LaueExperimentPredictor, TOFExperimentPredictor, LaueRayPredictor

5ead133

Added LauePredictionParameterisation

26c8b57

typo

5a8bfb7

update indexer._xyzcal_mm_to_px to include tof data.

a455ef8

typo

46c0691

Check for s0 in reflection table before using beam.s0.

4144627

Fix typo in _post_predict_one_experiment.

10b8630

Add check for oscillation when checking for scan.

445bd49

Change default Laue weighting strategy to LaueStatisticalWeightingStr…

b0f2586

…ategy and fix typo.

Add LaueLeastSquaresResidualWithRmsdCutoff

d7472b3

Add missing export for laue ray predictor. Add missing include. Fix t…

3b06394

…ypo.

Fixed bug where wavelength_weight was not being set.

95da5ee

Added wavelength residual column in output.

241baec

Added TOFReflectionManager. Added TOFLeastSquaresResidualWithRmsdCutf…

01e97a5

…f. Added wavelength columns to refinement output.

Set minimum frame value to calculated reflections below interpolation…

8c524b3

… range.

Include goniometer in calculating reciprocal lattice vectors.

edcad2e

Supress warning for multiple scans being present for ToF data.

71c435b

Change default wavelength_weight value. Change default outlier algori…

72bbad6

…thm to mcd. Fix typo when checking wavelength_strategy override.

newsfragment

218cffa

Rename newsfragments/XXX.feature to newsfragments/2662.feature

a0fbec3

dagewa reviewed May 3, 2024

View reviewed changes

toastisme changed the title ~~Enable time-of-flight/Laue indexing and refinement~~ Enable time-of-flight indexing and Laue/ToF refinement Jun 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable time-of-flight indexing and Laue/ToF refinement #2662

Enable time-of-flight indexing and Laue/ToF refinement #2662

toastisme commented May 1, 2024

biochem-fan commented May 2, 2024

dagewa May 3, 2024

toastisme commented May 3, 2024 •

edited

biochem-fan commented May 3, 2024

dagewa commented May 23, 2024

toastisme commented May 23, 2024

Enable time-of-flight indexing and Laue/ToF refinement #2662

Are you sure you want to change the base?

Enable time-of-flight indexing and Laue/ToF refinement #2662

Conversation

toastisme commented May 1, 2024

Background

Weighting strategies

biochem-fan commented May 2, 2024

dagewa May 3, 2024

Choose a reason for hiding this comment

toastisme commented May 3, 2024 • edited

biochem-fan commented May 3, 2024

dagewa commented May 23, 2024

toastisme commented May 23, 2024

toastisme commented May 3, 2024 •

edited