Performance: Speed up of deepcopy for Miller #461

CSSFrancis · 2023-10-26T18:51:25Z

Description of the change

Speeds up deep copy for Miller class. Current deep copy is slowed down by using the deepcopy function.

This causes issues in diffsims when we have to repeatedly copy and rotate some ReciprocalLatticeVector

For reviewers

The PR title is short, concise, and will make sense 1 year later.
New functions are imported in corresponding __init__.py.
New features, API changes, and deprecations are mentioned in the unreleased
section in CHANGELOG.rst.
Contributor(s) are listed correctly in __credits__ in orix/__init__.py and in
.zenodo.json.

hakonanes · 2023-11-02T11:24:57Z

Thank you for looking into making deepcopying faster.

Your solution only deepcopies the vectors and not the phase, which is not in line with the current behavior. The phase should not be shared. If deepcopying the phase is a bottleneck we should look into improving the speed of this method instead.

hakonanes · 2023-11-02T15:09:06Z

I realize now that copying only the data is what you want in the case you describe... Are you happy with a parameter deepcopy(data_only=False)? If true, your current solution is used. It is false by default, retaining the current behavior.

CSSFrancis · 2023-11-02T16:44:47Z

@hakonanes let me look into this more. I think deepcopying only the data is a good solution but I'll look into the phase deepcopy as well.

hakonanes · 2023-11-03T15:53:27Z

A sidenote: I've considered replacing deepcopy() with just copy() (and deprecating deepcopy() one minor release). The behavior is the same. But the naming is in line with NumPy's interpretation of a "copy" (not shared memory) and a "view" (shared memory). What do you think?

harripj · 2023-11-29T18:46:29Z

@CSSFrancis @hakonanes thanks for looking into this!

I think the deepcopy API should return a deep copy of all relevant data in this case. Whilst this PR may lead to speedups, I think this could be a footgun for the majority of users who would not expect any memory to be shared when using the API.

Are there other examples in other codebases where deepcopy returns partial copies?

If the speed ups are useful in diffsims then I think this PR should be an internal optimisation rather than in orix, such that the copying scope is well-defined and controlled, ie. it is known that phase is shared memory.

hakonanes · 2023-11-30T17:48:50Z

Thank you for your input, @harripj. I agree with you on all points. The deepcopy method is meant to be used by end users or in the setup of more demanding methods, not repeatedly inside a loop or similar.

But if we can get any speed up that would be nice.

Performance: Speed up of deepcopy for Miller

75abcea

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance: Speed up of deepcopy for Miller #461

Performance: Speed up of deepcopy for Miller #461

CSSFrancis commented Oct 26, 2023

hakonanes commented Nov 2, 2023 •

edited

hakonanes commented Nov 2, 2023

CSSFrancis commented Nov 2, 2023

hakonanes commented Nov 3, 2023

harripj commented Nov 29, 2023 •

edited

hakonanes commented Nov 30, 2023

Performance: Speed up of deepcopy for Miller #461

Are you sure you want to change the base?

Performance: Speed up of deepcopy for Miller #461

Conversation

CSSFrancis commented Oct 26, 2023

Description of the change

For reviewers

hakonanes commented Nov 2, 2023 • edited

hakonanes commented Nov 2, 2023

CSSFrancis commented Nov 2, 2023

hakonanes commented Nov 3, 2023

harripj commented Nov 29, 2023 • edited

hakonanes commented Nov 30, 2023

hakonanes commented Nov 2, 2023 •

edited

harripj commented Nov 29, 2023 •

edited