Improved semantics for destroy() #2126

andralex · 2018-03-02T18:12:13Z

This supersedes #2124 and previous attempts. Apologies for the override but it seemed easier to simply write this than discuss what to do over several sessions. cc @JinShil

dlang-bot · 2018-03-02T18:12:15Z

Thanks for your pull request, @andralex!

Bugzilla references

Your PR doesn't reference any Bugzilla issue.

If your PR contains non-trivial changes, please reference a Bugzilla issue or create a manual changelog.

JinShil · 2018-03-02T23:43:22Z

src/object.d

+Detailed behavior of `destroy(obj)` depends on the type `T` as
+follows:
+
+$(UL $(LI If `T` is a primitive type, a `struct` types that does not


`struct` type (singular)

JinShil · 2018-03-02T23:46:36Z

src/object.d

-references any other objects. It does $(I not) initiate a GC cycle or free
-any GC memory.
+Destroys an object by means of calling its destructor, if present. For
+types `T` that define a destructor, `destroy` then also fills the


probably best to use "... that have a destructor" here since that's the verbage used in the following "NOTE".

JinShil · 2018-03-03T00:09:40Z

This is better than my prior understanding of @andralex's specification as, at least, reference types and value types are treated uniformly. However, I still don't understand why having a destructor or not having a destructor should result in different behavior.

And, it looks like we'll have to do something about Higgs. cc @sbstp See the Jenkins "details".

sbstp · 2018-03-04T05:54:15Z

I'm not the author of Higgs, you should probably cc @maximecb.

maximecb · 2018-03-04T06:32:39Z

Oha @JinShil.

So, it's been a long time since I've looked at this code, but this might be the source of your problem: https://github.com/higgsjs/Higgs/blob/master/source/runtime/gc.d#L914

12345swordy · 2018-03-04T23:52:45Z

no attempts of fixing bugs regarding the class deconstructor when it comes to attributes?

JinShil · 2018-03-07T07:18:25Z

Breaking Higgs

I removed one destructor from Higgs in an attempt to make it compatible with this PR, but there are still 3 more that need attention:

https://github.com/higgsjs/Higgs/blob/dbe1fb993addfef98e900df6fe2e1e3afe8ecae4/source/runtime/object.d#L512
https://github.com/higgsjs/Higgs/blob/dbe1fb993addfef98e900df6fe2e1e3afe8ecae4/source/runtime/vm.d#L805
https://github.com/higgsjs/Higgs/blob/dbe1fb993addfef98e900df6fe2e1e3afe8ecae4/source/runtime/gc.d#L87

Alternate Proposal

I propose decoupling destruction from re-initialization for all cases, and delegating the decision and method of re-initialization to the user. That puts the burden on the user to understand what they need to do and why, but I believe that's responsibility the user opts into when they choose to implement a destructor. Consider the following contrived examples.

Method 1: Use a boolean flag to indicate whether or not the object has already been destroyed

struct S 
{
    void* p;
    bool destroyed = false;
    this(size_t size)
    {
        p = allocateResource(size);  // may not have the same semantics as `malloc`
    }
    ~this() 
    {
        if (!destroyed)
        {
            deallocateResource(p);   // may not have the same semantics as `free`
            destroyed = true;
        }
    }
}

Method 2: Use a special state to indicate whether or not the object has already been destroyed

struct S 
{
    void* p;
    this(size_t size)
    {
        p = allocateResource(size);   // may not have the same semantics as `malloc`
    }
    ~this() 
    {
        if (p == null)
        {
            deallocateResource(p);   // may not have the same semantics as `free`
            p = null;
        }
    }
}

Method 3: Re-destuction is benign due to the semantics of the resource deallocation

struct S 
{
    void* p;
    this(size_t size)
    {
        p = allocateResource(size);  // same semantics as `malloc`
    }
    ~this() 
    {
        deallocateResource(p);     // same semantics as `free`
        p = null;
    }
}

Learning from C#

There may also be some wisdom to be found in C#'s dispose pattern documented here: https://docs.microsoft.com/en-us/dotnet/standard/design-guidelines/dispose-pattern. It is Microsoft's solution to basically the same problem.

JinShil · 2018-03-07T07:18:39Z

no attempts of fixing bugs regarding the class deconstructor when it comes to attributes?

That's out of scope for now, and probably needs changes to be made in DMD.

andralex · 2018-03-08T03:05:55Z

@JinShil the destructors in Higgs look reasonable. In what way are they broken?

As a general disposition, we're going toward more, not less, safety. So the situation in which we make destroy leave the object in an unsafe state for destruction is not acceptable. Yes, per your examples there are numerous ways in which user code can make the destructor safe. (Those include of course overwriting the object with .init, which is exactly what destroy does!) But it is impossible for the compiler to assess that the changes make the destroy/~this combo safe.

JinShil · 2018-03-08T03:31:10Z

In what way are they broken?

I believe it is their mere existence, causing the object to be re-initialized, that causes problems for the Higg's test suite. But, I need to look into it in more detail to be sure.

JinShil · 2018-03-20T12:26:52Z

@andralex How confident are you in this implementation? For some reason, noone from team DRuntime seems very interested in it (or maybe you're just too scary). Are you willing to play the dictator with this, or are you looking for feedback?

andralex · 2018-03-20T13:36:49Z

@JinShil thanks for asking! :) I think the intent is what we need; of course it can be improved, and also the implementation may have issues. So feedback is welcome.

Generally destroy needs to rely on what we already have in the language instead of requiring new paraphernalia or asking the user to rewrite their destructors in a certain way.

The mechanisms at our disposal are the following:

Destructors get rid of an object's extra resources.
Destructors leave the object in an invalid state; adding a requirement here is not tenable.
Every object (even those with @disable this();) has a well-defined init state
The init state does not allocate any resources and can be copied with memcpy
At least for objects without @disable this();, an object in its initial state must be destroyable, otherwise trivial code such as { T t; } would be incorrect.
We can use introspection to figure out whether an object has a destructor or not.

The fundamental approach of this PR is simple - calling destroy should call the destructor, if any, and leave the object in a valid destroyable state. So if the object has no destructor there's no need to do anything at all - leave the object as is, it's as destroyable as it was before calling destroy. If the object does have a destructor we should call it and then repaint the object with its initial state - the one that we can reasonably assume it's safely destroyable. If we remove from this behavior we allow for unsafe code (people call destroy and then the destructor gets automatically invoked). If we add to this behavior we do work for naught (as was the case before - why should destroy on an int set it to zero? Every int has as much resources (none) and as many rights as others). If we add requirements to the destructor we break all existing destructors. If we add compiler support we incur overhead. There's really not a lot of wiggle room here.

I'm unclear on how Higgs got broken, and probably we need to get to the bottom of that. Where can be failures be seen and how do they manifest themselves?

wilzbach · 2018-03-20T13:46:37Z

I'm unclear on how Higgs got broken, and probably we need to get to the bottom of that. Where can be failures be seen and how do they manifest themselves?

See Mike's comment above. They can be seen on the Project Tester:

https://ci.dlang.io/blue/organizations/jenkins/dlang-org%2Fdruntime/detail/PR-2126/2/pipeline

For local reproduction, do sth. like:

git clone https://github.com/higgsjs/Higgs
cd Higgs
dub --compiler=/home/seb/dlang/dmd/generated/linux/release/64/dmd

JinShil · 2018-03-21T04:07:02Z

I'm unclear on how Higgs got broken, and probably we need to get to the bottom of that. Where can be failures be seen and how do they manifest themselves?

The symptom appears in the Higgs test suite where the unitttests are specifically testing if an object is in a certain state:

Assertion failed (runtime/gc.d@472): gcForward: object not in from-space heap

https://github.com/higgsjs/Higgs/blob/dbe1fb993addfef98e900df6fe2e1e3afe8ecae4/source/runtime/gc.d#L472

I believe this is caused simply by the existence of some of destructors in the Higgs code base. I already removed 1 empty constructor which solved one error, but another appeared. There are current 2 more empty constructors in Higgs:

https://github.com/higgsjs/Higgs/blob/dbe1fb993addfef98e900df6fe2e1e3afe8ecae4/source/runtime/object.d#L512
https://github.com/higgsjs/Higgs/blob/dbe1fb993addfef98e900df6fe2e1e3afe8ecae4/source/runtime/vm.d#L805

I tried to remove those as well, but it didn't solve the problem we're currently experiencing.

The current error appears to be due to the following destructor, which is not empty:

https://github.com/higgsjs/Higgs/blob/dbe1fb993addfef98e900df6fe2e1e3afe8ecae4/source/runtime/gc.d#L87

I tried to fix that also to make it compatible with this PR, but ultimately failed. I don't understand the code or the test suite logic well enough to make the change.

If you are convinced that this PR is the way to go, and are willing to exercise your authority, given the lack of reviews, I propose the following procedure:

Remove Higgs from the Jenkins test suite
File an issue in the Higgs repository and leave it to the authors there to fix it
Merge this PR
If and when Higgs is fixed to make it compatible with this PR, we can re-add it to the Jenkins test suite.

Note that I am not a member of team DRuntime, so if you don't want to merge your own PR, you'll still have to convince someone to review this PR. I'm not crazy about these semantics, but I also don't have any other solution. I'd be hesitant putting my name next to the merge notice even if I were on team DRuntime, but I'd do it if enough time passed without anyone else stepping up with an alternate solution.

Pinging @klickverbot @schveiguy @MartinNowak @rainers @ZombineDev @DmitryOlshansky @maximecb for help.

12345swordy · 2018-06-07T17:16:59Z

!ping

ntrel · 2018-06-07T17:39:17Z

Optimization: If a class/struct aggregate does not define a destructor itself, but one of its fields defines a destructor, we need to do field = field.init so the field destructor can be called again later. The aggregate's generated destructor will not access any fields that don't have a destructor, so we do not need to reinit those. This pull seems to always reinit all fields of the aggregate (which is fine, but overkill).

Also, I like the new semantics of destroy, but I expect people have relied on it always doing the reinit. I read the discussion in #2115, and even if the doc description was vague (until recently?), the [documented] unittests were not - they clearly show reinit, implying that users can rely on it. I suggest this pull either uses a different function name or somehow uses deprecation to make users aware of the change.

andralex · 2018-06-07T19:50:49Z

@ntrel we won't cater to people relying on undocumented behavior. This not only creates difficulties for this particular PR, but creates a dangerous precedent whereby we dig ourselves in bureaucracy.

schveiguy

I approve the general idea, but needs a few things, see comments.

schveiguy · 2018-06-07T21:07:31Z

src/object.d

+
+$(UL $(LI If `T` is a primitive type, a `struct` types that does not
+have a destructor, or a `class` type that does not have a destructor,
+the call to has no effect.)


"call to has no effect" Maybe missing "destroy" in there?

Note, this is technically incorrect for a class type with no destructor, as destroying a class instance that has no destructor is still going to call rt_finalize, which at least sets the vtable to null.

schveiguy · 2018-06-07T21:07:37Z

src/object.d

 */
-void destroy(T)(T obj) if (is(T == class))
+void destroy(T)(T obj) if (is(T == class) || (is(T == interface)))


I think there is a pre-existing bug in here, as I think the static test will pass for extern(C++) classes, which is definitely NOT correct to call rt_finalize on. Something to think about as we move towards betterC.

And Objective-C classes. There’s a recent change where __traits(getLinkage) supports classes.

schveiguy · 2018-06-07T21:25:06Z

src/object.d

@@ -3145,9 +3231,8 @@ unittest

 /// ditto
 void destroy(T)(ref T obj)
-    if (!is(T == struct) && !is(T == interface) && !is(T == class) && !_isStaticArray!T)
+if (!is(T == struct) && !is(T == interface) && !is(T == class) && !_isStaticArray!T)


This is going to accept an lvalue dynamic array, which is potentially going to override the array code above. I'm not 100% sure, because it's a specialization.

This may fail:

static struct S { int x; ~this() {} } S[] arr = new S[5]; arr[] = S(42); destroy(arr); assert(arr[0].x == 0); // will this fail?

ntrel · 2018-06-09T10:57:37Z

people relying on undocumented behavior

@andralex I suggest we revert #2054 ASAP unless this is merged now. #2054 was merged on 27 Jan (it also says you approved those changes).

Observation: The hasElaborateDestroy changes and the destroy(T[]) changes are orthogonal and could have been separated.

andralex added 2 commits March 2, 2018 11:39

Eliminate obsolete workaround for bugzilla 14746

c282903

Implement enhanced behavior of destroy

3d6a92e

andralex requested a review from MartinNowak as a code owner March 2, 2018 18:12

JinShil mentioned this pull request Mar 2, 2018

New destroy implementation for value types and instinsic types #2124

Closed

JinShil reviewed Mar 2, 2018

View reviewed changes

JinShil mentioned this pull request Mar 6, 2018

Remove destructor from IRFunction higgsjs/Higgs#214

Merged

schveiguy approved these changes Jun 7, 2018

View reviewed changes

thewilsonator added the Needs Rebase needs a `git rebase` performed label May 5, 2019

dlang-bot added Needs Work stalled labels May 27, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved semantics for destroy() #2126

Improved semantics for destroy() #2126

andralex commented Mar 2, 2018

dlang-bot commented Mar 2, 2018

JinShil Mar 2, 2018

JinShil Mar 2, 2018 •

edited

JinShil commented Mar 3, 2018 •

edited

sbstp commented Mar 4, 2018

maximecb commented Mar 4, 2018

12345swordy commented Mar 4, 2018

JinShil commented Mar 7, 2018

JinShil commented Mar 7, 2018

andralex commented Mar 8, 2018

JinShil commented Mar 8, 2018 •

edited

JinShil commented Mar 20, 2018

andralex commented Mar 20, 2018 •

edited

wilzbach commented Mar 20, 2018

JinShil commented Mar 21, 2018

12345swordy commented Jun 7, 2018

ntrel commented Jun 7, 2018 •

edited

andralex commented Jun 7, 2018

schveiguy left a comment

schveiguy Jun 7, 2018

schveiguy Jun 7, 2018

jacob-carlborg Jun 8, 2018

schveiguy Jun 7, 2018

ntrel commented Jun 9, 2018

Improved semantics for destroy() #2126

Are you sure you want to change the base?

Improved semantics for destroy() #2126

Conversation

andralex commented Mar 2, 2018

dlang-bot commented Mar 2, 2018

Bugzilla references

JinShil Mar 2, 2018

Choose a reason for hiding this comment

JinShil Mar 2, 2018 • edited

Choose a reason for hiding this comment

JinShil commented Mar 3, 2018 • edited

sbstp commented Mar 4, 2018

maximecb commented Mar 4, 2018

12345swordy commented Mar 4, 2018

JinShil commented Mar 7, 2018

Breaking Higgs

Alternate Proposal

Learning from C#

JinShil commented Mar 7, 2018

andralex commented Mar 8, 2018

JinShil commented Mar 8, 2018 • edited

JinShil commented Mar 20, 2018

andralex commented Mar 20, 2018 • edited

wilzbach commented Mar 20, 2018

JinShil commented Mar 21, 2018

12345swordy commented Jun 7, 2018

ntrel commented Jun 7, 2018 • edited

andralex commented Jun 7, 2018

schveiguy left a comment

Choose a reason for hiding this comment

schveiguy Jun 7, 2018

Choose a reason for hiding this comment

schveiguy Jun 7, 2018

Choose a reason for hiding this comment

jacob-carlborg Jun 8, 2018

Choose a reason for hiding this comment

schveiguy Jun 7, 2018

Choose a reason for hiding this comment

ntrel commented Jun 9, 2018

JinShil Mar 2, 2018 •

edited

JinShil commented Mar 3, 2018 •

edited

JinShil commented Mar 8, 2018 •

edited

andralex commented Mar 20, 2018 •

edited

ntrel commented Jun 7, 2018 •

edited