[Analyzer] Add union types to Wasp #381

Martinsos · 2021-12-04T14:47:45Z

If we have route and if it can point to either page or another route, how would this work right now in Wasp?

We would have to say, in Wasp langauge definition, that route declaration type has three fields:

required path :: String
optional page :: Ref Page
optional route :: Ref Route
and then either page or route field need to be set, but both can't be set at the same time. However, this we can't capture in the type system and we would have to check for this later, either in some very late stage of Analyzer, or in the Generator itself.

However, if we had sum types, then we could define it like this:

required path :: String
required to :: Ref Page | Ref Route
and then this requirement would be captured by the type system!

I am not sure how hard would it be to implement sum type, at first thought it doesn't sound like it should be super hard, but I am not sure until we dive deeper into it.

This is not urgent but it would be cool to have!

The text was updated successfully, but these errors were encountered:

Martinsos · 2021-12-24T11:08:33Z

Additional thoughts:

Once we implement sum types, unification (unifyTypes) will always result with a sum type if it can't find another way to unify types, right? So if it can't unify types T1 and T2 in any other way, it should unify them as T1 | T2 I think. In that case, there won't be any UnificationErrors anymore, right? Hmmmm, interesting. If so, this will potentially result in nicer error messages, since we won't have the "can't mix the following types" messages any more, instead it will be unexpected type error with actual type being a sum type, which offer more information (the expected type).
When unifying types, what happens when we try to unify sum type with another type? I think what we need to do in that case is try to unify that another type with every single type from the sum type, one by one. If we manage to unify it with one of those, we stop and replace that type in the sum type with unified type. If not, we add this another type to the sum type as a new option. This should work recursively and also if another type is also a sum type.

Martinsos · 2022-02-22T09:19:58Z

@sodic and I discussed how to implement sum types in Wasp, here are some conclusions:

When unifying two types, we should try to construct as specific type as possible (this is what other languages, like TypeScript, also do). Sometimes exceptions are made for ergonomical reasons (check this TypeScript example), but it is not yet obvious that any exceptions should be made in the current Wasp type system.
Currently, when unifying dicts in Wasp, we have special rules how to do it in regards with optional dict fields, but with sum types, we can remove this special logic and just do normal sum types unification -> this should work equally well and we have no special rules.
So, specifically, based on conclusions above, when unifying two dictionaries, result will always be the sum of those two dictionaries -> we will not go inside of them and do unification on level of the specific fields. So, if we have [{ a :: Int, b :: String }, { a :: Boo, b :: Bool }], we want unification to result with [ { a :: Int, b :: String } | { a :: Bool, b :: Bool } ], not [ { a :: Int | Bool, b :: String | Bool } ].

Martinsos · 2022-02-22T09:52:30Z

More thoughts:

empty list -> we have a special "empty list" type, so we should consider that. Unifying it with a non-empty list should simply result in the type of non-empty list.
unification algorithm -> we can just put all the types together and do "unique" on them. But then we need to see if we can further reduce that type -> specifically, if there are any subtypes, those should be merged into their super types. For example if we have empty list and non-empty list, empty list we can just kick out. That is probably it, I don't think we have any other subtypes (in our case dictionaries are not sub-types since we are having a nominal typing system!), but we should still keep this in mind.

sodic · 2022-03-01T10:27:33Z

To cover the use case from the issue's description (e.g., supporting both pages and routes for to fields in route declarations):

route OldLoginRoute { path: "/signup", to: LoginRoute }
route LoginRoute { path: "/signup", to: LoginPage }

It should be enough to make changes inside checkStmnt and weaken functions, without needing to touch type inference and unification. This is because users will never be using the possibilities of sum types directly. More precisely, the user will either specify a page or a route (or cause an error by specifying something else), and Wasp then has to make sure whatever they used is assignable to type page | route

More generally, our main goal is to never show unification errors to the user. Instead, we'll always construct a sum type and handle potential errors in the weakening phase as "Expected type X, got type Y". Let's say a user specifies a list value as [1, "2", 3, 4]. The compiler currently says something like Cannot unify types Number and String which is a bit confusing to the user. We want it to say Expected to get a list of Numbers, but got a list of Numbers and Strings.

Unification errors can currently only happen in lists. When encountering a list, we should find the most common supertype of all elements and infer that. Wasp will later check the inferred type against the required type.

To make this change possible, it should be enough to change the function unifyTypes. Instead of throwing an error when finding two uncoercable types, it should return their reduced sum type. More precisely, there are three possible cases:

When unifyTypes receives two equivalent types T, it should return the type T
When unifyTypes receives two different types T and U, it should return their reduced sum type T | U
When unifyTypes receives a a list, it should fold it with unifyTypes.

sodic · 2022-03-01T12:35:42Z

While researching, I stumbled upon the discovery that we are actually implementing union types (not sum types). The difference comes from something we've touched on several times in an unrelated context. There are two kinds of unions:

Untagged unions (most often called "union types") - this is what TypeScript has (e.g., number | string) and what we want to add to Wasp
Tagged unions (most often called "sum types") - This is what Haskell has (e.g., data Either a b = Left a | Right b)

In essence, a sum type is a union type with explicit labels. Here are several helpful resources on the subject:

A Wikipedia article on tagged unions
An easy-to-read blog post
A Stack Overflow comment (but most answers and comments are worth reading

By the way, several answers/comments in the above SO thread give a great answer to the question "Why doesn't Haskell support untagged unions?" Spoiler: you were on the right track with the reasoning that there would be massive inferred unions all over the place, but there's more to it.

Martinsos · 2022-03-01T13:10:51Z

Thanks for sharing @sodic this is very interesting! Ok great, then let's call them union types!

Interesting regarding Haskell -> I read what you shared and besides certain technical reasons, I understood it mostly comes down to cost / benefit ratio. It would not bring enough to Haskell to be worth the complexity it would introduce. Which makes sense to me! In our case though, I think it could be a good cost / benefit ratio, what do you think?

sodic · 2022-03-01T13:48:40Z

Yes, it definitely makes sense for us. Haskell already has tagged unions, so they wouldn't be getting much out of adding untagged unions (i.e., tagged unions already cover most use cases).

Since Wasp doesn't yet support any kind of unions, our cost/benefit ratio is very favorable.

sodic · 2022-03-10T08:27:50Z

We won't "penetrate" into type constructors (e.g., dictionaries, tuples). For example:

// this is inferred as [{ to: Route } | { to: Page }]
[{ to: Route }, { to: Page}]

It should be assignable to its supertype [{ to: Route | Page }]. We will check this during the weakening phase.
TypeScript does the same thing (playground link):

type T = { to: number | boolean }[]

let x = [{to: 1}, { to: true }]

Martinsos · 2022-03-10T13:30:16Z

In this session we defined new unifyTypes function, and we started implementing makeReducedUnionType function, by using UnorderedSet!

Ones we make that work, next big step will be implementing an algorithm in weaken that will correctly weaken sum types.

Before that we will also probably want to take care of TODOs and write some tests.

There is also Evaluator -> we will need to add support for UnionTypes there at some point, and figure out how are they deduced from AppSpec.

Martinsos · 2022-03-15T16:09:07Z

We finished implementing makeReducedUnionType, we updated unify function to reflect the changes, and we updated places where unify is called.

Next step is updating the weaken function! It will be interesting figuring out what to do with Dict.

sodic · 2022-03-22T11:18:09Z

We started writing tests and should continue from inferExpr - Type checks a list of dictionaries that unify but have different types.

Summary:

We decided it was OK for now to unify lists of dictionaries as union types (as opposed to inferring optionals)
We decided that we should explicitly handle those cases in weaken because of the error messages
We thought about assigning types to expressions in the AST and concluded that we most likely don't want to lose type information by assigning inferred supertypes to child nodes. For example, when unifying a list, we only want to assign the supertype to the list node, we don't want to weaken the types of iits elements.

Martinsos · 2022-03-29T08:56:23Z

We fixed some of the tests!
Next steps:

Write tests for unifyTypes, and simplify tests for unify based on that.
Fix weaken -> also write tests while doing it and get other tests passing (they are now failing because of how weaken incorrectly works.
Take care of more prominent TODOs, first one being not weakening typed expressions while doing unify, and another is figuring out the best way to structure UnionType (and ensure order in it and some other stuff).

sodic · 2022-04-06T09:22:27Z

Today we implemented tests for unifyTypes.

Martinsos · 2022-04-12T08:38:46Z

Next time: continue writing test for weaken, and then let's implement weaken. We also have a lot of TODOs sprinkled around.

Martinsos · 2022-04-26T08:47:12Z

We wrote tests for weaken: some of them testing dealing with dictionaries optional field, some testing super types. We didn't write all of them, but we did define all of them, so next time we should continue from there, filling in the remaining tests!

sodic · 2022-05-03T08:54:41Z

Finished the tests for weaken, haven't started editing weaken yet: c5bcea3
Removed the function unify and changed a portion of inferExprType to prevent it from weakening list element types: 3bc592f

Next step:

Remove the tests for unify
Update the tests for inferExprType to ensure we don't weaken list element expression types
Refactor the weaken function. Make sure to apply the same non-weakening subexpression logic to weaken (if necessary, check line 257)

Martinsos · 2022-05-17T08:33:52Z

We removed unify and its tests.
We updated the tests for inferExprType to ensure we don't weaken list element expression types.
Next step is refactoring the weaken function! And then taking care of all the TODOs.

Martinsos · 2022-05-24T11:53:27Z

We started looking into weaken, to consider if we need it at all.

We traced where it is used, and found that its only place of usage is currently in checkStmt, where it is used on the typed expression of the declaration.

The question was, is this needed? Can we just skip this part and instead of doing the weakening, which updates the types in typed expression tree to match the wanted type, can we just check that typed expression is a sub-type of wanted declaration type and that is it, don't do any weakening? So effectively we would refactor weaken into isSuperType (or isSubType) and that would be it.

But, will that be a problem down the road then? Does Evaluator expect the typed expression in the declaration to match the declaration type, or can it just be its sub type?
We searched for declEvaluate and found that it says (in Wasp.Analyzer.TypeDefinitions.Class.IsDeclType) that it assumes the type of typed expression is indeed the same as declaration type, so that would mean that it can't be a subtype, instead it expects it to be exactly the same type.
That said, we weren't sure if that assumption is actually important in the Evaluator -> if nobody uses that assumption, we could try relaxing it to assuming it is a sub type, not the exactly same type.

Therefore we next looked into the Evaluator, specifically how evaluation is implemented -> does it use that assumption or not. One interesting file is Wasp.Analyzer.Evaluator.Evaluation.TypedExpr.Combinators -> the most basic evaluation logic is defined there. None of these seemed to care about the type vs subtype difference.

Then next we looked into Wasp.Analyzer.TypeDefinitions.TH.Decl, to see how are these combinators further composed. Interesting pieces of code there were data WaspKind, genWaspTypeAndEvaluationForHaskellType, genDictWaspTypeAndEvaluationForRecord, ... .
Here, it seemed as if type vs subtype thing might be an issue, but we didn't look deep enough into it to figure out how to go about it yet.
We should continue with his direction, figure out if we can modify evaluators so that they don't require that assumption of exact type match but instead be ok with subtypes, and if we can, then do so and drop weaken. If not, we just make sure that weaken works as it should. But we should be able to do so, I think.

What we did conclude though is that it might be a good idea to take a step back and start from the AppSpec side, implementing Union type there, via Either or via our Union or maybe even via any custom data that is sum type in Haskell. We are not yet sure though what is better, to use something like Either/Union, or to use custom data (which in that case will need to have data constructors that have a single type each and those types must all be distinct). Oh another tricky thing -> in any case, what if we define union type in AppSpec in such way that one of the types in union is a sub type of the other type in union? That is not a valid union then, right? We should detect that and not accept it.

To conclude, there are main two directions to chase:

Can we get rid of weaken and not weaken any typed expressions? Or at least weaken only the top level -> declarations? We should explore evaluators and try to adjust them to accomplish this.
Introducing "union" to the AppSpec, via Either/Union, and/or via custom data types that have correct shape, while also handling the possibility of "union" not being reduced in its definition in AppSpec which means we probably can't evaluate it.

sodic · 2022-06-07T08:38:38Z

We started working on a new branch (branched out from main): https://github.com/wasp-lang/wasp/tree/filip-martin-weaken. The main idea is to first simplify (or completely get rid of) weaken and its logic. That is what the new branch is for. It also gives us an isolated place to test out whether our simplification plan works with the evaluator (Martin explained most of our concerns in the previous comment).

After we simplify the type inference (by making sure we don't traverse the AST twice and lose type information by weakening subexpressions after unification), we'll create a new branch for adding union types.

Martinsos · 2023-02-14T09:58:49Z

We merged the PR that removes the weakening and instead introduces the explicit notion of subtypes! This will make this PR much simpler.

For this PR we discussed we should continue in the following way:

Check out latest main, where "weakening" PR got merged, and continue from it from scratch, picking up pieces of code from this PR.
Decide how are we describing union type in AppSpec: Either, Union, or custom sum data type. Make sure that it is fully reduced (check it).
The fact that we have isSubtypeOf should help us a lot. unifyTypes t1 t2 should come down to checking if t1 is subtype of t2 and vice versa, and if so, returns the super type of those two, otherwise it does union. This will take care of unifying unions, empty list, dicts, ... -> everything. Reduction of union type should also come down to checking for subtypes (quadratically?).

Martinsos added enhancement New feature or request haskell language labels Dec 4, 2021

Martinsos mentioned this issue Jan 8, 2022

[Analyzer] Enrich type unification errors with info about expected type #421

Open

Martinsos assigned Martinsos and sodic and unassigned Martinsos Feb 22, 2022

sodic linked a pull request Mar 22, 2022 that will close this issue

Filip martin union types #518

Draft

Martinsos changed the title ~~[Analyzer] Add sum types to Wasp~~ [Analyzer] Add sum (union) types to Wasp May 24, 2022

sodic mentioned this issue Feb 9, 2023

Bringing Prisma entities to the frontend #962

Merged

sodic changed the title ~~[Analyzer] Add sum (union) types to Wasp~~ [Analyzer] Add union types to Wasp Feb 14, 2023

Martinsos added status:backlog and removed status:backlog labels Jul 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Analyzer] Add union types to Wasp #381

[Analyzer] Add union types to Wasp #381

Martinsos commented Dec 4, 2021

Martinsos commented Dec 24, 2021

Martinsos commented Feb 22, 2022

Martinsos commented Feb 22, 2022 •

edited

sodic commented Mar 1, 2022 •

edited

sodic commented Mar 1, 2022

Martinsos commented Mar 1, 2022

sodic commented Mar 1, 2022 •

edited

sodic commented Mar 10, 2022 •

edited

Martinsos commented Mar 10, 2022

Martinsos commented Mar 15, 2022

sodic commented Mar 22, 2022 •

edited

Martinsos commented Mar 29, 2022

sodic commented Apr 6, 2022

Martinsos commented Apr 12, 2022

Martinsos commented Apr 26, 2022

sodic commented May 3, 2022 •

edited

Martinsos commented May 17, 2022

Martinsos commented May 24, 2022

sodic commented Jun 7, 2022

Martinsos commented Feb 14, 2023

[Analyzer] Add union types to Wasp #381

[Analyzer] Add union types to Wasp #381

Comments

Martinsos commented Dec 4, 2021

Martinsos commented Dec 24, 2021

Martinsos commented Feb 22, 2022

Martinsos commented Feb 22, 2022 • edited

sodic commented Mar 1, 2022 • edited

sodic commented Mar 1, 2022

Martinsos commented Mar 1, 2022

sodic commented Mar 1, 2022 • edited

sodic commented Mar 10, 2022 • edited

Martinsos commented Mar 10, 2022

Martinsos commented Mar 15, 2022

sodic commented Mar 22, 2022 • edited

Martinsos commented Mar 29, 2022

sodic commented Apr 6, 2022

Martinsos commented Apr 12, 2022

Martinsos commented Apr 26, 2022

sodic commented May 3, 2022 • edited

Martinsos commented May 17, 2022

Martinsos commented May 24, 2022

sodic commented Jun 7, 2022

Martinsos commented Feb 14, 2023

Martinsos commented Feb 22, 2022 •

edited

sodic commented Mar 1, 2022 •

edited

sodic commented Mar 1, 2022 •

edited

sodic commented Mar 10, 2022 •

edited

sodic commented Mar 22, 2022 •

edited

sodic commented May 3, 2022 •

edited