Validated newtypes #1454

denisrosca · 2024-03-19T12:11:53Z

Add an option to render constrained newtypes over Smithy primitives as "validated" newtypes.

Closes #966

PR Checklist (not all items are relevant to all PRs)

Added unit-tests (for runtime code)
Added bootstrapped code + smoke tests (when the rendering logic is modified)
Added build-plugins integration tests (when reflection loading is required at codegen-time)
Added alloy compliance tests (when simpleRestJson protocol behaviour is expanded/updated)
Updated dynamic module to match generated-code behaviour
Added documentation
Updated changelog

Baccata · 2024-03-19T13:16:40Z

Hey @denisrosca , thanks for looking into this.

However : validated newtypes should not be rendered by default, for the simple reason that it'd break a lot of user code and that's not really acceptable now that smithy4s has received a reasonable amount of adoption. So it should be an opt-in, controlled by means of smithy metadata (with a key prefixed withsmithy4s)

denisrosca · 2024-03-19T13:28:21Z

Hey @denisrosca , thanks for looking into this.

However : validated newtypes should not be rendered by default, for the simple reason that it'd break a lot of user code and that's not really acceptable now that smithy4s has received a reasonable amount of adoption. So it should be an opt-in, controlled by means of smithy metadata (with a key prefixed withsmithy4s)

Working on that 👍
Just wanted to create the PR as early as possible to get some 👀 on the newtype encoding.

denisrosca · 2024-03-25T17:26:14Z

@Baccata @kubukoz
I've hit a bit of a problem after adding the metadata conditional.

The main issue is that now the same smithy source file can be rendered in different ways depending if the metadata flag is set to true or false.

Consider this smithy definition:

namespace newtypes.validated

use smithy4s.meta#unwrap
use alloy#simpleRestJson

@length(min: 1, max: 10)
string ValidatedCity

@length(min: 1, max: 10)
string ValidatedName

@unwrap
@length(min: 1, max: 10)
string ValidatedCountry

structure Person {
  @httpLabel
  @required
  name: ValidatedName

  @httpQuery("town")
  town: ValidatedCity

  @httpQuery("country")
  country: ValidatedCountry
}

which gets rendered to:

final case class Person(name: ValidatedName, town: Option[ValidatedCity] = None, country: Option[String] = None)

object Person extends ShapeTag.Companion[Person] {
  val id: ShapeId = ShapeId("newtypes.validated", "Person")

  val hints: Hints = Hints.empty

  implicit val schema: Schema[Person] = struct(
    ValidatedName.schema.required[Person]("name", _.name).addHints(smithy.api.HttpLabel()),
    ValidatedCity.schema.optional[Person]("town", _.town).addHints(smithy.api.HttpQuery.unsafeApply("town")),
    ValidatedCountry.underlyingSchema.optional[Person]("country", _.country).addHints(smithy.api.HttpQuery.unsafeApply("country")),
  ){
    Person.apply
  }.withId(id).addHints(hints)
}

This fails to compile because smithy.api#httpQuery is defined like this:

@length(min: 1)
string httpQuery

which when converted from smithy to IR is interpreted as a validated newtype. As a result the hint in Person.schema is rendered via a call to smithy.api.HttpQuery.unsafeApply which leads to a compilation error:

[info] [error] /private/var/folders/_z/c52c1q4n0fj6617nkht2pgcr0000gn/T/sbt_59b16f00/target/scala-2.13/src_managed/main/scala/newtypes/validated/Person.scala:18:89: value unsafeApply is not a member of object smithy.api.HttpQuery
[info] [error]     ValidatedCity.schema.optional[Person]("town", _.town).addHints(smithy.api.HttpQuery.unsafeApply("town")),

As a workaround I'm excluding the smithy.api namespace from the validated newtypes feature, but that doesn't seem like a bullet proof solution.

kubukoz · 2024-03-25T21:25:06Z

I see. This is probably not specific to smithy.api, but a symptom of a more general problem: we should make sure that we refer to newtypes in the same way they were generated, even across codegen runs (e.g. smithy4s runs its own, then you have a library on top, then you have the user code).

If the default is "false" (unvalidated newtypes), then you can only go from false (nothing) to true (by setting the metadata key). Perhaps keeping the "previous value" in smithy4sGenerated metadata would be the way?

Shapes from a known previously-generated namespace would be referred to in the style they were generated, and the shapes in the currently-being-generated compilation unit would use the metadata key.

Regardless of whether we do it this way or another, I think smithy.api could get the validated treatment from day 1.

kubukoz · 2024-03-25T21:27:39Z

modules/core/src-3/NewtypeValidated.scala

+
+  def apply(a: A): Either[String, Type]
+
+  def unsafeApply(a: A): Type = a


question: (for discussion): should we make this throw in case of invalid values? We could also have both variants available, although the "just wrap, not questions asked" one should be so obscure that nobody will accidentally use it.

Baccata · 2024-03-26T14:07:23Z

If anything, this is yet another argument for rendering dynamic bindings instead of static ones. As much as I hate to say it, the arguments in favour of dynamic bindings are starting to largely outweigh the ones in favour of static binding.

Regardless of whether we do it this way or another, I think smithy.api could get the validated treatment from day 1.

I disagree (somewhat heavily). I appreciate that validated newtypes may be a desirable feature for users, but it's also not something I want to deal when writing unit tests in smithy4s that involve hints, and arguably the maintainers of smithy4s are the ones who are most expose to the shapes coming from the "smithy.api" namespace ... unless you could macro-ify the validation when dealing with literals.

Putting aside the "should we just give up and render dynamic bindings", what we could do is create a meta-trait that would trigger the render of newtypes as validated. The metadata flag would indicate that a model preprocessor should be run to add the meta trait to all relevant shapes. The meta-trait would be stored in the resource of published artifacts, alongside the generated code, and could be inspected by downstream code-gen runs (unlike smithy4s-prefixed metadata, which get eluded).

kubukoz · 2024-03-26T16:05:04Z

it's also not something I want to deal when writing unit tests in smithy4s that involve hints

We discussed this briefly with Denis, I suggested to try and see how much stuff breaks: if it's more than a handful of locations let's keep them as-is, but if it's just a few I'd say unsafeApply is perfectly fine. Let's be pragmatic ;)

The metadata flag would indicate that a model preprocessor should be run to add the meta trait to all relevant shapes

hmm, so that way users could still apply the meta-trait on a case-by-case basis even if metadata is false / undefined. That'd be cool, I think.

The meta-trait would be stored in the resource of published artifacts, alongside the generated code, and could be inspected by downstream code-gen runs (unlike smithy4s-prefixed metadata, which get eluded).

Not sure what you mean here - I was thinking of using the smithy4sGenerated metadata, which does get inspected by downstream codegen runs.

Baccata · 2024-03-29T11:03:22Z

Not sure what you mean here - I was thinking of using the smithy4sGenerated metadata, which does get inspected by downstream codegen runs.

Well, thing is in order to use the generated metadata, you'd basically have to store a Map[ShapeId, Something], which is somewhat redundant where you could have a @something trait instead, which you could inspect from the shapes itself without having to thread the map where it needs to be inspected. It's arguably more idiomatic to smithy to use traits to store metadata related to specific shapes.

but if it's just a few I'd say unsafeApply is perfectly fine

To be transparent, my concern also has to do with the following aspects :

binary compatibility. If you want smithy.api and alloy shapes to be validated newtypes, it'd unavoidably means moving this PR to 0.19, as it'll be breaking bincompat of generated traits.
convenience of using hints in annotations. This concern is tied to my prototype https://github.com/Baccata/smithy4s-deriving. The inconvenience of validated types in hints would then be felt by end users.

kubukoz · 2024-03-29T11:58:43Z

moving this PR to 0.19

It's already on 0.19 :)

Baccata · 2024-03-29T12:10:55Z

My bad, haha

…wtypes

denisrosca · 2024-04-09T13:32:07Z

Putting aside the "should we just give up and render dynamic bindings", what we could do is create a meta-trait that would trigger the render of newtypes as validated. The metadata flag would indicate that a model preprocessor should be run to add the meta trait to all relevant shapes. The meta-trait would be stored in the resource of published artifacts, alongside the generated code, and could be inspected by downstream code-gen runs (unlike smithy4s-prefixed metadata, which get eluded).

I'm currently exploring this option and have a question about how the metadata flag would be passed to the generator.

I have a ValidatedNewtypesTransformer model transformer that, when enabled, adds a new @validateNewtypeConstraints (name is still up for debate) trait to the relevant shapes.

My initial attempt was to basically do the same thing we do for smithy4sRenderOptics (i.e. have a setting in the build plugin that gets written to generated-metadata.smithy), but that presents a bit of a problem.
The issue is that the decision if ValidatedNewtypesTransformer should be executed needs to happen before loading the smithy model, but that information is in generated-metadata.smithy which is read by the model loader.

What do you think about making this setting a full-blown generator argument instead of passing it via the generated metadata file?

Baccata · 2024-04-09T15:00:13Z

What do you think about making this setting a full-blown generator argument instead of passing it via the generated metadata file?

You make a good point, it'd be silly to load the model twice. However, I'd like to avoid multiplying codegen arguments which need to be forwarded to 3 different "front-ends" (SBT/Mill/cli), in favour of allowing some level of configuration via the reading of a smithy-build.json file. Interestingly, there was a discussion this week about this.

kubukoz · 2024-04-09T20:07:10Z

@Baccata I had a brief discussion with Denis, we're thinking to "just":

make the transformer run unconditionally (like e.g. AwsConstraintsRemover), but
make it check the metadata key before applying any changes

That should resolve the double-loading problem, and the configuration continues to live in the Smithy files. Did we get this right?

Baccata · 2024-04-09T20:34:17Z

That's sensible, actually !

Baccata · 2024-04-11T08:47:17Z

modules/core/src-2/NewtypeValidated.scala

+  protected val validators: List[A => Either[String, A]]
+
+  protected def validateInternal[C](c: C)(value: A)(implicit
+      ev: RefinementProvider.Simple[C, A]
+  ): Either[String, A] =
+    ev.make(c).apply(value)


I'm not liking this. For starters, the validateInternal re-instantiates the refinement on a per-element basis. Also, the fact that a validation may be composed of several validators is an implementation detail that should not appear in the interface of NewTypeValidated

You could create a smithy4s.Validator interface that Refinement would extend, and enrich it with a composition method (andThen).

The renderer would be responsible for rendering something that could look like

val validator : Validator[A] = Validator.of[A].validating(C1).and(C2).and(C3).build() def apply(a: A) : Either[String, Type] = { validator(a) }

This is done now, could you please take another look?
Let me know if matches what you had in mind.

sampleSpecs/validated-newtype.smithy

kubukoz · 2024-04-19T14:36:33Z

modules/core/src/smithy4s/Validator.scala

+package smithy4s
+
+trait Validator[A, B] {
+  def validate(value: A): Either[String, B]


question: given that a validator can potentially include multiple refinements, should we support EitherNel / Either[::[String] (due to the lack of a better NEL in the stdlib) from day 1, with error accumulation? A fail-fast alternative can also be given (with the current shape), for convenience.

I thought about this, but decided against it initially due to a lack of a NonEmptyList. Forcing the caller to deal with a result of Left(Nil) felt kinda awkward, even though it would technically never happen.

modules/core/src/smithy4s/Validator.scala

kubukoz · 2024-04-19T14:40:35Z

modules/core/src/smithy4s/Validator.scala

issue: commenting on a random spot, but we need a CHANGELOG entry. For Dynamic, I don't think we need to do anything but correct me if I'm wrong please @Baccata

Added an entry to the CHANGELOG.

modules/codegen/src/smithy4s/codegen/internals/CodegenImpl.scala

kubukoz · 2024-04-19T14:44:54Z

modules/codegen-plugin/src/smithy4s/codegen/Smithy4sCodegenPlugin.scala

+            case (
+                  changed,
+                  (
+                    wildcardArg,
+                    shouldGenerateOptics,
+                    shouldRenderValidatedNewtypes
+                  )
+                ) =>
+              val lastOutput = Tracked.lastOutput[Boolean, Seq[File]](
+                cacheFactory.make("smithy4sGeneratedSmithyFilesOutput")
+              ) { case (changed, prevResult) =>
+                if (changed || prevResult.isEmpty) {
+                  val file =
+                    (config / smithy4sGeneratedSmithyMetadataFile).value
+                  IO.write(
+                    file,
+                    s"""$$version: "2"
+                       |metadata smithy4sWildcardArgument = "$wildcardArg"
+                       |metadata smithy4sRenderOptics = $shouldGenerateOptics
+                       |metadata smithy4sRenderValidatedNewtypes = $shouldRenderValidatedNewtypes
+                       |""".stripMargin
+                  )
+                  Seq(file)
+                } else {
+                  prevResult.get
+                }
+              }
+              lastOutput(changed)


suggestion: I'd extract some of this to avoid the current nesting level, which is becoming a bit unwieldy

Co-authored-by: Jakub Kozłowski <kubukoz@gmail.com>

denisrosca added 3 commits March 19, 2024 13:43

Add NewtypeValidated

dffd10d

Fix formatting

c477541

Add simple renderer tests

468d54c

denisrosca added 4 commits March 19, 2024 15:37

Fix validated newtype usage

c600b1f

Render validated newtypes only if smithy4sRenderValidatedNewtypes=true

ba9b81b

Fix formatting again

301ed39

Don't validate types from smithy.api namespace

cf9ec8e

Minor fixup of sbt plugin test

de82030

kubukoz reviewed Mar 25, 2024

View reviewed changes

denisrosca added 4 commits April 8, 2024 16:14

Add validated newtypes usage to generated metadata

cd48f63

Merge remote-tracking branch 'upstream/series/0.19' into validated-ne…

89f3911

…wtypes

Fix newline

043c859

Fix missing license header

be7b3e1

denisrosca added 4 commits April 10, 2024 16:30

Add ValidatedNewtypesTransformer and switch to meta trait

7ec86d6

Add default value to bootstrapped sample

e6db5e1

Fix renderer spec

4f7b313

Add transformer spec

54762fe

Fix formatting

c00ed39

Baccata reviewed Apr 11, 2024

View reviewed changes

sampleSpecs/validated-newtype.smithy Outdated Show resolved Hide resolved

denisrosca added 6 commits April 18, 2024 13:10

Add Validator interface and refactor constraint validation

5ff681c

Rename trait

5f27413

Fix scala3 encoding

382b96a

Fix formatting, again

a09a568

Add doc entry

9fe555a

Make mdoc compile only

e378c0a

denisrosca marked this pull request as ready for review April 18, 2024 14:03

kubukoz reviewed Apr 19, 2024

View reviewed changes

kubukoz mentioned this pull request Apr 22, 2024

Bincompat-friendly codegen #1500

Open

denisrosca and others added 4 commits April 24, 2024 23:22

Format mandatory transformers

1e2b2ea

Co-authored-by: Jakub Kozłowski <kubukoz@gmail.com>

Make Validator trait sealed

c98a563

Add changelog entry

e7ca985

Change unsafeApply to throw exceptions

d38247d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validated newtypes #1454

Validated newtypes #1454

denisrosca commented Mar 19, 2024 •

edited

Baccata commented Mar 19, 2024

denisrosca commented Mar 19, 2024

denisrosca commented Mar 25, 2024 •

edited

kubukoz commented Mar 25, 2024 •

edited

kubukoz Mar 25, 2024

Baccata commented Mar 26, 2024

kubukoz commented Mar 26, 2024

Baccata commented Mar 29, 2024

kubukoz commented Mar 29, 2024

Baccata commented Mar 29, 2024

denisrosca commented Apr 9, 2024 •

edited

Baccata commented Apr 9, 2024 •

edited

kubukoz commented Apr 9, 2024

Baccata commented Apr 9, 2024

Baccata Apr 11, 2024

denisrosca Apr 18, 2024 •

edited

kubukoz Apr 19, 2024

denisrosca Apr 24, 2024

kubukoz Apr 19, 2024

denisrosca Apr 26, 2024

kubukoz Apr 19, 2024


		def apply(a: A): Either[String, Type]

		def unsafeApply(a: A): Type = a

Validated newtypes #1454

Are you sure you want to change the base?

Validated newtypes #1454

Conversation

denisrosca commented Mar 19, 2024 • edited

PR Checklist (not all items are relevant to all PRs)

Baccata commented Mar 19, 2024

denisrosca commented Mar 19, 2024

denisrosca commented Mar 25, 2024 • edited

kubukoz commented Mar 25, 2024 • edited

kubukoz Mar 25, 2024

Choose a reason for hiding this comment

Baccata commented Mar 26, 2024

kubukoz commented Mar 26, 2024

Baccata commented Mar 29, 2024

kubukoz commented Mar 29, 2024

Baccata commented Mar 29, 2024

denisrosca commented Apr 9, 2024 • edited

Baccata commented Apr 9, 2024 • edited

kubukoz commented Apr 9, 2024

Baccata commented Apr 9, 2024

Baccata Apr 11, 2024

Choose a reason for hiding this comment

denisrosca Apr 18, 2024 • edited

Choose a reason for hiding this comment

kubukoz Apr 19, 2024

Choose a reason for hiding this comment

denisrosca Apr 24, 2024

Choose a reason for hiding this comment

kubukoz Apr 19, 2024

Choose a reason for hiding this comment

denisrosca Apr 26, 2024

Choose a reason for hiding this comment

kubukoz Apr 19, 2024

Choose a reason for hiding this comment

denisrosca commented Mar 19, 2024 •

edited

denisrosca commented Mar 25, 2024 •

edited

kubukoz commented Mar 25, 2024 •

edited

denisrosca commented Apr 9, 2024 •

edited

Baccata commented Apr 9, 2024 •

edited

denisrosca Apr 18, 2024 •

edited