Backport adraffy/ens-normalize@1.10.1 to ethers@5 #4577

adraffy · 2024-02-04T01:29:53Z

Replaced ens-normalize/ directory with almost 1:1 port w/types
- replaced Set with {[x]: x} and membership with set[x] or x in set
- replaced Map with {}
- replaced class Emoji with extra wrapper
validate.ts runs ens_normalize() through tests.json → no errors

Simplified ens-normalize()-related calls in namehash.ts, functionality unchanged
Added max argument to dnsEncode()

ricmoo · 2024-03-19T21:46:30Z

packages/hash/src.ts/ens-normalize/lib.ts

+
+// given a list of codepoints
+// returns a list of lists, where emoji are a fully-qualified (as Array subclass)
+// eg. explode_cp("abc💩d") => [[61, 62, 63], Emoji[1F4A9, FE0F], [64]]


No non-ASCII7 characters are allowed in the source code (applies to v6 as well as v5) as they can cause issues in some editors, over various transports, etc. It also helps prevent "trojan" code.

ricmoo · 2024-03-19T22:04:09Z

A few quick notes I've noticed in a few places:

all imports that reference local files must include the extension, otherwise lots of things break; e.g. "import foo from "./foo.js" instead of "import foo from "./foo".
I avoid non-ASCII7 characters in source as this eliminates issues in many console-based editors and terminal emulators, as well as prevents any concerns regarding trojan code exploits, including in comments and jsdocs

There are a lot of changes that spook me every time I read over this PR. Is it possible to not include the nf component for this v5 change? Is there a way to mostly just update the base64-encoded data and add an extra function or so where necessary? Just to minimize the changes?

I also need to analyze the size with this change a bit more... Do you have any info on the size difference in the /packages/ethers/dist/ files?

adraffy · 2024-03-24T22:18:34Z

all imports that reference local files must include the extension, otherwise lots of things break; e.g. "import foo from "./foo.js" instead of "import foo from "./foo".

I can make this change and remove the emoji from the comments, I was just trying to satisfy whatever TSC was saying.

I 100% agree that the extension-less includes are insane. I have a hard time reasoning about TSC builds as everyone does something different.

There were significant changes from your early port of normalize to the final, eg. script (group) and confusable logic.

In general, I can't do much about the file size, as it's the only the solution to:
"you have a JS engine with unknown versions of Unicode, what do?"

With async import, which I assume is not allowed in v5, I could dynamically import significantly smaller norm libraries using runtime capability checks, but I'm unsure how to package code like that, when this is just a small library in a framework in someone else's project.

You don't need the NF part but considering this is a legacy build that's going to deployed on aging platforms, it's literally the use-case for a system-independent version of Unicode NF. The issue is that the % of names impacted by NF divergences is small. My goal was to make it so ens_normalize() works without any outside feature assumptions.

I can replace this with the system shim: https://github.com/adraffy/ens-normalize.js/blob/main/src/nf-native.js

The other changes just involve removing duplicated logic.

Names that are normalized can be split on "."
Names that are normalized round-trip picky UTF-8 coders (eg. Text{En|De}coder)
dnsEncode() should use 255 char limit as discussed previously

The rest of the code is exactly the same as my currently library except the arguments are typed to satisfy TSC.

…ecode.ts file, replace nf with system nf

adraffy · 2024-03-24T22:44:35Z

files should be ASCII7
fixed mangled decoder.ts --- I'm still unclear how my PR got screwed up
changed extensions to explicit
removed validate/tests.json
changed nf to system

just code

af25ca5

ricmoo added enhancement New feature or improvement. on-deck This Enhancement or Bug is currently being worked on. minor-bump Planned for the next minor version bump. v5 Issues regarding legacy-v5 next-patch Issues scheduled for the next arch release. labels Feb 28, 2024

adraffy added 3 commits March 7, 2024 23:29

missing namehash file

6f6f867

lets try that again

72c6f3b

add other missing files

502c3a0

ricmoo reviewed Mar 19, 2024

View reviewed changes

adraffy added 2 commits March 24, 2024 15:39

remove tests and validate, change extension to .js, replace mangled d…

71ef51b

…ecode.ts file, replace nf with system nf

remove unicode from comments and literals

d19f4ac

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backport adraffy/ens-normalize@1.10.1 to ethers@5 #4577

Backport adraffy/ens-normalize@1.10.1 to ethers@5 #4577

adraffy commented Feb 4, 2024 •

edited

ricmoo Mar 19, 2024

ricmoo commented Mar 19, 2024

adraffy commented Mar 24, 2024

adraffy commented Mar 24, 2024

Backport adraffy/ens-normalize@1.10.1 to ethers@5 #4577

Are you sure you want to change the base?

Backport adraffy/ens-normalize@1.10.1 to ethers@5 #4577

Conversation

adraffy commented Feb 4, 2024 • edited

ricmoo Mar 19, 2024

Choose a reason for hiding this comment

ricmoo commented Mar 19, 2024

adraffy commented Mar 24, 2024

adraffy commented Mar 24, 2024

adraffy commented Feb 4, 2024 •

edited