Forwarding email communication #1390

dcodeIO · 2020-04-16T17:25:16Z

I've received two mails about protobuf.js today, which is quite an uncommon thing to happen nowadays. Thought I forward these here so everyone, especially the maintainers who joined, is informed. Here is a summary:

One party is depending on feat: parsed options #1256 to land, and is asking if they can help to get it in.
Another party has made changes to code generation, in that it splits emitted files into one per service / message / enum to reduce overhead on the frontend. They are asking if there is interest in this, if they open sourced it?

Interestingly, I also had one email about bytebuffer.js:

Is there interest in adding aliases like getInt for readInt to make it work more like Java's ByteBuffer class?

The text was updated successfully, but these errors were encountered:

dcodeIO · 2020-04-16T17:43:18Z

On a more general note, a good path forward appears to be to merge #1356 soonish, and continue from there by applying the new release processes, versioning etc. that have been set up?

alexander-fenster · 2020-04-17T10:51:18Z

@dcodeIO Looking at #1356.

alexander-fenster · 2020-04-17T11:00:55Z

@dcodeIO #1356 is good to go! Then we can complete the release automation (Cc: @bcoe) and keep moving.

As for the list of things to work on, #1234 is also high in my personal priority list (since it will make it possible to use CLI tools in hermetic environments like bazel, and we do need it).

Thank you for forwarding the email request, I will do my best here :)

dcodeIO · 2020-04-17T15:29:43Z

Merged that PR meanwhile, thanks for fixing! Shall we make a 6.9.0 release with the new process next?

bcoe · 2020-04-17T16:39:56Z

Shall we make a 6.9.0 release with the new process next?

This sounds great to me, @alexander-fenster let me know if you need any help rolling out today?

bcoe · 2020-04-20T19:46:11Z

@dcodeIO @alexander-fenster we rolled out the first release using the new process this morning 🎉

Shall we follow up with the folks emailing you?

bcoe · 2020-04-20T19:47:57Z

catching up, sounds like we'll also need to work on reviewing and merging. #1256? perhaps we can let our new release bake for a bit, and then prioritize landing that fix?

cherryland · 2020-04-21T00:04:01Z

Another party has made changes to code generation, in that it splits emitted files into one per service / message / enum to reduce overhead on the frontend. They are asking if there is interest in this, if they open sourced it?

No. Strictly speaking, protobuf.js is a generator, and not a compiler.

Dead code elimination is a classical task for a compiler. And JavaScript is no exception to this rule. Closure Compiler e.g. parses JavaScript, analyzes it, removes dead code and rewrites and minimizes what's left. This is common best practice.

ok

alexander-fenster · 2020-04-21T08:31:55Z

@cherryland If I'm reading the original request correctly, it's just about splitting the code, not to do any optimization or dead code removal. If we run pbjs today, we get a huge .js file, and I can imagine why it could make sense to split it by services, messages, or anything else.

(as an example, this generated js file from one of our libraries weighs 1.1MB, and it's not the biggest one I saw)

cherryland · 2020-04-21T11:43:56Z

@alexander-fenster You are right brother, on JavaScript Island 🏝️ the traditional approach to reduce file size is mere code splitting and nothing else. This was acceptable when Netscape was a thing and no other tools existed. I understand that.

The output from protobuf.js is meant to be consumed as a library. A developer makes the necessary references to the output from protobuf.js and defines the chunks (or code splits) at a higher level. A JavaScript compiler goes ahead and strips the unused parts based upon the dependency graph of the chunk. And as a developer, you always have the option to build a second proto file.

taylorcode · 2020-04-22T23:14:21Z

@cherryland hi, my team at Dropbox added the no-bundle feature, and maintains a fork of protobufjs (https://github.com/dropbox/protobuf.js) for this and other reasons we have PRs open for.

I'm not sure if what @dcodeIO mentioned in bullet 2 is this feature or something else (sounds similar but might not actually be the same), but if it is I think you may have a misunderstanding. The no-bundle flag isn't about eliminating dead code, it's about eliminating duplicate code.

pbjs generates a single bundle of JS for all of the transitive .protos. Therefore if you run pbjs on proto A and then proto B, and both depend on C, then you'll end up with two copies of C's generated JS code -- a.js will contain it, as well as b.js. This is a problem if you need to load proto A and B on the same page for two reasons:

code bloat - you're loading C's code twice. If you need to load many protos and they share dependencies, you'll end up with an explosion of duplicate code.
orphaned object references - because of the way protobuf creates the roots data structure at runtime, every time C is loaded into memory it replaces the previous instance of C. Therefore you can potentially end up with references to different instances of C. Practically I'm not sure what issues this could cause but best to be avoided.

To work around this you could use pbjs to generate one bundle of JS bundle per page, but this isn't ideal for two reasons:

http caching - if two pages both need C's code, then ideally you have one copy of it that can be shared.
on-demand loading - if you only have one bundle then you can't load parts of it as needed.

You could also generate one bundle for all pages, but this does not scale.

The fact that pbjs bundles at all is a little weird. The protoc tool doesn't do this for any language, including JS. And even for web browsers this isn't ideal for the reasons stated. If bundling is needed, asset bundlers (e.g. rollup / webpack) should be used for this.

Example

my/protos/c.proto

package my_protos_c;

message C {
   string some_field = 1;
}

my/protos/a.proto

package my_protos_a;
import "my/protos/c.proto";

message A {
  my_proto_c.C nested_msg_c = 1;
}

my/protos/b.proto

package my_protos_b;
import "my/protos/c.proto";

message B {
   my_proto_c.C nested_msg_c = 1;
}

Strategy 1: run pbjs on each proto
pbjs --target static-module --out a.js a.proto
pbjs --target static-module --out b.js b.proto

import {my_protos_a} from 'my/protos/a';
import {my_protos_b} from 'my/protos/b';

page loads C's code twice

Strategy 2: generate one js bundle per page with pbjs
pbjs --target static-module --out page_1.js a.proto
pbjs --target static-module --out page_2.js b.proto

page 1

import {my_protos_a} from 'my/protos/page_1';

page 2

import {my_protos_b} from 'my/protos/page_2';

page 1 loads different copy of C than page 2

Strategy 3: generate one .js file for every .proto

pbjs --target static-module --path out/ --no-bundle a.proto
pbjs --target static-module --path out/ --no-bundle b.proto
pbjs --target static-module --path out/ --no-bundle c.proto

import {my_protos_a} from 'my/protos/a';
import {my_protos_b} from 'my/protos/b';

only 1 copy of C is loaded

page 1

import {my_protos_a} from 'my/protos/page_1';

page 2

import {my_protos_b} from 'my/protos/page_2';

page 1 loads same copy of C as page 2

joshvarcheesy · 2020-12-21T22:45:12Z

@dcodeIO Very large interest in the second bullet point. I just started to look into how to do this for my company.

taylorcode mentioned this issue Jul 28, 2020

Add cli option to disable bundling #1461

Closed

taylorcode mentioned this issue Jul 16, 2021

Add "bundle" cli option to make js bundling optional #1634

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Forwarding email communication #1390

Forwarding email communication #1390

dcodeIO commented Apr 16, 2020

dcodeIO commented Apr 16, 2020

alexander-fenster commented Apr 17, 2020

alexander-fenster commented Apr 17, 2020

dcodeIO commented Apr 17, 2020

bcoe commented Apr 17, 2020

bcoe commented Apr 20, 2020

bcoe commented Apr 20, 2020

cherryland commented Apr 21, 2020

alexander-fenster commented Apr 21, 2020

cherryland commented Apr 21, 2020

taylorcode commented Apr 22, 2020 •

edited

joshvarcheesy commented Dec 21, 2020

Forwarding email communication #1390

Forwarding email communication #1390

Comments

dcodeIO commented Apr 16, 2020

dcodeIO commented Apr 16, 2020

alexander-fenster commented Apr 17, 2020

alexander-fenster commented Apr 17, 2020

dcodeIO commented Apr 17, 2020

bcoe commented Apr 17, 2020

bcoe commented Apr 20, 2020

bcoe commented Apr 20, 2020

cherryland commented Apr 21, 2020

alexander-fenster commented Apr 21, 2020

cherryland commented Apr 21, 2020

taylorcode commented Apr 22, 2020 • edited

joshvarcheesy commented Dec 21, 2020

taylorcode commented Apr 22, 2020 •

edited