
In-place minification #638

Open
StoneCypher opened this issue Feb 2, 2020 · 3 comments

Comments


StoneCypher commented Feb 2, 2020

Issue type

  • Feature Request:
    • flag "dense output" that turns off the readable whitespace



Prerequisites

  • Can you reproduce the issue?:
    • yes
  • Did you search the repository issues?:
  • Did you check the forums?:
  • Did you perform a web search (google, yahoo, etc)?:
    • no need; i know what's wrong



Description

PEG parsers sometimes get really large because they're indented for human readability. I would like to add an option to change that.

My PEG grammar is about 34k. The parser it produces is about 1.7 meg. If you run that through Uglify3, it's about 101k, or a 94.2% reduction.

Thing is, I tried turning off symbol scrambling and relabelling, and it still dropped 93% in size. Almost all of that was just removing spaces.

I can't find a minifier that can do this in under 2 minutes. Uglify takes about 2:40 on GH actions. My builds are [node 12,13][win,mac,linux][ff,ie,ch].

That means every time I commit, I spend about 45 minutes ("spend" because CI minutes cost money) on minification alone.

I want to contribute a patch that fixes this. I don't know whether to work against 0.10 or 0.11.

The preview pane in my editor is pretty amusing: the code gets so deeply indented (wasted spaces) that it exits the preview panel entirely for several hundred lines.

[screenshot: generated parser source, indented so far that it runs off the right edge of the preview panel]

The thing is, if you switch the generator from "for speed" to "for size," it stops wasting all that space, but then it also compiles a pretty slow parser.

I'd like the fast parser's structure without the million wasted spaces (it's actually about 1.67 million wasted spaces in my current build).



Steps to Reproduce

  1. Compile a mid-sized grammar, such as this one
  2. Look at the output



Expected behavior:

  1. I expect to be able to turn off the 94% whitespace without having to create a slower parser
StoneCypher (Author) commented

I have a patch ready for this, but it's buggy.

I went to contribute it, only to find out that the 0.11 I wrote it against is being thrown away, along with all of peg.js, in favor of some other parser written in a different language.


Seb35 commented Apr 1, 2020

I tested a very simple thing: applying .replace( /^ */gm, "" ) to the generated JS code and then minifying it with terser. My PEG.js grammar is 46K; the generated JS code is 1.3M, or 452K after removing the initial spaces on each line (409K when removing all spaces), and 148K when minified (*). Minifying the 1.3M file took about 23.3 seconds (5 runs, from 22.9 s to 24.3 s), and minifying the 452K file took about 24.0 s (5 runs, from 23.4 s to 24.9 s); I/O or other factors probably explain the slightly varying times.

So you can try such a simple instruction on your grammar, but I'm afraid that removing spaces will not significantly change the minification time. Depending on your goals, you can almost instantly divide the size by 3 simply by removing the initial spaces. Otherwise, given your figure of 2 minutes for Uglify on a grammar of similar size to mine, terser seems to do better; you should try it on your grammar.

(*) I noticed that a few unused rules are completely dropped after minification; that's fine :)

StoneCypher (Author) commented

the problem is that'll interfere with things that actually have multiple spaces in their rules

my grammar is 34k and produces a 3.9 meg parser 🤣
