
Evaluation cache #1226 (Draft)

mmatera wants to merge 7 commits into base: master
Conversation

mmatera
Contributor

@mmatera mmatera commented Apr 4, 2021

Here is a proposal for implementing an expression cache. This should reduce the evaluation time for expressions that are evaluated many times. The idea is to keep a dictionary inside the Evaluation object whose keys are the hashes of the expressions. The dict stores both the input and the output expressions as values. The implementation also handles the caching of subexpressions.
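The idea described above could be sketched roughly as follows. This is a hypothetical illustration only, not the actual Mathics code: `Expression`, `Evaluation`, and `_do_evaluate` are stand-in names, and the real evaluation loop is far more involved.

```python
# Toy sketch of an evaluation cache: a dict inside the Evaluation object,
# keyed by expression hash, storing (input, output) pairs.
# All names here are illustrative stand-ins, not the Mathics classes.

class Expression:
    def __init__(self, head, *leaves):
        self.head = head
        self.leaves = leaves

    def __hash__(self):
        return hash((self.head, self.leaves))

    def __eq__(self, other):
        return (self.head, self.leaves) == (other.head, other.leaves)


class Evaluation:
    def __init__(self):
        self._cache = {}          # hash -> (input expression, output expression)
        self.cache_result = True  # builtins may set this False to opt out

    def evaluate(self, expr):
        key = hash(expr)
        hit = self._cache.get(key)
        # Compare the stored input too, since different expressions can
        # collide on the same hash value.
        if hit is not None and hit[0] == expr:
            return hit[1]
        self.cache_result = True
        result = self._do_evaluate(expr)   # placeholder for the real rewriting work
        if self.cache_result:
            self._cache[key] = (expr, result)
        return result

    def _do_evaluate(self, expr):
        # Stand-in: the real evaluator would apply rules until a fixed point.
        return expr
```

Subexpression caching would follow the same pattern recursively: before evaluating each leaf, look it up in the same dictionary.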

@rocky
Member

rocky commented Apr 4, 2021

@mmatera:

There is something I broke in the scanner which I think will cause all tests to fail with something like:

self = <test.test_parser.test_parser.GeneralTests testMethod=testFunction>

    def testFunction(self):
        self.check("x &", Node("Function", Symbol("x")))
>       self.check("x \\[Function] y", "Function[x, y]")

test/test_parser/test_parser.py:240: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
test/test_parser/test_parser.py:29: in check
    expr1 = self.parse(expr1)
test/test_parser/test_parser.py:25: in parse
    return self.parser.parse(SingleLineFeeder(s))
mathics/core/parser/parser.py:66: in parse
    return self.parse_e()
mathics/core/parser/parser.py:103: in parse_e
    result.append(self.parse_exp(0))
mathics/core/parser/parser.py:133: in parse_exp
    child = self.parse_exp(q + 1)
mathics/core/parser/parser.py:112: in parse_exp
    result = self.parse_p()
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 

self = <mathics.core.parser.parser.Parser object at 0x12e9fe710>

    def parse_p(self):
        token = self.next_noend()
        tag = token.tag
        method = getattr(self, "p_" + tag, None)
        if method is not None:
            return method(token)
        elif tag in prefix_ops:
            self.consume()
            q = prefix_ops[tag]
            child = self.parse_exp(q)
            return Node(tag, child)
        else:
            self.tokeniser.sntx_message(token.pos)
>           raise InvalidSyntaxError()
E           mathics_scanner.errors.InvalidSyntaxError

mathics/core/parser/parser.py:156: InvalidSyntaxError
=========================== short test summary info ============================
FAILED test/test_parser/test_parser.py::GeneralTests::testFunction - mathics_...
============= 1 failed, 253 passed, 8 xfailed in 115.78s (0:01:55) =============

I am looking at this now. One short-term solution is to ignore this test here. In CI I suspect we could also go back to an older version of the scanner, but this is tricky in that there was a bug in adding \[IndentingNewline] and handling it properly in mathicsscript.

@rocky
Member

rocky commented Apr 4, 2021


@mmatera For now I have pinned Mathics_Scanner at 1.0.0 in master, which should work for now. When that is in, you might have to rebase or merge your PR.

Member

@rocky rocky left a comment


Before undertaking such a pervasive change, it would be interesting to get a rough idea of how much improvement we are going to get.

For example, maybe come up with some scenarios based on specific examples and show how much time we gain using this.

@mmatera
Contributor Author

mmatera commented Apr 5, 2021

Before undertaking such a pervasive change, it would be interesting to get a rough idea of how much improvement we are going to get.

For example, maybe come up with some scenarios based on specific examples and show how much time we gain using this.

This is an experiment: I know that WMA implements something similar, and the reason is that it helps to avoid keeping many identical Expression objects, as well as reducing the time spent applying replacement rules to the same expression. It could also help to reduce the time spent comparing identical objects.

My implementation is incomplete (apart from the obvious problem that it is not working) because I haven't thought about a mechanism to decide when to remove elements from the cache, so even if this passes the tests, it shouldn't be merged.

@rocky
Member

rocky commented Apr 5, 2021

Before undertaking such a pervasive change, it would be interesting to get a rough idea of how much improvement we are going to get.
For example, maybe come up with some scenarios based on specific examples and show how much time we gain using this.

This is an experiment: I know that WMA implements something similar, and the reason is that it helps to avoid keeping many identical Expression objects, as well as reducing the time spent applying replacement rules to the same expression. It could also help to reduce the time spent comparing identical objects.

My implementation is incomplete (apart from the obvious problem that it is not working) because I haven't thought about a mechanism to decide when to remove elements from the cache, so even if this passes the tests, it shouldn't be merged.

Ok - thanks for the explanations and clarifications.

@mmatera
Contributor Author

mmatera commented Apr 5, 2021

This is a result of one possible experiment:

In[1]:= Timing[N[Pi,100];]
Out[1]= {0.013185, Null}

In[2]:= Timing[N[Pi,100];]
Out[2]= {0.000101354, Null}

In[3]:= Timing[Table[N[Pi^k,100],{10}];]
Out[3]= {0.0105746, Null}

In[4]:= Timing[Table[N[Pi^k,100],{10}];]
Out[4]= {0.000143036, Null}

@rocky
Member

rocky commented Apr 5, 2021

This is a result of one possible experiment:

In[1]:= Timing[N[Pi,100];]
Out[1]= {0.013185, Null}

In[2]:= Timing[N[Pi,100];]
Out[2]= {0.000101354, Null}

In[3]:= Timing[Table[N[Pi^k,100],{10}];]
Out[3]= {0.0105746, Null}

In[4]:= Timing[Table[N[Pi^k,100],{10}];]
Out[4]= {0.000143036, Null}

Very nice.

The concept is good. We should huddle over the implementation strategies though.

Also, I should mention that next weekend I will start looking at comparisons and try to come up with a slightly different organization of the code, keeping the essential logic inside the comparisons the same.

@rocky rocky force-pushed the master branch 2 times, most recently from b007e8c to ef16eb6 Compare May 19, 2021 22:40
@rocky rocky force-pushed the master branch 5 times, most recently from 8367d69 to 83bb068 Compare June 7, 2021 21:01
@rocky rocky force-pushed the master branch 2 times, most recently from 9570fdd to 499f1bf Compare June 26, 2021 14:12
@TiagoCavalcante
Contributor

TiagoCavalcante commented Jun 29, 2021

@rocky and @mmatera what still needs to be done here? If it still needs work, I'd be interested in helping.

@mmatera
Contributor Author

mmatera commented Jun 29, 2021

I can try to do the rebase. In any case, this was an experiment.

@TiagoCavalcante
Contributor

I can try to do the rebase.

Ok.

In any case, this was an experiment.

So a very performant experiment!

@rocky
Member

rocky commented Jun 29, 2021

I'd like to look at the whole performance issue in detail sometime. The last time I looked at this, it felt more like a shot in the dark than looking at what's slow and attacking that.

@mmatera
Contributor Author

mmatera commented Jun 29, 2021

I'd like to look at the whole performance issue in detail sometime. The last time I looked at this, it felt more like a shot in the dark than looking at what's slow and attacking that.

I think the problem is that there is not a single performance issue, but several. This PR tries to implement something that exists in WMA: the expression cache. If you want to evaluate the same complicated expression many times, and it always produces the same result, the best option is to store the result and reuse it.
This does not resolve another huge performance issue: expression matching is quite slow. But OK, let's keep taking baby steps...

@rocky
Member

rocky commented Jun 29, 2021

It's not that I don't think an expression cache will help; in fact, I do. However, the implementation I saw previously might have been a bit lacking. Ideally we'd come up with a bunch of ideas at a high level, kick those around, and then drill down from there.

Although I do many things incrementally, some things shouldn't be done as incremental improvements: come up with some kind of cache, see if you can improve it a little, and so on.

Up front, various performance measurements and estimates should be done.

@@ -64,7 +64,7 @@ def get_symbol_list(list, error_callback):
class _SetOperator(object):
def assign_elementary(self, lhs, rhs, evaluation, tags=None, upset=False):
# TODO: This function should be splitted and simplified

evaluation.cache_result = False
Member


This line of code appears many, many times. What's the rule, and the reason behind when to do it and when not to?

I wonder if there's a better way.

Contributor Author


Again, this is just an experiment to see how we can implement the Expression cache.

evaluation.cache_result = False

means that the result of this expression should not be stored in the cache. For example, the results of control expressions, loops, and file handling should not be cached.
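As a hedged sketch of this opt-out convention (the names below are illustrative stand-ins, not the actual Mathics code): a builtin whose result is not reproducible, such as a random-number generator, would clear the flag before returning, and the evaluator would then skip the cache store.

```python
# Illustrative sketch: a non-deterministic builtin opts out of caching by
# clearing cache_result on the evaluation object. Names are hypothetical.
import random


class Evaluation:
    def __init__(self):
        self.cache_result = True


def apply_random_real(evaluation):
    # The result differs on every call, so caching it would be wrong.
    evaluation.cache_result = False
    return random.random()


ev = Evaluation()
x = apply_random_real(ev)
# The evaluator would now see cache_result == False and skip storing x.
```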

Member


Ok - thanks for the information. I got the gist of that aspect.

Generally one thinks of a cache as just working, not as something where you have to make 30 hand decisions about what goes in and what doesn't.

That's what I mean by "I wonder if there is a better way": for instance, using someone else's mechanism, like an LRUCache. (Hmm, where have I heard this idea before: use someone else's package rather than rolling your own?)

Since this is an experiment, let's mark it as a draft for now.

Disclaimer: for now I am rewriting/revising the doc system. Although I would love to ditch it, I also want the docs to get tagged and organized better, and using a more standard doc system is too big a lift for me right now.
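The LRU idea could look roughly like this: a toy sketch built on the standard library's `OrderedDict` rather than on any particular package. The class name and capacity are invented for illustration; it only requires that expression objects be hashable and comparable, which addresses part of the "works with expressions" concern raised below.

```python
# Toy LRU cache keyed directly by hashable expression objects.
# Built on collections.OrderedDict; names and capacity are illustrative.
from collections import OrderedDict


class ExprLRUCache:
    def __init__(self, maxsize=1024):
        self.maxsize = maxsize
        self._data = OrderedDict()

    def get(self, expr):
        try:
            value = self._data[expr]
        except KeyError:
            return None
        self._data.move_to_end(expr)   # mark as most recently used
        return value

    def put(self, expr, result):
        self._data[expr] = result
        self._data.move_to_end(expr)
        if len(self._data) > self.maxsize:
            self._data.popitem(last=False)   # evict least recently used
```

Eviction by recency would also answer the open question above about when to remove elements from the cache.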

Contributor Author


That's what I mean by "I wonder if there is a better way": for instance, using someone else's mechanism, like an LRUCache. (Hmm, where have I heard this idea before: use someone else's package rather than rolling your own?)

OK, I totally agree: if there is something already implemented, let's use it. The problem is that I didn't find anything like LRUCache that works with expressions. In any case, right now I do not have much time to work on this; I'll just rebase it in case we want to come back to it in the future.

Since this is an experiment, let's mark it as a draft for now.

Disclaimer: for now I am rewriting/revising the doc system. Although I would love to ditch it, I also want the docs to get tagged and organized better, and using a more standard doc system is too big a lift for me right now.

I think this is more urgent: having clear code and documentation will help us get more help...

Member


That's what I mean by "I wonder if there is a better way": for instance, using someone else's mechanism, like an LRUCache. (Hmm, where have I heard this idea before: use someone else's package rather than rolling your own?)

OK, I totally agree: if there is something already implemented, let's use it. The problem is that I didn't find anything like LRUCache that works with expressions. In any case, right now I do not have much time to work on this; I'll just rebase it in case we want to come back to it in the future.

Since this is an experiment, let's mark it as a draft for now.
Disclaimer: for now I am rewriting/revising the doc system. Although I would love to ditch it, I also want the docs to get tagged and organized better, and using a more standard doc system is too big a lift for me right now.

I think this is more urgent: having clear code and documentation will help us get more help...

I am working on it, and I hope to have the code for Django in place this week. Basically, I'm continuing what I started before on the Mathics side, where we are grouping builtins, eventually according to how they appear in Mathematica 5 or are listed as "Guides" in the online docs. So far it has been pleasing to see.

After the code to see this on Django is done, on the Core side we can move more stuff around and make it more closely conform to the Mathematica organization.

@TiagoCavalcante
Contributor

@rocky and @mmatera what do you think should be done next for performance? You already mentioned that expression matching is slow, but what can be done to increase its performance?

@rocky
Member

rocky commented Jun 29, 2021

@rocky and @mmatera what do you think should be done next for performance?

Get performance benchmarks, pinpoint exactly where things are slow, and figure out why. You can use this draft to see how far it goes in speeding things up.

As an example, build this with Pyston and run the unit tests. I got a 30% speedup, which is what the Pyston folks said to expect.
Try running it on the doctests. Is the speedup the same? Try using this draft to see what the speedup is.

What is the speedup using PyPy 3.7?

But please, if you can't do this on your own, leave it. There are many, many other things you can do totally on your own.
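For the benchmarking step suggested above, one standard-library approach is to profile a representative run with cProfile and sort by cumulative time. This is a generic sketch: `workload` is a stand-in for running Mathics evaluations or doctests, not an actual Mathics entry point.

```python
# Profile a stand-in workload with cProfile and report the top hot spots
# by cumulative time. Replace workload() with the real code under test.
import cProfile
import io
import pstats


def workload():
    # Stand-in for e.g. evaluating a batch of expressions or doctests.
    return sum(i * i for i in range(100_000))


profiler = cProfile.Profile()
profiler.enable()
workload()
profiler.disable()

stream = io.StringIO()
stats = pstats.Stats(profiler, stream=stream).sort_stats("cumulative")
stats.print_stats(10)          # top 10 entries by cumulative time
report = stream.getvalue()     # hot spots to attack first
```

Comparing such reports with and without this draft applied would show directly where the cache pays off.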

@rocky rocky marked this pull request as draft June 29, 2021 16:43
@mmatera mmatera force-pushed the cachexpr branch 2 times, most recently from 934273c to ada387b Compare June 29, 2021 22:14