
Step 1 - Transpiler #595

Merged 726 commits into genn_5 on Jan 9, 2024

Conversation

@neworderofjamie (Contributor) commented Aug 15, 2023

Transpiler

This is the biggest change in GeNN 5. Previous user code had some 'preprocessor-level' transformations applied and was passed straight to the CUDA/C++ compiler. However, this has a lot of issues:

  • Really bad user experience: syntax errors in your code result in compiler errors deep in some generated code
  • It was not well defined which bits of C/C++ worked in user code and which might work but are inadvisable, e.g. result in really non-deterministic behaviour on SIMT backends
  • User code variables could clash with variables used in code generated by GeNN
  • Made GeNN unable to perform some classes of optimisation, e.g. replacing standard += operations with atomics or vectorising to CUDA half2 types
  • Made it impossible to generate code for things that can't be programmed in a C-like language, e.g. the FPGA accelerator Zainab is making rapid progress on

This PR solves these problems by implementing a pretty basic source-to-source transpiler which, I have to say, works rather nicely. Integrating this involved rewriting a large proportion of GeNN's code generator, which definitely improved it but does mean reviewing the change is a nightmarish proposition. However, hopefully the following explains the ideas and highlights some of the potentially-controversial aspects.

Implementation

Transpiling code from GeNN code strings to backend-specific code is done in 4 stages. The implementation of the first two stages is heavily inspired by the first section of https://craftinginterpreters.com/contents.html which has been my key reading for this journey into CS 😄 Also, because GeNN code strings are very short compared to what a general-purpose compiler handles, and because stages 2-4 happen only on merged groups, there is no real need for any of this to be super-high performance, so the implementations all aim for simplicity rather than cutting-edge compiler design.

1 - Scanning

The scanner (https://github.com/genn-team/genn/blob/transpiler/src/genn/genn/transpiler/scanner.cc) converts strings to vectors of tokens (https://github.com/genn-team/genn/blob/transpiler/include/genn/genn/transpiler/token.h) to make all subsequent processing simpler.
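The scanning idea can be sketched in a few lines. This is a hypothetical Python illustration, not GeNN's implementation (which is C++); the token names and patterns here are my own simplification:

```python
import re
from typing import NamedTuple

class Token(NamedTuple):
    type: str     # e.g. "NUMBER", "IDENTIFIER", "OP"
    lexeme: str   # the matched text

# Illustrative token patterns, tried in order; a real scanner handles far more
TOKEN_SPEC = [
    ("NUMBER",     r"\d+(\.\d+)?[fd]?"),   # integer or float literal, optional suffix
    ("IDENTIFIER", r"[A-Za-z_]\w*"),
    ("OP",         r"[+\-*/=;(){}]"),
    ("SKIP",       r"\s+"),                # whitespace is discarded
]

def scan(source: str) -> list[Token]:
    """Convert a code string into a flat list of tokens."""
    tokens, pos = [], 0
    while pos < len(source):
        for ttype, pattern in TOKEN_SPEC:
            m = re.match(pattern, source[pos:])
            if m:
                if ttype != "SKIP":
                    tokens.append(Token(ttype, m.group()))
                pos += m.end()
                break
        else:
            raise SyntaxError(f"Unexpected character {source[pos]!r}")
    return tokens
```

For example, `scan("V = V + 1.0f;")` yields six tokens, with `1.0f` kept as a single NUMBER lexeme so later stages can inspect its suffix.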

2 - Parsing

The parser (https://github.com/genn-team/genn/blob/transpiler/src/genn/genn/transpiler/parser.cc) turns sequences of tokens into an Abstract Syntax Tree consisting of expressions (https://github.com/genn-team/genn/blob/transpiler/include/genn/genn/transpiler/expression.h) and statements (https://github.com/genn-team/genn/blob/transpiler/include/genn/genn/transpiler/statement.h). This is implemented as a Recursive Descent Parser (https://en.wikipedia.org/wiki/Recursive_descent_parser) where the C++ call stack does a lot of the heavy lifting.
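The "call stack does the heavy lifting" point can be made concrete with a toy sketch (illustrative names, nothing from GeNN's parser): each grammar rule becomes a function, and operator precedence falls out of which rule calls which:

```python
class Literal:
    def __init__(self, value):
        self.value = value

class Binary:
    def __init__(self, left, op, right):
        self.left, self.op, self.right = left, op, right

def parse(tokens):
    """Recursive-descent parse of a flat token list into an expression AST."""
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def advance():
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        return tok

    def primary():                     # primary := NUMBER
        return Literal(advance())

    def term():                        # term := primary (('*'|'/') primary)*
        expr = primary()
        while peek() in ("*", "/"):
            expr = Binary(expr, advance(), primary())
        return expr

    def expression():                  # expression := term (('+'|'-') term)*
        expr = term()
        while peek() in ("+", "-"):
            expr = Binary(expr, advance(), term())
        return expr

    return expression()
```

Because `expression` calls `term` which calls `primary`, `1 + 2 * 3` parses with the multiplication nested under the addition, with no explicit precedence table needed.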

3 - Type-checking

A large proportion of the compile errors you get in C are the result of type checking so, to achieve the dream of users not having to deal with real compiler errors, the transpiler needs a type checker (https://github.com/genn-team/genn/blob/transpiler/src/genn/genn/transpiler/typeChecker.cc). Basically, this 'visits' (https://en.wikipedia.org/wiki/Visitor_pattern) the AST and, for each expression, recursively determines its type, checking that its children's types are valid for each operation, e.g. that you're not assigning to a const variable. Because, as I mention later, function overloading is supported, this stage also emits a dictionary mapping expressions to types to allow the pretty printer to pick the correct function implementation.
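A minimal sketch of that visitor idea, with entirely illustrative names (GeNN's type checker is C++ and far richer): types are computed bottom-up, const violations are rejected, and every resolved type is recorded for the later pretty-printing stage:

```python
class Variable:
    def __init__(self, name):
        self.name = name

class Assignment:
    def __init__(self, name, value):
        self.name, self.value = name, value

class TypeCheckError(Exception):
    pass

class TypeChecker:
    def __init__(self, env):
        self.env = env         # name -> (type string, is_const flag)
        self.resolved = {}     # id(node) -> type, consumed by the pretty printer

    def visit(self, node):
        # Dispatch on node class name, visitor-pattern style
        ty = getattr(self, "visit_" + type(node).__name__)(node)
        self.resolved[id(node)] = ty
        return ty

    def visit_Variable(self, node):
        ty, _ = self.env[node.name]
        return ty

    def visit_Assignment(self, node):
        self.visit(node.value)             # recursively type the right-hand side
        ty, is_const = self.env[node.name]
        if is_const:
            raise TypeCheckError(f"cannot assign to const variable '{node.name}'")
        return ty
```

So with `V` declared mutable and `t` declared const, assigning to `t` raises a transpiler-level error rather than surfacing later as an inscrutable compiler error.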

4 - Pretty printing

For the current backends we need to go from the AST back to C-like code and this is done by the pretty printer (https://github.com/genn-team/genn/blob/transpiler/src/genn/genn/transpiler/prettyPrinter.cc). Like the type checker, it recursively visits the nodes of the AST but, rather than analysing them, it just prints out the corresponding C code. One semi-smart thing it does do is add an underscore in front of all variables declared in user code, thus fixing #385.
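The underscore trick can be sketched like this (a hypothetical toy, using tuple-based AST nodes rather than anything from GeNN): declarations register the name, and every later reference to a user-declared variable is emitted with the `_` prefix so it cannot collide with identifiers in generated code:

```python
def pretty(node, declared):
    """Emit C-like code for a tiny tuple-based AST, mangling user declarations."""
    kind = node[0]
    if kind == "num":                        # ("num", "1.0f")
        return node[1]
    if kind == "var":                        # ("var", "V")
        name = node[1]
        # Only user-declared names get the prefix; GeNN-provided ones pass through
        return "_" + name if name in declared else name
    if kind == "decl":                       # ("decl", "float", "V", init_expr)
        _, ty, name, init = node
        declared.add(name)
        return f"{ty} _{name} = {pretty(init, declared)};"
    if kind == "binary":                     # ("binary", "+", lhs, rhs)
        _, op, lhs, rhs = node
        return f"({pretty(lhs, declared)} {op} {pretty(rhs, declared)})"
    raise ValueError(f"unknown node kind {kind!r}")
```

A declaration like `float V = Isyn + 1.0f;` comes out as `float _V = (Isyn + 1.0f);`, while the GeNN-provided `Isyn` is left alone.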

Language

  • No preprocessor (I kinda liked being able to #define stuff in user code but I don't think it's worth it - the same effect can easily be achieved by combining bits of code in Python)
  • There is enough support for strings to printf debug messages but not much more.
  • Can't define functions, typedefs or structs in user code
  • Structs aren't supported at all
  • Some weird corner cases like octal integer and hexadecimal floating point literals aren't supported
  • The old $(xx) syntax for referencing GeNN stuff is no longer necessary at all, and the $(xx, arg1, arg2) function syntax I added doesn't hold water grammatically. Currently, there is some code which strips this out (https://github.com/genn-team/genn/blob/transpiler/src/genn/genn/gennUtils.cc#L30-L58) before transpiling to improve backward compatibility somewhat, although I'm tempted to move this to PyGeNN.
  • The & operator isn't supported - user code should not be taking the address of local variables and doing stuff with them as, in our general SIMT paradigm, local variables are essentially registers and not addressable. The only time this is slightly annoying is when dealing with extra global parameter arrays as you can no longer do stuff like const int *egpSubset = &egp[offset]; and instead have to do const int *egpSubset = egp + offset; but, personally I think that's ok.
  • Like OpenCL, while what's supported is basically C99 rather than C++, function overloading is supported so sin(30.0f) will resolve to the floating point rather than double-precision version.
  • Previously floating point literals like 30.0 were always treated as the scalar type but this is kind of annoying if you're writing mixed-precision code. Now, 30.0 will be treated as scalar but 30.0f will always be treated as float and 30.0d will always be treated as double.
  • I don't think we ever encountered it but there was potential for issues due to the size of types, e.g. long, being compiler-specific (even on 64-bit systems, it's 32-bit on Windows and 64-bit on Linux). The transpiler now guarantees an LP64 data model where int is 32-bit and long is 64-bit by always generating code with sized types, i.e. int32_t.
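The literal-typing rules above can be summarised in a tiny sketch (illustrative only - GeNN's scanner does this internally in C++, and the function and table names here are made up):

```python
# Sized-type mapping guaranteeing the LP64 model regardless of host compiler
SIZED = {
    "int": "int32_t",
    "unsigned int": "uint32_t",
    "long": "int64_t",
    "unsigned long": "uint64_t",
}

def literal_type(lexeme: str) -> str:
    """Type of a floating-point literal under the suffix rules described above."""
    if lexeme.endswith("f"):
        return "float"       # 30.0f is always single precision
    if lexeme.endswith("d"):
        return "double"      # 30.0d is always double precision
    return "scalar"          # 30.0 resolves to the model's scalar type later
```

This is the behaviour that makes mixed-precision code writable: only the unsuffixed form floats with the model's scalar precision.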

Integration

One of the reasons I chose to build all this from scratch rather than e.g. leverage LLVM is that all of this process is tightly integrated with the rest of GeNN. The scanner gets run on code strings when NeuronGroup, SynapseGroup and friends get constructed and the tokenised representation is then used in place of all the ad-hoc regular expressions for stuff like determining whether e.g. any of the RNG functions have been referenced in a code string. The type system used by the type checker is also used in place of strings to represent types throughout GeNN (the only exception is "scalar" which gets replaced with the actual type when it's encountered in the parser, type-checker and pretty-printer). This means rather than adding stars to strings you can do stuff like:

auto type = Type::Uint32.createPointer();

or

const bool isSigned = type.getNumeric().isSigned;
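To show the shape of that type-object API, here is a hypothetical Python analogue (GeNN's real type system is C++; the class layout below is my guess at the spirit of it, mirroring only the method names in the snippets above):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class Numeric:
    isSigned: bool
    size: int                      # size in bytes

@dataclass(frozen=True)
class Type:
    name: str
    numeric: Optional[Numeric] = None
    const: bool = False

    def createPointer(self) -> "Type":
        # Pointer types carry no numeric info of their own in this sketch
        return Type(self.name + "*", None, self.const)

    def addConst(self) -> "Type":
        return Type(self.name, self.numeric, True)

    def getNumeric(self) -> Optional[Numeric]:
        return self.numeric

# Illustrative pre-built type object, echoing Type::Uint32 in the C++ snippet
Uint32 = Type("uint32_t", Numeric(isSigned=False, size=4))
```

The point is that types are structured values you can query and derive from, rather than strings you append stars to.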

One of the increasingly nasty parts of GeNN was the whole group-merged class hierarchy mess, which meant that logic about what to do with a given merged group was scattered between the code that added fields to the merged group structure and the code that actually generated the code. The answer is to build the structures 'lazily', only adding fields when they are required. Both the type checker and the pretty printer have the concept of 'environments', which are basically scopes with stuff defined in them: in the case of the type checker, what matters is each name's type, e.g. const int*, and, in the case of the pretty printer, how each name should be displayed. These environments (https://github.com/genn-team/genn/blob/transpiler/include/genn/genn/code_generator/environment.h) extend outwards from the transpiler to form a replacement for the old Substitutions class and a lot of the functionality that was in GroupMerged, and provide various helpers for correctly populating the merged structures as you generate code, e.g.

groupEnv.printLine("const unsigned int npre = $(_col_length)[$(id_post)];");

will mark the struct field corresponding to _col_length as required (the _ prefix means these variables aren't exposed to user code but are only used internally). Another old issue was that a lot of unused code was generated and expensive index-calculation code was duplicated; now, bits of initialisation code can be attached to variables you add to the environment and are only generated if the variable is referenced (this finally fixes #47), e.g.:

synEnv.add(Type::Uint32.addConst(), "id_post", "idPost",
           {synEnv.addInitialiser("const unsigned int idPost = $(_ind)[$(id_syn)];")});

will only read the postsynaptic index from memory into a register if it's required.
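The lazy-environment mechanism above can be sketched as follows (a hypothetical Python toy - the `Environment` class, method names, and substitution format are illustrative, loosely echoing `printLine`/`addInitialiser` from the snippets above):

```python
import re

class Environment:
    """Scope mapping $(name) references to substitutions, tracked lazily."""

    def __init__(self):
        self.entries = {}        # name -> (substitution text, optional initialiser)
        self.referenced = set()  # names actually used by generated code

    def add(self, name, substitution, initialiser=None):
        self.entries[name] = (substitution, initialiser)

    def print_line(self, line):
        """Expand $(name) references, marking each one as required."""
        def sub(match):
            name = match.group(1)
            self.referenced.add(name)
            return self.entries[name][0]
        return re.sub(r"\$\((\w+)\)", sub, line)

    def required_initialisers(self):
        """Only initialisers whose variable was referenced get emitted."""
        return [init for name, (_, init) in self.entries.items()
                if init is not None and name in self.referenced]
```

Here, an unreferenced `id_pre` contributes neither a struct field nor an initialiser, which is exactly the "only read the postsynaptic index if it's required" behaviour described above.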

Other inclusions

Due to the long time it's taken me to tie this down, sadly, this PR also includes a bunch of other stuff as well as various syntactic improvements that it made sense to include as I reimplemented the code generation for various features.

Structural plasticity

The syntax I originally developed for this in the GeNN 4.XX versions rather creaked at the seams but, using the new transpiler functionality, I've implemented a for_each_synapse language extension that behaves like a normal for-loop (admittedly one where stuff like id_post magically appears inside it) rather than a scary macro:

remove_synapse_model = create_custom_connectivity_update_model(
    "remove_synapse",
    var_name_types=[("a", "scalar")],
    row_update_code=
    """
    for_each_synapse {
        if(id_post == id_pre) {
            remove_synapse();
            break;
        }
    }
    """)

Python feature tests

Some are still outstanding, waiting on future PRs, but the majority of the feature tests are now ported to PyGeNN + pytest. I've tried to merge similar tests into larger models to reduce the time it takes to run the test suite and have implemented variants like with/without delay and with/without batching using parameterisation (https://docs.pytest.org/en/7.3.x/how-to/parametrize.html). As you might imagine, this was a very painful process but it did find a lot of bugs and the result is way less cumbersome and actually tests PyGeNN properly! @tnowotny one thing that came out of this is the question of whether we were correctly testing the generation of random numbers from discrete distributions, i.e. the binomial distribution we added in #498. I think a chi-squared test is the right test for this but I struggled to figure out how to use it against a series of samples which might result in a "gappy" histogram, if you see what I mean.
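One standard way to handle the "gappy" histogram problem is to pool adjacent bins until every bin's expected count is large enough (the usual rule of thumb is at least 5) before computing the chi-squared statistic. A sketch of that, not the test suite's actual code, using only the standard library and a binomial as the example distribution:

```python
import math
import random

def binomial_pmf(k, n, p):
    return math.comb(n, k) * p**k * (1.0 - p) ** (n - k)

def chi_squared_statistic(samples, n, p, min_expected=5.0):
    """Chi-squared goodness-of-fit statistic for Binomial(n, p) samples,
    pooling adjacent bins so every pooled bin has expected count >= min_expected."""
    total = len(samples)
    observed = [0] * (n + 1)
    for s in samples:
        observed[s] += 1
    expected = [total * binomial_pmf(k, n, p) for k in range(n + 1)]

    # Pool sparse neighbouring bins (this is what fixes the 'gappy' histogram)
    pooled_obs, pooled_exp = [], []
    acc_o = acc_e = 0.0
    for o, e in zip(observed, expected):
        acc_o += o
        acc_e += e
        if acc_e >= min_expected:
            pooled_obs.append(acc_o)
            pooled_exp.append(acc_e)
            acc_o = acc_e = 0.0
    if acc_e > 0.0 and pooled_exp:      # fold any sparse tail into the last bin
        pooled_obs[-1] += acc_o
        pooled_exp[-1] += acc_e

    stat = sum((o - e) ** 2 / e for o, e in zip(pooled_obs, pooled_exp))
    return stat, len(pooled_exp) - 1    # statistic and degrees of freedom
```

The returned statistic would then be compared against the chi-squared critical value for the returned degrees of freedom (e.g. via scipy.stats.chi2) to accept or reject the generator.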

Syntax simplification

There will be more of this to come as some stuff has got a bit convoluted but, for now:

  • The row build and diagonal build state variables in sparse/toeplitz connectivity building code were really ugly and confusing. Sparse connectivity init snippets now just let the user write whatever sort of loop they want and do the initialisation outside and toeplitz reuses the for_each_synapse structure described above to do similar.
  • GLOBALG and INDIVIDUALG confuse almost all new users and are really only used with StaticPulse weight update models. The same functionality can be achieved with a StaticPulseConstantWeight version with the weight as a parameter. I've also renamed all the 'obvious' SynapseMatrixType variants so you just choose SPARSE, DENSE, TOEPLITZ or PROCEDURAL (with DENSE_PROCEDURALG and PROCEDURAL_KERNELG for more unusual options)
  • Extra global parameters only support the 'pointer' form, awaiting a PR to implement settable parameters to replace the other sort

Future

Aside from preventing users from doing things that the compiler would allow but don't actually work in GeNN, generating rather nicer code and giving users better error messages, this doesn't actually do a whole lot. However, with the AST representation, a whole load of things become possible. First target will be generating the nasty semi-vectorised code you need to get good half-precision performance in CUDA.

@neworderofjamie neworderofjamie marked this pull request as ready for review August 16, 2023 08:27
@tnowotny (Member) left a comment

I feel it is time now to start merging (as you suggested). Since we talked this through, no particular thoughts about deal-breaking problems have occurred to me. So I will approve these stages now.

@neworderofjamie neworderofjamie merged commit a1ab0d6 into genn_5 Jan 9, 2024
1 check failed
@neworderofjamie neworderofjamie deleted the transpiler branch January 9, 2024 09:07
Development

Successfully merging this pull request may close these issues.

  • Smarter fusing of pre and postsynaptic models
  • Dangerous conflict potential of user and system code