Move probe expansion into codegen #3155

viktormalik · 2024-05-07T10:52:30Z

The previous probe expansion approach tried to minimize the amount of LLVM functions generated by emitting a single function for all probe matches in most cases. While this was efficient, it came with a couple of drawbacks:

It is necessary to generate a separate LLVM function for each match (e.g. when the probe builtin is used). This leads to having two very similar loops for iterating matches in BPFtrace::add_probe and in CodegenLLVM::visit(Probe) which is quite confusing and hard to maintain.
libbpf needs one BPF program (i.e. one LLVM function) per probe so if we want to delegate program loading (and possibly attachment) to libbpf (which we do), we cannot use this approach. See [RFC] Probe expansion in codegen #3005 for more details.

This refactors probe expansion by moving most of it into codegen. Overall, we now distinguish two types of probe expansion:

Full expansion - A separate LLVM function is generated for each match. This is used for most expansions now.
Multi expansion - Used for k(u)probes when k(u)probe_multi is available. Generates one LLVM function and one BPF program for all matches and attaches the expanded functions via bpf_link_create_opts.

This allows to drop a lot of duplicated code. The expansion for "full" is done in CodegenLLVM::visit(Probe), the expansion for "multi" is done in BPFtrace::add_probe.

A drawback of this approach is that we generate substantially larger ELF objects for expansions of probe types which do not support multi-probes (e.g. kfuncs and tracepoints) as we generate duplicate LLVM functions. This is something we can live with for now since multi-attachment is not the main use-case for these probe types (e.g. attaching to many kfuncs is very slow) and there's usually an alternative to use multi-kprobes. In addition, we will probably add a third expansion type using LLVM symbol aliases but that requires libbpf changes.

One particular area where this refactoring caused problems is unit tests in tests/bpftrace.cpp. Previously, it was sufficient to generate a simple ast::Probe and pass it to BPFtrace::add_probe since that was where most of the expansion was done. Now that the expansion was moved to codegen, we need to do full parser -> field analyser -> clang parser -> semantic analyser -> codegen sequence.

Also, the problem comes with USDT probes as it is not possible to easily mock USDTHelper which is a fully static class. Since we need to override AttachPoint::usdt::num_locations from tests, we allow to do that via a new internal env variable BPFTRACE_TEST_USDT_NUM_LOCATIONS.

Checklist

Language changes are updated in man/adoc/bpftrace.adoc
User-visible and non-trivial changes updated in CHANGELOG.md
The new behaviour is covered by tests

src/ast/passes/codegen_llvm.cpp

viktormalik · 2024-05-07T13:25:18Z

Update: I had to drop unit tests for single-wildcard targets for uprobe/USDT added in #2757. The reason is that tests in tests/bpftrace.cpp now run semantic analyser which in turn runs discovery of all running processes on the system for such probes, which is not something that should be run in unit tests (and it cannot be mocked in this case). FWIW, I don't think that those tests would add much value b/c all the code added by #2757 would be replaced by mocks anyways.

ajor

Are "full expansion" and "multi expansion" meant to be mutually exclusive? An enum instead of two bools might be clearer/safer if that's the case.

I haven't dug into the cause, but I can trigger a crash on this branch by setting both full and multi expansion to true on a single probe with this script:

kprobe:*:__x64_sys_rea* { print("a") }

CHANGELOG.md

ajor · 2024-05-15T16:36:21Z

src/ast/attachpoint_parser.cpp

+  // As the C++ language supports function overload, a given function name
+  // (without parameters) could have multiple matches even when no
+  // wildcards are used.
+  if (has_wildcard(ap_->func) || has_wildcard(ap_->target) ||


Don't we need full expansion for wildcarded executable paths? i.e.:

if (has_wildcard(ap_->target) ap_->need_full_expansion = true;

In such a case, we do multi-expansion for each expanded path. See BPFtrace::add_probe for details.

viktormalik · 2024-05-16T11:19:14Z

Are "full expansion" and "multi expansion" meant to be mutually exclusive? An enum instead of two bools might be clearer/safer if that's the case.

Indeed they are, a great suggestion!

I haven't dug into the cause, but I can trigger a crash on this branch by setting both full and multi expansion to true on a single probe with this script:
kprobe:*:__x64_sys_rea* { print("a") }

Nice catch! It should be fixed now.

viktormalik · 2024-05-16T11:22:09Z

While rebasing onto master, I noticed that uprobe_multi expansion doesn't work with placing uprobes after the funtion prologue. I opened #3173 to track that issue and this PR respects the current behavior (i.e. when uprobe expansion is needed, use kprobe_multi and attach to function entries, otherwise attach after function prologue).

ajor

Just a partial review so far - I'll try and get through bpftrace.cpp tomorrow.

I'm not keen on using environment variables to mock behaviour - it runs the risk of something dodgy happening if a user sets the env var at run time. I've recently started thinking about how we can make bpftrace a "trusted compiler" to allow it to be used by a non-root user (for use in containers with BPF token), and for that to work the compiler's behaviour has to be pretty well defined.

How about this as a way of mocking static classes: https://godbolt.org/z/37czorj84

src/bpftrace.cpp

viktormalik · 2024-05-23T14:35:42Z

I'm not keen on using environment variables to mock behaviour - it runs the risk of something dodgy happening if a user sets the env var at run time. I've recently started thinking about how we can make bpftrace a "trusted compiler" to allow it to be used by a non-root user (for use in containers with BPF token), and for that to work the compiler's behaviour has to be pretty well defined.

It turned out that the only method that we need to mock doesn't need to be static so I made USDTHelper non-static and created MockUSDTHelper which mocks the find method.

The previous probe expansion approach tried to minimize the amount of LLVM functions generated by emitting a single function for all probe matches in most cases. While this was efficient, it came with a couple of drawbacks: - It is necessary to generate a separate LLVM function for each match (e.g. when the 'probe' builtin is used). This leads to having two very similar loops for iterating matches in BPFtrace::add_probe and in CodegenLLVM::visit(Probe) which is quite confusing and hard to maintain. - libbpf needs one BPF program (i.e. one LLVM function) per probe so if we want to delegate program loading (and possibly attachment) to libbpf (which we do), we cannot use this approach. See [1] for more details. This refactors probe expansion by moving most of it into codegen. Overall, we now distinguish two types of probe expansion: Full expansion - A separate LLVM function is generated for each match. This is used for most expansions now. Multi expansion - Used for k(u)probes when k(u)probe_multi is available. Generates one LLVM function and one BPF program for all matches and attaches the expanded functions via bpf_link_create_opts. This allows to drop a lot of duplicated code. The expansion for "full" is done in CodegenLLVM::visit(Probe), the expansion for "multi" is done in BPFtrace::add_probe. A drawback of this approach is that we generate substantially larger ELF objects for expansions of probe types which do not support multi-probes (e.g. kfuncs and tracepoints) as we generate duplicate LLVM functions. This is something we can live with for now since multi-attachment is not the main use-case for these probe types (e.g. attaching to many kfuncs is very slow) and there's usually an alternative to use multi-kprobes. One particular area where this refactoring caused problems is unit tests in tests/bpftrace.cpp. Previously, it was sufficient to generate a simple ast::Probe and pass it to BPFtrace::add_probe since that was where most of the expansion was done. Now that the expansion was moved to codegen, we need to do full parser -> field analyser -> clang parser -> semantic analyser -> codegen sequence. With this change, some tests had to be dropped, especially the tests with a single wildcard for uprobe/USDT target. The reason is that semantic analyser expands these wildcards by searching all paths on the system which is something that cannot be mocked and therefore should not be run in unit tests (e.g. it prevents running the unit tests as non-root). Also, the problem comes with USDT probes as USDTHelper was originally fully static which prevented its mocking. Fortunately, we only need to mock the find() method which doesn't have to be static, so this refactors USDTHelper and some of its users and introduces MockUSDTHelper which mocks the find method for unit tests. [1] bpftrace#3005

viktormalik requested review from ajor, danobi and fbs as code owners May 7, 2024 10:52

viktormalik force-pushed the codegen-expansion branch from 222e56d to 3be0722 Compare May 7, 2024 10:53

github-advanced-security bot found potential problems May 7, 2024

View reviewed changes

src/ast/passes/codegen_llvm.cpp Dismissed Show resolved Hide resolved

viktormalik force-pushed the codegen-expansion branch from 3be0722 to 63e4c72 Compare May 7, 2024 13:20

viktormalik force-pushed the codegen-expansion branch 2 times, most recently from 10f6f74 to dde56bd Compare May 10, 2024 09:30

ajor self-assigned this May 15, 2024

ajor reviewed May 15, 2024

View reviewed changes

viktormalik force-pushed the codegen-expansion branch from dde56bd to 305b515 Compare May 16, 2024 11:19

viktormalik force-pushed the codegen-expansion branch from 305b515 to d3da3ae Compare May 17, 2024 09:04

ajor reviewed May 20, 2024

View reviewed changes

src/bpftrace.cpp Outdated Show resolved Hide resolved

viktormalik force-pushed the codegen-expansion branch from d3da3ae to 5dfc728 Compare May 23, 2024 14:32

viktormalik force-pushed the codegen-expansion branch from 5dfc728 to e18dac6 Compare May 23, 2024 14:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move probe expansion into codegen #3155

Move probe expansion into codegen #3155

viktormalik commented May 7, 2024 •

edited

viktormalik commented May 7, 2024

ajor left a comment

ajor May 15, 2024

viktormalik May 16, 2024

viktormalik commented May 16, 2024

viktormalik commented May 16, 2024

ajor left a comment

viktormalik commented May 23, 2024

Move probe expansion into codegen #3155

Are you sure you want to change the base?

Move probe expansion into codegen #3155

Conversation

viktormalik commented May 7, 2024 • edited

Checklist

viktormalik commented May 7, 2024

ajor left a comment

Choose a reason for hiding this comment

ajor May 15, 2024

Choose a reason for hiding this comment

viktormalik May 16, 2024

Choose a reason for hiding this comment

viktormalik commented May 16, 2024

viktormalik commented May 16, 2024

ajor left a comment

Choose a reason for hiding this comment

viktormalik commented May 23, 2024

viktormalik commented May 7, 2024 •

edited