
[MLIR] Specialize active callbacks to their own function #735

Closed

Conversation

@erick-xanadu (Contributor) commented on May 13, 2024:

Context: Enzyme allows one to specify custom gradients for specific functions. In order to specify custom gradients for callbacks, each callback needs to be specialized into its own function. E.g., instead of having the following code:

func.call @callback(%identifier, %argc, %retc, %0, %1, ..., %m, %n)

and being unable to register a custom gradient for an individual callback (every callback shares the single @callback symbol), specialize callbacks according to their identifiers like so:

func.func @active_callback_123(%arg0, %arg1, ..., %argm, %argn) {
  %identifier = llvm.mlir.constant ...
  %argc = llvm.mlir.constant ...
  %retc = llvm.mlir.constant ...
  llvm.call @callback(%identifier, %argc, %retc, %arg0, %arg1, ..., %argm, %argn)
  return
}

  // ...
  func.call @active_callback_123(%0, %1, ..., %m, %n)
  // ..

Now we can register a custom gradient for @active_callback_123 independently of any other specialized callback (e.g. @active_callback_456).
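For context, Enzyme's documented custom-gradient mechanism is keyed on a specific function symbol, typically through a global named __enzyme_register_gradient_<name> that points at the original function, an augmented forward pass, and a reverse pass. The C++ sketch below only illustrates that convention; the callback signature, the _fwd/_rev helper names, and the exact array layout are assumptions for illustration, not Catalyst's actual registration code (consult the Enzyme documentation for the authoritative details).

// Hypothetical sketch of Enzyme's custom-gradient registration convention.
// All signatures below are assumed for illustration only.
extern "C" {
// Specialized callback outlined by the pass (assumed signature).
void active_callback_123(double *in, double *out);
// Augmented forward pass and reverse (gradient) pass supplied separately.
void active_callback_123_fwd(double *in, double *out);
void active_callback_123_rev(double *in, double *d_in, double *out, double *d_out);

// Because @active_callback_123 is its own symbol, a custom gradient can be
// associated with it without touching @active_callback_456 or the generic @callback.
void *__enzyme_register_gradient_active_callback_123[3] = {
    (void *)active_callback_123,
    (void *)active_callback_123_fwd,
    (void *)active_callback_123_rev,
};
} // extern "C"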

Description of the Change:

  • Iterate over all ActiveCallbackOps and create the specialized function; each ActiveCallbackOp is annotated with its specialized function (a rough sketch of this outlining step is shown after this list).
  • During the lowering of ActiveCallbackOps to LLVM IR, replace each one with a call to its specialized function.
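
Below is a minimal C++ sketch of the outlining step using upstream MLIR builder APIs. It assumes the identifier and the argument/result counts are already known as integers, that results are passed through memref operands (so the specialized function returns nothing), and that the generic entry point is named @callback as in the example above; the helper specializeActiveCallback is illustrative and not the actual Catalyst pass.

// Minimal sketch of outlining one active callback into its own function
// (hypothetical helper, not the actual Catalyst implementation).
#include "mlir/Dialect/Arith/IR/Arith.h"
#include "mlir/Dialect/Func/IR/FuncOps.h"
#include "mlir/IR/Builders.h"
#include "mlir/IR/BuiltinOps.h"

using namespace mlir;

static func::FuncOp specializeActiveCallback(ModuleOp module, Operation *activeCallback,
                                             int64_t identifier, int64_t argc, int64_t retc)
{
    Location loc = activeCallback->getLoc();
    OpBuilder builder(module.getBodyRegion());

    // One function per callback identifier, e.g. @active_callback_123, so that a
    // custom gradient can later be registered with Enzyme for this symbol alone.
    std::string name = "active_callback_" + std::to_string(identifier);
    auto fnType = builder.getFunctionType(activeCallback->getOperandTypes(),
                                          /*results=*/TypeRange());
    auto fn = builder.create<func::FuncOp>(loc, name, fnType);
    fn.setPrivate();

    // Body: materialize the identifier and argument/result counts as constants
    // and forward the data operands to the generic @callback entry point.
    Block *entry = fn.addEntryBlock();
    builder.setInsertionPointToStart(entry);
    Value id = builder.create<arith::ConstantIntOp>(loc, identifier, /*width=*/64);
    Value nArgs = builder.create<arith::ConstantIntOp>(loc, argc, /*width=*/64);
    Value nRets = builder.create<arith::ConstantIntOp>(loc, retc, /*width=*/64);

    SmallVector<Value> callArgs{id, nArgs, nRets};
    callArgs.append(entry->args_begin(), entry->args_end());
    builder.create<func::CallOp>(loc, "callback", TypeRange(), callArgs);
    builder.create<func::ReturnOp>(loc);

    // The original ActiveCallbackOp is later replaced by a call to this
    // specialized function during lowering (not shown here).
    return fn;
}

A lowering pattern for ActiveCallbackOp would then rewrite each op into a call to the function returned here, mirroring the second bullet above.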

[sc-60494]

@erick-xanadu force-pushed the eochoa/2024-05-09/specialize-callbacks-part-two branch from de89893 to 09d4f02 on May 13, 2024 at 20:56
codecov bot commented on May 13, 2024:

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.04%. Comparing base (7248c12) to head (00cf68c).

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #735   +/-   ##
=======================================
  Coverage   98.04%   98.04%           
=======================================
  Files          69       69           
  Lines        9536     9538    +2     
  Branches      762      763    +1     
=======================================
+ Hits         9350     9352    +2     
  Misses        151      151           
  Partials       35       35           


@erick-xanadu force-pushed the eochoa/2024-05-09/specialize-callbacks-part-two branch from 09d4f02 to e1a1cc4 on May 24, 2024 at 19:16
@dime10 (Collaborator) left a comment:


Thanks Erick, looks good! Regarding reusing the transformation from results into arguments and memrefs into struct pointers, is that something that would be applicable to this PR?

Comment on lines +145 to 148
"specialize-active-callback-pass",
"annotate-function",
"lower-mitigation",
"lower-gradients",
Collaborator:

Should this go directly before gradient lowering?

Contributor Author:

Why?

Collaborator:

I think I was tying the callbacks to gradients because of the registration with Enzyme. I was also thinking about what the lowest point in the pipeline is at which it is first needed.

Is there a reason it should be the first thing in the pipeline?

@@ -142,6 +142,7 @@ def run_writing_command(command: List[str], compile_options: Optional[CompileOpt
QUANTUM_COMPILATION_PASS = (
"QuantumCompilationPass",
[
"specialize-active-callback-pass",
"annotate-function",
Collaborator:

Unrelated to this PR, but I cannot tell at all what this pass is for or relates to (from its name) 😅

Contributor Author:

Do you have a different name that you prefer? Would outline-active-callback-pass make more sense?

Collaborator:

Oh, I was referring to annotate-function.

Resolved review threads: mlir/include/Catalyst/IR/CatalystOps.td, mlir/include/Catalyst/Transforms/Passes.td
@@ -110,8 +110,47 @@ struct BufferizePythonCallOp : public OpConversionPattern<PythonCallOp> {
bufferArgs.push_back(newBuffer);
}
Collaborator:

An inactive callback has no results, right?

Collaborator:

Ah, maybe I got this wrong because I thought debug.callback -> inactive_callback, but I guess the active_callback also calls the inactive callback.

Contributor Author:

I am not sure whether you were saying that I should get rid of the results in the InactiveCallbackOp; I just did. The reason I had not removed them earlier is that I was still exploring different ways to implement this. To be honest, I have a better way in mind now 😅, but I will prioritize getting this PR in, and subsequent PRs will focus on cleaning up.

Collaborator:

> but I will prioritize getting this PR in and subsequent PRs will focus on cleaning up

Generally I will say that I don't much like the practice of splitting up PRs along a "rough first implementation" / "cleanup" line, because it makes it almost impossible to track the deficiencies of the first PR and make sure they all got addressed in the second PR.
A split along functionality is much more sensible imo.

This PR is fine but something to keep in mind for the future.

Contributor Author:

I agree with you, but it is difficult to reconcile "please close tickets before the end of the iteration" with "no rough-draft implementation PRs" when:

  1. we did not have time to design the whole epic in advance to foresee everything in each story
  2. stories build upon previous stories that may not be used until later.

I do not know if it would be possible to have tickets that do not correspond to merging PRs but instead just to getting the implementation done, plus a final ticket for merging the implementation.

That, to me, seems appropriate for large features like this one. It would satisfy the opinions that:

  1. Small PRs are better (as they help with reviews)
  2. No draft PRs (as everything is already finished except for merging)

But it would put a lower priority on "close tickets before the end of the story" and "each ticket must be a PR".

@erick-xanadu (Contributor Author) commented:

@dime10

> Thanks Erick, looks good! Regarding reusing the transformation from results into arguments and memrefs into struct pointers, is that something that would be applicable to this PR?

No, but a future PR might change this. I'm considering whether using memrefs directly in the specialization and only later undergoing the pointer-to-struct ABI transform would be better in terms of readability / invariants. I think so, but I will implement it in a different PR.

@erick-xanadu (Contributor Author) commented:

Closing in favour of #782
