feat(crd-generator): Add CRD-Generator Maven plugin #5979

baloo42 · 2024-05-03T14:04:21Z

Description

Add CRD-Generator Maven plugin (using api-v2).

Examples: https://github.com/baloo42/crd-generator-maven-examples

mvn crd-generator:help -Ddetail=true -Dgoal=generate

crd-generator:generate

  Available parameters:

    classpath (Default: WITH_RUNTIME_DEPENDENCIES)
      The classpath which should be used during the CRD generation.
      Choice of:
      * PROJECT_ONLY: Only classes in the project.
      * WITH_RUNTIME_DEPENDENCIES: Classes from the project and any runtime
      dependencies.
      * WITH_COMPILE_DEPENDENCIES: Classes from the project and any compile time
      dependencies.
      * WITH_ALL_DEPENDENCIES: Classes from the project, compile time and
      runtime dependencies.
      * WITH_ALL_DEPENDENCIES_AND_TESTS: Classes from the project (including
      tests), compile time, runtime and test dependencies.
      User property: fabric8.crd-generator.classpath

    customResourceClasses
      Custom Resource classes, which should be considered to generate the CRDs.
      If set, scanning is disabled.
      User property: fabric8.crd-generator.customResourceClasses

    dependenciesToScan
      Dependencies which should be scanned for Custom Resources.
      User property: fabric8.crd-generator.dependenciesToScan

    exclusions
      Exclusions, used to filter Custom Resource classes after scanning.
      User property: fabric8.crd-generator.exclusions

    forceIndex (Default: false)
      If true, a Jandex index will be created even if the directory or JAR file
      contains an existing index.
      User property: fabric8.crd-generator.forceIndex

    forceScan (Default: false)
      If true, directories and JAR files are scanned even if Custom Resource
      classes are given.
      User property: fabric8.crd-generator.forceScan

    implicitPreserveUnknownFields (Default: false)
      If true, x-kubernetes-preserve-unknown-fields: true will be added on
      objects which contain an any-setter or any-getter.
      User property: fabric8.crd-generator.implicitPreserveUnknownFields

    inclusions
      Inclusions, used to filter Custom Resource classes after scanning.
      User property: fabric8.crd-generator.inclusions

    outputDirectory (Default:
    ${project.build.outputDirectory}/META-INF/fabric8/)
      The output directory where the CRDs are emitted.
      User property: fabric8.crd-generator.outputDirectory

    parallel (Default: true)
      If true, the CRDs will be generated in parallel.
      User property: fabric8.crd-generator.parallel

    skip (Default: false)
      If true, execution will be skipped.
      User property: fabric8.crd-generator.skip

Type of change

Bug fix (non-breaking change which fixes an issue)
Feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change
Chore (non-breaking change which doesn't affect codebase;
test, version modification, documentation, etc.)

Checklist

Code contributed by me aligns with current project license: Apache 2.0
I Added CHANGELOG entry regarding this change
I have implemented unit tests to cover my changes
I have added/updated the javadocs and other documentation accordingly
No new bugs, code smells, etc. in SonarCloud report
I tested my code in Kubernetes
I tested my code in OpenShift

baloo42 · 2024-05-08T07:52:39Z

@manusa Can you have second look on this? Is this now going into the right direction?

The logic to collect and load custom resource classes is now included in an own module "crd-generator-collector" intended to be shared between maven-plugin, gradle-plugin, cli etc.

TBD:

Do we have to care about Zip Bombs, if we collect custom resource classes from a jar file? (I have added some safeguards but sonar is not recognizing it)
Do we need reset methods? (the collector classes contain a state)
Do we want to include the examples in some way? --> https://github.com/baloo42/crd-generator-maven-examples

A prototype for the CLI which is using the same base can be found here: baloo42#9

baloo42 · 2024-05-08T14:24:46Z

@shawkins, @andreaTP Maybe you can have a look on this too.

Marc has mentioned that he is not sure about the final / future purpose of the new module (me too), so I'll try to explain a few thoughts I had.

To use the CRD-Generator from CLI, Maven, Gradle etc. we need in general two main features which are not implemented in the api-v2 module:

Search / Collect / Filter custom resources classes
Customizable class loading

The new crd-generator-collector module implements both features via the main component CustomResourceCollector which can be used in the following way:

CustomResourceCollector customResourceCollector = new CustomResourceCollector()
        .withClasspathElements(<additional classpath elements>)
        .withFilesToIndex(<e.g. directory with class files or jar file with class files>)

CustomResourceInfo[] customResourceInfos = customResourceCollector.findCustomResources();

CRDGenerator crdGenerator = new CRDGenerator()
        .customResources(customResourceInfos)
        .inOutputDir(outputDirectory);
        
crdGenerator.detailedGenerate()

In conclusion this approach is some kind of "pre-processor" for the CRDGenerator.

Another approach would be to implement it as a facade ("processor"):

CRDGeneratorFacade facade = new CRDGeneratorFacade()
        .withClasspathElements(<additional classpath elements>)
        .withFilesToIndex(<e.g. directory with class files or jar file with class files>)
        .inOutputDir(outputDirectory)

facade.detailedGenerate()

This approach is nicer from a user perspective but we need to passthrough every option (or use some kind of nested builder pattern).

A third approach would be to implement both features in the api-v2 module itself and make jandex an optional dependency (or allow to extend it by using a service loader).

CRDGenerator crdGenerator = new CRDGenerator()
        .withClasspathElements(<additional classpath elements>)
        .withFilesToIndex(<e.g. directory with class files or jar file with class files>)
        .inOutputDir(outputDirectory)

crdGenerator.detailedGenerate()

This approach might be the nicest but it would also be the most advanced: the api-v2 module has now to deal somehow with additional dependencies like Jandex and the code base of the api-v2 would grow.

Which approach do you prefer? Other thoughts?

shawkins · 2024-05-08T16:10:34Z

Which approach do you prefer? Other thoughts?

I'd like to keep the core api as simple as possible - whatever is needed to satisfy Quarkus should be good.

The first option seems good enough. I don't really see much need for coupling between the collector and the crd generator - it doesn't really need to provide CustomResourceInfo, just classes. I mentioned on a pr somewhere that the collector to me just looks like something that you could provide a list of interfaces / base classes, and a list of annotations, and it will just find everything that matches that.

baloo42 · 2024-05-08T16:50:04Z

I'd like to keep the core api as simple as possible - whatever is needed to satisfy Quarkus should be good.
👍

The first option seems good enough. I don't really see much need for coupling between the collector and the crd generator - it doesn't really need to provide CustomResourceInfo, just classes.

Yes, you are right. Providing only classes could basically do the job but then we cannot implement high level filtering (e.g. by group or version). Do we need high level filtering? If yes: High level filtering could also be implemented in api-v2 so that we can stay by the strategy to return only classes instead of CustomResourceInfo's.

I mentioned on a pr somewhere that the collector to me just looks like something that you could provide a list of interfaces / base classes, and a list of annotations, and it will just find everything that matches that.

Yes, at the moment the base interface (HasMetadata) and the additional required annotations (Version, Group) to identify a custom resource class are hardcoded. But this can be easily changed to be configurable (with sane defaults) if you think this is useful. Do we need to search for other Interfaces/Annotations?

andreaTP · 2024-05-08T16:57:22Z

Completely agree with what @shawkins said, let's keep the core API as simple as possible.
I don't think that filtering is required, and can be implemented in the tooling when necessary.

shawkins · 2024-05-08T17:08:04Z

Do we need high level filtering? If yes: High level filtering could also be implemented in api-v2 so that we can stay by the strategy to return only classes instead of CustomResourceInfo's.

Right I'd vote for putting that into the api if needed.

It would probably be good to consider whether having a public constructor on CustomResourceInfo makes sense - that opens up a different usage model for the CRDGenerator that isn't bound to the typical annotations.

Do we need to search for other Interfaces/Annotations?

Not yet. I was just highlighting that it doesn't really need to be seen as that specific to CRs.

baloo42 · 2024-05-08T17:30:43Z

Thanks for the feedback. To sum it up, I have the following todo's so far:

Return a list of custom resource classes instead of a list of CustomResourceInfo's
Remove high level filtering from crd-generator-collector module with no replacement atm
~~Try to generalize the used interfaces / annotations to identify a custom resource~~

shawkins · 2024-05-08T17:34:01Z

Try to generalize the used interfaces / annotations to identify a custom resource

Don't worry about this step just yet, what you currently have should work fine for the first iteration.

… of CustomResourceInfo's

baloo42 · 2024-05-09T19:00:27Z

...ator/collector/src/main/java/io/fabric8/crd/generator/collector/CustomResourceCollector.java

+    return customResourceClasses;
+  }
+
+  public Class<? extends HasMetadata>[] findCustomResourceClasses() {


This method is required because the CRDGenerator accepts only an array.

Can we extend the CRDGenerator with an additional method?

public CRDGenerator customResourceClasses(Collection<Class<? extends HasMetadata>> crClasses) { return customResources(crClasses.stream().map(CustomResourceInfo::fromClass).toArray(CustomResourceInfo[]::new)); }

baloo42 · 2024-05-09T19:04:13Z

...ector/src/main/java/io/fabric8/crd/generator/collector/JandexCustomResourceClassScanner.java

+   *
+   * @see JandexCustomResourceClassScanner#findCustomResourceClasses(IndexView)
+   */
+  private Index createBaseIndex() {


This can be optimized if required.
Existing indices of the fabric8 modules can be used or a specialized index can be created at build time.

Do we need further optimization within this PR?

I agree that we should reuse the existing index is possible - since users wil need to modify their pom / gradle file to use these plugins, it doesn't seem out of the question to just document also requiring the jandex plugin be configured as well for projects that want to create crds.

I'm currently writing some usage docs about the different use cases, including when additional configuration is required. This should help to get an overview from the user perspective.

Some notes on a more technically perspective:

Jandex is only required to "scan" for Custom Resource classes. If the list of classes is given, no scanning at all is required. (The classloader does the heavy lifting)

The implementation detects existing indices in the scan targets automatically and prefers those. Only if a scan target does not contain an index, an own in-memory index will be created. This means having an existing index is just optimization, never a requirement.

Custom Resource classes must be loaded before they can be used by the CRDGenerator itself. Any dependency which is required to load those classes must be in the classpath.

A scan target must be also in the classpath. But not every classpath element is a scan target.

Additional indexed classes beside the "base-index" are only required, if the user wants to scan a target with Custom Resource classes which don't implement the CustomResource or HasMetadata interface directly. In this case, the user must ensure that the used intermediate class is included somehow in the underlying composite index. For example, he can add the dependency which includes the class to the list of scan targets, so that an index is automatically created if needed.

In conclusion, a user doesn't need to know anything about Jandex. Even for the edge case from the last bullet point, the user only has to care about including all "relevant" classes to the list of scan targets and not Jandex specifically.

(But if his project already uses Jandex, the scans will be a little bit faster.)

Keeping this in mind, optimizing the creation of the base index is mostly about performance.
Only if there are other "intermediate" classes like "CustomResource", then we should add them.

I wrote an integration test for the "intermediate class" case (last bullet point from above):

ddc3bec#diff-0488fc4e2f91d3e88ce94d75d1317631a14f2854194a2cf68354be4712e9c2cb

In this case, the user must add the dependency, which contains the intermediate class, to the list of dependencies to scan. Under the hood the classes are now added to the composite index and Custom Resources Classes will be found by the collector.

crd-generator/maven-plugin/pom.xml

baloo42 · 2024-05-09T19:13:12Z

crd-generator/collector/src/main/java/io/fabric8/crd/generator/collector/JandexIndexer.java

+
+  private int maxJarEntries = 100000;
+  private long maxBytesReadFromJar = 100000000; // 100 MB
+  private long maxClassFileSize = 1000000; // 1 MB


Do we need limits?

crd-generator/collector/pom.xml

rohanKanojia · 2024-05-15T02:59:30Z

...ator/maven-plugin/src/test/java/io/fabric8/crd/generator/maven/plugin/ClasspathTypeTest.java

+import static org.junit.jupiter.api.Assertions.assertTrue;
+import static org.mockito.Mockito.when;
+
+@EnabledForJreRange(min = JRE.JAVA_17)


Why do we want to run these tests in Java 17 and above only? Is it possible to remove it?

I get some mockito related errors and havn't figured out yet why this is only the case on a lower versions than 17. (see https://github.com/fabric8io/kubernetes-client/actions/runs/9052896736/job/24871094873#step:4:36770)
Any ideas?

But in general I'm not really happy with both tests in the maven-plugin module. A lot stuff must be mocked, only to test some simple things...

I'm open to rewrite both tests enterily if someone can point me into the right direction. e.g. writing e2e tests with tool x. (The java-generator maven plugin doesn't include any own tests, so I cannot use it as template)

Do you think you can use maven invoker plugin here? We use it for JKube integration tests to verify generated YAML files.

Added the first integration tests, is this the right direction?

In my opinion, it looks okay, but we might want to get opinions from other team members.

Maybe we can make the verification check more strict (it might help us find the differences in generated YAML). In Eclipse JKube we compare whole YAML manifests in expected/ directory with generated output.

Added more integration tests and now the content is also verified.
Should I keep the unit tests or remove them?

It's up to you. If you think they're redundant feel free to remove them.

rohanKanojia · 2024-05-15T03:02:02Z

crd-generator/maven-plugin/README.md

@@ -0,0 +1,180 @@
+# CRD-Generator - Maven Plugin
+
+Maven plugin for the CRD-Generator: Generate CRDs from Java model during Maven builds.


This document should be either linked in https://github.com/fabric8io/kubernetes-client/blob/main/doc/CRD-generator.md or we should move it in doc/ folder like the rest of the documentation files.

Good point, we havn't talked about where the documentation for the generator-v2 should be placed, how it should be structured and especially how it should be distinguished to generator-v1.

My intention was to write first for each generator-v2 extension (maven-plugin, gradle-plugin, cli) an own README until at least two of them are merged.
Afterwards, it should be easier to write some common notes about generator-v2 including migration paths and deprecations / roadmap into the central docs. And of course the content from the README's can be moved again, too.

If I should do this within this PR, I need some guidence on how to introduce generator-v2. Otherwise we should probably create a dedicated issue for the common part of the generator-v2 docs.

sonarcloud · 2024-05-16T22:20:54Z

Quality Gate failed

Failed conditions
1 Security Hotspot
75.0% Coverage on New Code (required ≥ 80%)

See analysis details on SonarCloud

baloo42 mentioned this pull request May 3, 2024

CRD-Generator Maven plugin baloo42/kubernetes-client#7

Closed

baloo42 force-pushed the crd-generator-maven-plugin branch 2 times, most recently from fa9d221 to dc50235 Compare May 8, 2024 06:59

baloo42 added 16 commits May 9, 2024 20:56

feat(crd-generator): Add CRD-Generator Maven plugin

e097b2d

Add crd-generator-collector module

891166a

Refactor crd-generator-maven-plugin to use collector module

3680105

Adjusted log levels and allow to force create indices

7921d00

Enable parallel by default in crd-generator maven plugin

f197ba1

Remove not necessary classes from base index

8fd2615

Cleanup

24024c7

More cleanup

1de6930

Fix base index (must also include CustomResource)

2b64039

Fix includes/excludes and some code smells

e6dc848

Add CustomResourceCollector tests

8692862

Cleanup, add tests and fix code smells

eacd7e1

Add tests for crd-generator-maven-plugin and fix code smells

f4fe3c8

Fix code smells

e22a3c7

Remove high level filtering and return an array of cr classes instead…

3ac2865

… of CustomResourceInfo's

Refactor Jandex utility methods to be more reusable

2842f0d

baloo42 force-pushed the crd-generator-maven-plugin branch from 3705e74 to 2842f0d Compare May 9, 2024 18:56

baloo42 commented May 9, 2024

View reviewed changes

Polish var names, add javadoc, improve error handling, refactor tests

a576044

baloo42 added 3 commits May 12, 2024 18:19

Fix CrdGeneratorMojoTest

c04e849

Fix code smells

3996b8e

Try to fix tests

b90b230

baloo42 marked this pull request as ready for review May 12, 2024 19:24

baloo42 requested review from andreaTP, metacosm, manusa, oscerd, rohanKanojia, shawkins and sunix as code owners May 12, 2024 19:24

rohanKanojia reviewed May 13, 2024

View reviewed changes

crd-generator/collector/pom.xml Outdated Show resolved Hide resolved

baloo42 added 2 commits May 13, 2024 18:02

Use same jandex version for core module and maven plugin

ec86a68

Add README for crd-generator maven plugin

4141076

rohanKanojia reviewed May 15, 2024

View reviewed changes

baloo42 added 2 commits May 15, 2024 10:04

Fix README

2139478

Add integration tests using maven-invoker plugin

b79bae8

baloo42 force-pushed the crd-generator-maven-plugin branch from cc4ec43 to b79bae8 Compare May 15, 2024 09:54

baloo42 added 5 commits May 15, 2024 17:49

Add more integration tests and verify content

2492a6e

Add integration tests for intermediate custom resource classes

ddc3bec

Remove CrdGeneratorMojoTest - already covered by integration test

8b177bb

Fix ClasspathTypeTest

e4b022b

Fix groovy files formatting

59dffef

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(crd-generator): Add CRD-Generator Maven plugin #5979

feat(crd-generator): Add CRD-Generator Maven plugin #5979

baloo42 commented May 3, 2024 •

edited

baloo42 commented May 8, 2024

baloo42 commented May 8, 2024

shawkins commented May 8, 2024

baloo42 commented May 8, 2024

andreaTP commented May 8, 2024

shawkins commented May 8, 2024

baloo42 commented May 8, 2024 •

edited

shawkins commented May 8, 2024

baloo42 May 9, 2024

baloo42 May 9, 2024

shawkins May 13, 2024 •

edited

baloo42 May 13, 2024

baloo42 May 15, 2024

baloo42 May 9, 2024

rohanKanojia May 15, 2024

baloo42 May 15, 2024

rohanKanojia May 15, 2024

baloo42 May 15, 2024

rohanKanojia May 15, 2024

baloo42 May 15, 2024

rohanKanojia May 15, 2024

rohanKanojia May 15, 2024

baloo42 May 15, 2024

sonarcloud bot commented May 16, 2024

		@@ -0,0 +1,180 @@
		# CRD-Generator - Maven Plugin

		Maven plugin for the CRD-Generator: Generate CRDs from Java model during Maven builds.

feat(crd-generator): Add CRD-Generator Maven plugin #5979

Are you sure you want to change the base?

feat(crd-generator): Add CRD-Generator Maven plugin #5979

Conversation

baloo42 commented May 3, 2024 • edited

Description

Type of change

Checklist

baloo42 commented May 8, 2024

baloo42 commented May 8, 2024

shawkins commented May 8, 2024

baloo42 commented May 8, 2024

andreaTP commented May 8, 2024

shawkins commented May 8, 2024

baloo42 commented May 8, 2024 • edited

shawkins commented May 8, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

shawkins May 13, 2024 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sonarcloud bot commented May 16, 2024

Quality Gate failed

baloo42 commented May 3, 2024 •

edited

baloo42 commented May 8, 2024 •

edited

shawkins May 13, 2024 •

edited