Add support for scalar product operator #67

Merged: 6 commits into xcompact3d:main on Mar 5, 2024

Conversation

@semi-h (Member) commented Mar 1, 2024

We need a scalar product operator for calculating the enstrophy in the domain. The CUDA kernel could be implemented in a more performant way, and there are lots of examples out there, but the current implementation is not bad. We only need to run this once every 100 to 1000 iterations or so for post-processing, so I'm not too worried about it.
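
For illustration, here is a minimal sketch of how such an operator might be used to assemble the enstrophy from the vorticity components. The field names and the backend%scalar_product call are assumptions made for this sketch, not code from the PR:

! Hedged sketch: enstrophy as half the sum of the squared vorticity
! components over the grid, built from three scalar products. A cell-volume
! factor may also be needed depending on the definition used.
! omega_x/y/z and backend%scalar_product are illustrative names only.
enstrophy = 0.5_dp*(backend%scalar_product(omega_x, omega_x) &
                    + backend%scalar_product(omega_y, omega_y) &
                    + backend%scalar_product(omega_z, omega_z))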

@pbartholomew08 (Member) commented:

Could we just use the dot_product intrinsic? NVIDIA claims to accelerate all of these.
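
For reference, the intrinsic-based version would be roughly a one-liner like the sketch below. It is illustrative only: it flattens the arrays to rank-1 for dot_product and, as the reply below points out, it would also include any padded entries in the sum.

! Illustrative only: scalar product via the dot_product intrinsic on
! flattened views of the data; padded entries would be summed as well.
s = dot_product(reshape(x, [size(x)]), reshape(y, [size(y)]))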

@semi-h (Member, Author) commented Mar 1, 2024

We'll probably have padding in our data structure when the grid size is not a multiple of SZ, and if we then use a library to calculate the dot product it'll be hard to eliminate the padded entries.
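
A minimal sketch of a padding-aware sum, assuming the blocked (SZ, n, n_groups) layout where only the last group of pencils may be partially filled; n_pencils, n_groups and the loop bounds are illustrative names, not the actual implementation:

! Hedged sketch: accumulate only the valid lanes of each SZ-wide group so
! that padded entries never contribute to the scalar product.
s = 0._dp
do k = 1, n_groups
  n_valid = min(SZ, n_pencils - (k - 1)*SZ)  ! last group may be partial
  do j = 1, n
    do i = 1, n_valid
      s = s + x(i, j, k)*y(i, j, k)
    end do
  end do
end do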

@JamieJQuinn (Collaborator) left a comment


Looks solid, just needs some documentation.

@@ -304,6 +305,16 @@ subroutine vecadd_omp(self, a, x, b, y)

end subroutine vecadd_omp

real(dp) function scalar_product_omp(self, x, y) result(s)
implicit none

Collaborator suggested change:
! TODO incomplete implementation


s = sum_d

call MPI_Allreduce(MPI_IN_PLACE, s, 1, MPI_DOUBLE_PRECISION, MPI_SUM, &
Collaborator:

Since we're thinking about mixed/reduced precision, should the precision passed to MPI derive from the actual data type? Not sure what the typical way to do this is in MPI codes. @pbartholomew08, @Nanoseb?
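
One possible pattern, sketched under the assumption that the working precision is controlled by a single kind parameter dp; MPI_X3D2_REAL and the communicator are made-up names for illustration:

! Hedged sketch: select the MPI datatype from the working-precision kind
! instead of hard-coding MPI_DOUBLE_PRECISION. Names and communicator are
! illustrative only.
integer :: MPI_X3D2_REAL

if (dp == kind(0.0d0)) then
  MPI_X3D2_REAL = MPI_DOUBLE_PRECISION
else
  MPI_X3D2_REAL = MPI_REAL
end if

call MPI_Allreduce(MPI_IN_PLACE, s, 1, MPI_X3D2_REAL, MPI_SUM, &
                   MPI_COMM_WORLD, ierr)

MPI also provides MPI_Type_create_f90_real(p, r, newtype, ierror), which builds a datatype matching a given precision and range at runtime, if a standard-defined mapping from the Fortran kind is preferred.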

@@ -329,31 +340,27 @@ subroutine copy_into_buffers(u_send_s, u_send_e, u, n, n_blocks)

end subroutine copy_into_buffers

- subroutine set_fields_omp(self, u, v, w, u_in, v_in, w_in)
+ subroutine set_field_omp(self, f, arr)
Collaborator:

Unclear argument names. Perhaps field and data?

@@ -199,6 +199,27 @@ attributes(global) subroutine axpby(n, alpha, x, beta, y)

end subroutine axpby

attributes(global) subroutine scalar_product(s, x, y, n)
Collaborator:

The caller needs to understand that this is a per-pencil sum in order to set the correct n_threads/blocks. This should be documented here.
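
The kind of note being asked for might read like the sketch below. The exact thread mapping is an assumption about this kernel (one block per SZ-wide group of pencils, one thread per lane, each thread summing along its own pencil), so treat it as illustrative rather than a description of the merged code:

! Documentation sketch for the kernel header:
!   scalar_product computes a partial sum of x*y per pencil; launch with
!   one block per group of SZ pencils and SZ threads per block so that
!   every valid pencil is covered exactly once.
!
! Matching launch on the caller side (names assumed for illustration):
blocks = dim3(n_groups, 1, 1)
threads = dim3(SZ, 1, 1)
call scalar_product<<<blocks, threads>>>(sum_d, x_d, y_d, n)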

@semi-h merged commit 85246f3 into xcompact3d:main on Mar 5, 2024
2 checks passed