POC output with URring #2357

Open

ser-0xff wants to merge 17 commits into apple:main from ordo-one:uring-out

Contributor

ser-0xff commented Jan 24, 2023

Use URing on Linux.

URing on Linux provides an API to demultiplex events and perform socket operations.
NIO was originally designed to demultiplex events and perform all I/O on a single thread, whereas URing socket operations are asynchronous. Supporting it requires more significant changes in NIO, including additional management of the memory buffers used by pending read and write operations.

One goal was to keep the PR as small as possible, so this PR sends outbound data using URing but still reads inbound data using the base socket API. Reading inbound data with URing is the next step.

Some highlights regarding the changes in this PR:

The use of the URing API on Linux can be enabled by conditional compilation.
In some places I saw '#if defined(SWIFTNIO_USE_IO_URING) && (os(Linux) || os(Android))'.
As of now we cannot even compile it for Android, let alone test it, so all URing-related code is under just os(Linux). Let's decide later what we can do about Android.
NIO uses some buffers stored in the EventLoop (iovecs, storageRefs, etc.) which are shared across all channels registered with the event loop. That approach does not work with URing because of its asynchronous nature. There are not many ways to work around this for URing, so I moved them to the channel: with URing, each channel has its own iovecs and storageRefs buffers. We could probably cache them; it's not clear which is better. Let's discuss.
URing has no analogue of the sendfile() system call.
Instead, the suggestion is to splice() the file via an intermediate pipe. It works, but requires that intermediate pipe. There are not many options for managing intermediate pipes, and I can't see a good one. So, as of now, we just cache intermediate pipes and reuse them.
URing has no analogue of the sendmmsg() system call.
Unfortunately I did not find a good way to work around that. Currently we just send messages one by one. One possible optimisation is to collect all messages with the same destination address and the same ancillary data and send them with one sendmsg(), but I did not find such an obvious optimisation in NIO, so it is probably not a common case.
I had to disable a few tests which check the order of system calls. I have no idea how we could make them work for URing, because URing is asynchronous. Let's discuss them later. All other tests work in my environment, but I will not be surprised if they fail somewhere else.
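
To illustrate the shared-buffer point above, here is a minimal sketch of why a scratch buffer shared across channels breaks once writes complete asynchronously. All names (`ScratchBuffer`, `InFlightWrite`, `submit`) are hypothetical stand-ins, not NIO types, and the "kernel" is only simulated by holding a reference.

```swift
// Hypothetical stand-ins for illustration; these are not NIO types.
final class ScratchBuffer {
    var bytes: [UInt8] = []
}

/// A submitted but not yet completed write: the kernel may still read `scratch`.
struct InFlightWrite {
    let scratch: ScratchBuffer
}

/// Submits a write by filling `scratch` and handing it to the (simulated) kernel.
func submit(_ payload: [UInt8], using scratch: ScratchBuffer) -> InFlightWrite {
    scratch.bytes = payload
    return InFlightWrite(scratch: scratch)
}

/// Two channels share one event-loop scratch buffer.
func sharedScratchDemo() -> [UInt8] {
    let shared = ScratchBuffer()
    let writeA = submit([1, 2, 3], using: shared)
    _ = submit([9, 9], using: shared)    // channel B reuses the buffer before A completes
    return writeA.scratch.bytes          // A's data has been overwritten
}

/// Each channel owns its scratch buffer.
func perChannelScratchDemo() -> [UInt8] {
    let scratchA = ScratchBuffer()
    let scratchB = ScratchBuffer()
    let writeA = submit([1, 2, 3], using: scratchA)
    _ = submit([9, 9], using: scratchB)
    return writeA.scratch.bytes          // A's data is intact
}
```

With synchronous I/O the shared variant is safe because each write finishes before the next one touches the buffer; the sketch only shows what changes when a write can still be pending.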

I will add more information later in comments on the PR if I recall something important.


          POC output with URring

ff96091

swift-server-bot commented Jan 24, 2023

Can one of the admins verify this patch?

ser-0xff mentioned this pull request

Linux: full io_uring I/O #1831

Open

Lukasa reviewed

Contributor

Lukasa left a comment

Thanks for this patch! I've left a few structural notes in the diff. Note that I haven't done a detailed code review yet, I'm just laying out some thoughts. As we work on the structure I'll continue to re-review the implementation. I've also not deeply looked into the changes to SelectorUring and LinuxUring yet, those are my next round of review. Generally though, this is looking really solid!

Sources/NIOPosix/SocketChannel.swift Outdated

+                      let storageRefs = eventLoop.storageRefs
+                      let controlMessageStorage = eventLoop.controlMessageStorage
+              #endif
+                      self.pendingWrites = PendingDatagramWritesManager(msgs: msgs,

Contributor

Lukasa Jan 24, 2023

I don't love having the pending writes managers take ownership of pointers allocated outside their scope. Can we have two different initialisers, one where they allocate the pointers themselves and one where they don't, and then keep track of what happened with a flag? Then we can drop most of the compile time conditionals, which will make our lives a lot easier.

Contributor Author

ser-0xff Jan 24, 2023 (edited)

I do not love that either...
I had the impression that having different initialisers would require bigger changes in the tests...
But maybe we could use something different?
For example, we could introduce a cache of IOVector buffers in the EventLoop, and each channel could then use that cache. Without URing that cache will always hold one buffer. With URing it will probably hold more than one, but fewer than the total number of channels, so we could save some memory. And the buffer management logic would not require any conditional compilation at all. What do you think?
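
A rough sketch of the cache idea, assuming a single-threaded event loop so no locking is needed. The names `IOVectorBuffer` and `IOVectorBufferCache` are made up for illustration and do not reflect how NIO implements this.

```swift
// Hypothetical names for illustration; this is not NIO's implementation.
final class IOVectorBuffer {
    var slots: [Int]
    init(capacity: Int) { self.slots = [Int](repeating: 0, count: capacity) }
}

/// A cache owned by one event loop, so no locking is attempted here.
final class IOVectorBufferCache {
    private var idle: [IOVectorBuffer] = []
    private let capacity: Int
    private(set) var totalAllocated = 0

    init(bufferCapacity: Int) { self.capacity = bufferCapacity }

    /// Reuses an idle buffer if there is one, otherwise allocates a new one.
    func take() -> IOVectorBuffer {
        if let buffer = idle.popLast() { return buffer }
        totalAllocated += 1
        return IOVectorBuffer(capacity: capacity)
    }

    /// Returns a buffer to the cache once the operation using it has finished.
    func giveBack(_ buffer: IOVectorBuffer) { idle.append(buffer) }
}
```

If every write takes a buffer and gives it back before the next write starts (the synchronous case), `totalAllocated` stays at one. If several writes are in flight at once, the cache grows only to the peak number of concurrent operations.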

Contributor

Lukasa Jan 24, 2023

I’m certainly open to that approach. I wonder if it’s worth landing that cache as a separate PR by itself, actually.

Contributor Author

ser-0xff Jan 25, 2023 (edited)

I could try to implement that cache in a separate branch.
Once that is merged, I will merge it into the uring-out branch and we can continue from there.
What do you think?

Contributor

Lukasa Jan 25, 2023

Yeah, that seems like a good idea to me.

Contributor

hassila Jan 26, 2023

So that's #2358 - just commenting to tie it together with this comment chain.

Lukasa reviewed

Sources/NIOPosix/LinuxUring.swift

                   }
               }
+              private func _debugPrint(_ s: @autoclosure () -> String) {
+                  #if SWIFTNIO_IO_URING_DEBUG

Contributor

Lukasa Jan 24, 2023

Any reason we need this conditional?

Contributor Author

ser-0xff Jan 25, 2023

That condition was there before I started working on it; I just left it...

Contributor

hassila Jan 25, 2023 (edited)

I think it's a remnant of the original bring-up, to avoid having to define out every debugPrint that is in place, IIRC (~40).

There are additionally three more specific defines for more in-depth debugging, but they were used in only a few places and are completely defined out (https://github.com/search?q=repo%3Aapple%2Fswift-nio%20SWIFTNIO_IO_URING_DEBUG&type=code).

Perhaps there's a better way to do this (maybe with the macro support coming up?). Basically, one would like to be able to build with debug logging during development but completely remove all traces of it for production, as this may be fairly performance sensitive.
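
For reference, a minimal sketch of the pattern being discussed: an `@autoclosure` parameter combined with a compilation condition, so the message string is never built unless the flag is set. The function name `debugLog` and the `[uring]` prefix are illustrative only; the flag name is the one from the diff above.

```swift
/// Logs only when built with -D SWIFTNIO_IO_URING_DEBUG.
/// Because `message` is an autoclosure, its expression is not evaluated
/// when the body below is compiled out.
@inline(__always)
func debugLog(_ message: @autoclosure () -> String) {
    #if SWIFTNIO_IO_URING_DEBUG
    print("[uring] \(message())")
    #endif
}

// Call sites need no conditionals of their own.
let pendingCount = 3
debugLog("submitting \(pendingCount) entries")
```

This keeps the call sites unconditional, which is presumably why the single wrapper was preferred over wrapping each of the roughly 40 call sites individually.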

Contributor

Lukasa Jan 25, 2023

Oh I see, ok.

ser-0xff added 3 commits

February 8, 2023 13:47


          Merge remote-tracking branch 'upstream/main' into uring-out

f807679


          Some polishes.

5cef4a7


          Report tests that can't work properly with URing as skipped.

c544c17

Contributor Author

ser-0xff commented Feb 9, 2023 (edited)

I got the point regarding the conditional compilation.
What do you think: should we push the conditional compilation deeper at that point, leaving the async API available even when the backend is synchronous?

Contributor

Lukasa commented Feb 13, 2023

I'm inclined to suggest that we have most backends implement the functions with fatalError, and only have the uring selector implement them. Then we can ensure that at runtime we only call those functions when we know we have the uring selector, which we should know either as a compile-time static or as event loop state.
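
One way to read this suggestion, as a hedged sketch: a protocol extension supplies a trapping default, only the uring-style backend overrides it, and callers check a capability flag before taking the async path. Every name here (`SelectorBackend`, `supportsAsyncWrites`, `writeAsync`, the two backend types) is hypothetical and not taken from NIO.

```swift
// Hypothetical protocol for illustration; not NIO's selector API.
protocol SelectorBackend {
    var supportsAsyncWrites: Bool { get }
    func writeAsync(_ bytes: [UInt8]) -> Int
}

extension SelectorBackend {
    // Default for backends without async I/O.
    var supportsAsyncWrites: Bool { false }

    func writeAsync(_ bytes: [UInt8]) -> Int {
        fatalError("writeAsync called on a backend without async I/O")
    }
}

/// A synchronous backend inherits the trapping default.
struct EpollLikeBackend: SelectorBackend {}

/// Only the uring-style backend implements the async function.
struct UringLikeBackend: SelectorBackend {
    var supportsAsyncWrites: Bool { true }
    func writeAsync(_ bytes: [UInt8]) -> Int { bytes.count } // pretend to enqueue
}

/// The caller decides at runtime which path to take.
func write<B: SelectorBackend>(_ bytes: [UInt8], using backend: B) -> String {
    if backend.supportsAsyncWrites {
        return "async:\(backend.writeAsync(bytes))"
    } else {
        return "sync:\(bytes.count)"
    }
}
```

The trade-off is that a mistaken call becomes a runtime trap rather than a compile error, in exchange for far fewer `#if` blocks.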


          Remove some conditional compilation.

2b3e441

Contributor Author

ser-0xff commented Feb 16, 2023

Could you have a look at the PR?
From your point of view, is this an appropriate way to remove the conditional compilation in that particular case?

Contributor

Lukasa commented Feb 16, 2023

Yes, I think that works fine.


          Merge pull request #1 from ordo-one/uring-out-wip

ec570cb

Remove conditional compilation.

Lukasa reviewed

Sources/NIOPosix/PendingDatagramWritesManager.swift

		private var bufferPool: Pool<PooledBuffer>
		private var buffer: PooledBuffer?

Contributor

Lukasa Feb 16, 2023

Why do we need the local buffer reference?

Contributor Author

ser-0xff Feb 17, 2023 (edited)

There are 2 reasons for that:

we need to put that pooled buffer back into the pool when the async operation completes
we must keep the memory block used by the async operation alive until the operation completes

And we also have 2 options for doing it:

tie that pooled buffer to the asynchronous write request
keep it locally, so that when the async request completes we use the locally stored object to put it back into the pool
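
A small sketch of the first option (tying the buffer to the request), for comparison with the local reference used in the diff. `DemoBuffer`, `DemoPool`, and `PendingWrite` are invented for this example and are not the `Pool` and `PooledBuffer` types from the PR.

```swift
// Hypothetical types for illustration; not the Pool/PooledBuffer from the PR.
final class DemoBuffer {
    var bytes: [UInt8] = []
}

final class DemoPool {
    private var idle: [DemoBuffer] = []
    var idleCount: Int { idle.count }

    func take() -> DemoBuffer { idle.popLast() ?? DemoBuffer() }
    func giveBack(_ buffer: DemoBuffer) { idle.append(buffer) }
}

/// The request holds a strong reference, which keeps the memory alive
/// while the operation is in flight, and returns the buffer on completion.
struct PendingWrite {
    let buffer: DemoBuffer
    let pool: DemoPool

    func complete() { pool.giveBack(buffer) }
}
```

With this shape the manager needs no `buffer` property of its own, at the cost of carrying the pool reference in every request.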

ser-0xff added 3 commits

February 17, 2023 09:09


          Remove some conditional compilation.

f397a26


          Remove some conditional compilation.

3539f5b


          Disable possibility to send files with URing.

7fffd46

Contributor Author

ser-0xff commented Feb 17, 2023 (edited)

Could you have a look at the PR?
I tried to disable the ability to send files with URing, but ended up with a lot of conditional compilation in the tests.
As far as I understand, the test runner parses the Swift code and generates code that calls the tests, so I have no better idea of how to disable those tests when running with URing.
What do you think?

Contributor

Lukasa commented Feb 24, 2023

I'm entirely happy with us adding this conditional compilation into the tests, as it's small and clear. 👍

ser-0xff added 5 commits

February 26, 2023 12:18


          Merge pull request #2 from ordo-one/uring-out-wip

cdc5f66

Disable possibility to send files with URing.


          Merge branch 'apple:main' into uring-out

3bdc831


          Merge branch 'main' into uring-out

efd22cd


          Merge branch 'main' into uring-out

50d1497


          Handle properly socket write error (close connection).

388b16f

Contributor

hassila commented Sep 15, 2023

So what would be good next steps if we want to move this forward?

ser-0xff added 3 commits

September 21, 2023 02:34


          Merge branch 'main' into uring-out

877fa47


          Merge branch 'main' into uring-out

f6c9355


          Stabilize tests.

35b76a3

Contributor Author

ser-0xff commented Sep 22, 2023 (edited)

I merged the most recent changes from main into the feature branch and stabilised the tests, so all tests (except the skipped ones) pass now.
I had a lot of fun with the test stabilisation.
It still seems to me that having both SYNC and ASYNC code in the source at the same time is not a good idea, and that it would be much better to separate them with #ifdefs. The point is that at runtime only one approach should be active, but the codebase has a lot of conditional branches, and sometimes SYNC I/O is triggered when ASYNC is expected because the source was not fixed properly. That could not happen if the SYNC functionality were moved completely under #ifdef. I am not sure I have found all the places where SYNC I/O can be triggered in ASYNC mode.
But still, all tests pass now; it would be nice if you could have a look at the PR.

ser-0xff marked this pull request as ready for review

September 22, 2023 08:08

ser-0xff requested a review from Lukasa

September 22, 2023 08:08

hassila mentioned this pull request

Add NIOFilesystem #2615

Merged
