TCP/UDP: new function is_listening: t -> ~port:int -> callback option #508

hannesm · 2023-04-05T13:09:19Z

This is useful for proxies/middleware/interception of requests, a running example is let's encrypt and the HTTP challenge.

The methodology is as follows:

the unikernel requests (via https from let's encrypt) a challenge and solves it (using a private key, some cryptographic computations)
the let's encrypt server (wants to proof the ownership of the hostname in the certificate signing request) requests via HTTP (port 80) a specific resource (http://example.com/.well-known/acme-challenge/...)
the unikernel needs to properly reply to that challenge

Now, one path (that we took until now) is to treat this .well-knwon/acme-challenge very special in any unikernel that we wrote.

Another path is to create a let's encrypt http challenge library that takes a stack, and whenever it needs it registers itself for port 80, proxying everything it is not interested in, to the old handler (thus, is_listening), and serving the .well-known/acme-challenge.

Concurrent updates to the "listen" hashtable are dangerous of course, great care has to be taken (if some other parts of the application as well re-register listeners). But I'm confident since listen, unlisten, and is_listening are pure (not in Lwt monad), it's fine and can be dealt with. Another option would be to implement a real protocol/locking around the shared global resource of listening ports (but I'd first see whether we run into such troubles).

Another example is the let's encrypt ALPN challenge, where the process is as follows:

the unikernel requests (via https from let's encrypt) a challenge and solves it (using a private key, some cryptographic computations)
the let's encrypt server (wants to proof the ownership of the hostname in the signing request) connects via TLS on port 443 with a specific ALPN string
the unikernel needs to reply with a specially craftes self-signed certificate

This can, as above, be implemented by a temporary proxy while the challenge is in process -- without service interruptions for other parties (web browser, ...)

This is useful for proxies/middleware/interception of requests, a running example is let's encrypt and the HTTP challenge. The methodology is as follows: - the unikernel requests (via https from let's encrypt) a challenge and solves it (using a private key, some cryptographic computations) - the let's encrypt server (wants to proof the ownership of the hostname in the certificate signing request) requests via HTTP (port 80) a specific resource (http://example.com/.well-known/acme-challenge/...) - the unikernel needs to properly reply to that challenge Now, one path (that we took until now) is to treat this .well-knwon/acme-challenge very special in any unikernel that we wrote. Another path is to create a let's encrypt http challenge library that takes a stack, and whenever it needs it registers itself for port 80, proxying everything it is not interested in, to the old handler (thus, is_listening), and serving the .well-known/acme-challenge. Concurrent updates to the "listen" hashtable are dangerous of course, great care has to be taken (if some other parts of the application as well re-register listeners). But I'm confident since listen, unlisten, and is_listening are pure (not in Lwt monad), it's fine and can be dealt with. Another option would be to implement a real protocol/locking around the shared global resource of listening ports (but I'd first see whether we run into such troubles). Another example is the let's encrypt ALPN challenge, where the process is as follows: - the unikernel requests (via https from let's encrypt) a challenge and solves it (using a private key, some cryptographic computations) - the let's encrypt server (wants to proof the ownership of the hostname in the signing request) connects via TLS on port 443 with a specific ALPN string - the unikernel needs to reply with a specially craftes self-signed certificate This can, as above, be implemented by a temporary proxy while the challenge is in process -- without service interruptions for other parties (web browser, ...)

hannesm · 2023-04-11T10:18:22Z

I added a second function, TCP.unread : flow -> Cstruct.t -> unit which purpose is to push some data back into the flow.

I'm not convinced this is the right thing to do (though it is very convenient for my use case). The implementation is rather basic (and works fine for my use case, but not for generality - where you may have a task already blocking on read while unread is called).

I'd like to finish and evaluate the prototype I have before merging this here..

hannesm · 2023-04-11T10:34:04Z

src/stack-unix/tcpv4v6_socket.mli

@@ -17,7 +17,6 @@

 include Tcpip.Tcp.S
  with type ipaddr = Ipaddr.t
-   and type flow = Lwt_unix.file_descr


any insight whether this is needed somewhere?

hannesm · 2023-04-11T10:35:20Z

src/tcp/user_buffer.ml

@@ -59,6 +59,9 @@ module Rx = struct
    | None -> 0
    | Some b -> Cstruct.length b

+  let add_l t s =
+    ignore(Lwt_dllist.add_l (Some s) t.q)


any idea whether any other things must be updated? I frankly don't understand much of the add_r below, but it deals with various cur_size and max_size.

also, do t.readers need to be notified?

cur_size and max_size seem to be r(elated to a window of available data in the buffer (cur_size is what is currently available and max_size the bound for available data, but it should be possible to exceed that limit with the linked list data structure).
To keep things going, I think it's best to update cur_size and call notify_size_watcher to say that the data is online. Something like (the first comparison in add_r seems to be there to avoid exceeding max_size (again), I'm not sure the problem could be anything other than higher memory consumption, but it may be best to take care of that?):

let add_l t s = match Lwt_dllist.take_opt_l t.readers with | None -> t.cur_size <- Int32.(add t.cur_size (of_int (seglen s))); ignore(Lwt_dllist.add_l (Some s) t.q) notify_size_watcher t | Some w -> Lwt.return (Lwt.wakeup u s)

hannesm · 2023-04-11T10:37:44Z

src/stack-unix/tcpv4v6_socket.ml

@@ -78,6 +149,10 @@ let dst fd =
    in
    ip, port

+let unread fd buf =
+  let buf = Cstruct.append buf fd.buf in
+  fd.buf <- buf


what needs to be handled (for a complete, general API) is if a lwt task is already in Lwt_cstruct.read -- where the read should be cancelled and the buf provided here being returned to the caller.

hannesm requested a review from dinosaure April 5, 2023 13:13

dinosaure approved these changes Apr 11, 2023

View reviewed changes

provide TCP.unread

deb7a3b

hannesm marked this pull request as draft April 11, 2023 10:19

hannesm commented Apr 11, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TCP/UDP: new function is_listening: t -> ~port:int -> callback option #508

TCP/UDP: new function is_listening: t -> ~port:int -> callback option #508

hannesm commented Apr 5, 2023

hannesm commented Apr 11, 2023

hannesm Apr 11, 2023 •

edited

hannesm Apr 11, 2023

hannesm Apr 11, 2023

palainp Apr 11, 2023

hannesm Apr 11, 2023

TCP/UDP: new function is_listening: t -> ~port:int -> callback option #508

Are you sure you want to change the base?

TCP/UDP: new function is_listening: t -> ~port:int -> callback option #508

Conversation

hannesm commented Apr 5, 2023

hannesm commented Apr 11, 2023

hannesm Apr 11, 2023 • edited

Choose a reason for hiding this comment

hannesm Apr 11, 2023

Choose a reason for hiding this comment

hannesm Apr 11, 2023

Choose a reason for hiding this comment

palainp Apr 11, 2023

Choose a reason for hiding this comment

hannesm Apr 11, 2023

Choose a reason for hiding this comment

hannesm Apr 11, 2023 •

edited