Skip to content
Snippets Groups Projects
  1. Dec 21, 2022
  2. Sep 21, 2022
    • Dylan Yudaken's avatar
      io_uring: add IORING_SETUP_DEFER_TASKRUN · c0e0d6ba
      Dylan Yudaken authored
      Allow deferring async tasks until the user calls io_uring_enter(2) with
      the IORING_ENTER_GETEVENTS flag. Enable this mode with a flag at
      io_uring_setup time. This functionality requires that the later
      io_uring_enter will be called from the same submission task, and therefore
      restrict this flag to work only when IORING_SETUP_SINGLE_ISSUER is also
      set.
      
      Being able to hand pick when tasks are run prevents the problem where
      there is current work to be done, however task work runs anyway.
      
      For example, a common workload would obtain a batch of CQEs, and process
      each one. Interrupting this to additional taskwork would add latency but
      not gain anything. If instead task work is deferred to just before more
      CQEs are obtained then no additional latency is added.
      
      The way this is implemented is by trying to keep task work local to a
      io_ring_ctx, rather than to the submission task. This is required, as the
      application will want to wake up only a single io_ring_ctx at a time to
      process work, and so the lists of work have to be kept separate.
      
      This has some other benefits like not having to check the task continually
      in handle_tw_list (and potentially unlocking/locking those), and reducing
      locks in the submit & process completions path.
      
      There are networking cases where using this option can reduce request
      latency by 50%. For example a contrived example using [1] where the client
      sends 2k data and receives the same data back while doing some system
      calls (to trigger task work) shows this reduction. The reason ends up
      being that if sending responses is delayed by processing task work, then
      the client side sits idle. Whereas reordering the sends first means that
      the client runs it's workload in parallel with the local task work.
      
      [1]:
      Using https://github.com/DylanZA/netbench/tree/defer_run
      
      
      Client:
      ./netbench  --client_only 1 --control_port 10000 --host <host> --tx "epoll --threads 16 --per_thread 1 --size 2048 --resp 2048 --workload 1000"
      Server:
      ./netbench  --server_only 1 --control_port 10000  --rx "io_uring --defer_taskrun 0 --workload 100"   --rx "io_uring  --defer_taskrun 1 --workload 100"
      
      Signed-off-by: default avatarDylan Yudaken <dylany@fb.com>
      Link: https://lore.kernel.org/r/20220830125013.570060-5-dylany@fb.com
      
      
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      c0e0d6ba
  3. Aug 23, 2022
  4. Aug 13, 2022
  5. Jul 25, 2022
Loading