Revisit IoDispatcher #1670

a10y · 2024-12-12T18:15:13Z

Capturing convo with @AdamGS. We are forcing all users of the vortex-io crate to take the overhead of dispatching the IO requests, even if they are already running in a Tokio runtime context.

We could change the way we do IO to make it more ergonomic for users that are just using Tokio already, for example we could force all of our read futures to be Send and simply spawn tasks when a Tokio runtime is already installed. A potential strawman new trait might look like this:

pub trait IO {
  // We need to choose if all futures are Send or !Send, and the only way to make this work without
  // assuming always having an external dispatcher is to force them all to be Send
  fn read_bytes(&self, range: Range<usize>) -> impl Future<Output = Bytes> + Send + Sync + 'static;
}

// We have a non-dispatched impl for tokio files
struct LocalTokioFile(File);

impl IO for LocalTokioFile {
  async fn read_bytes(&self, range: Range<usize>) -> Bytes {
    ...
  }
}

struct TokioFileDispatched {
  // We submit a request (byte range) as well as a handle for how the remote thread
  // can send us back a result (the Bytes read)
  submitter: Sender<(Range<usize>, Sender<Bytes>)>,
}

impl IO for TokioFileDispatched {
  fn read_bytes(&self, range: Range<usize>) -> impl Future<Output = Bytes> {
    // Run on the remote thread.
    let (tx, rx) = oneshot::channel();
    submitter.submit((range, tx));
    return rx;
  }
}

You'd have separate IO implementations for a source that controls how the futures are launched, allowing you to specialize sending them to a remote runtime or launching them immediately.

The text was updated successfully, but these errors were encountered:

robert3005 · 2024-12-12T18:34:28Z

Even if you already have a tokio runtime you likely don’t want to mix your cpu and io workloads

AdamGS · 2024-12-13T11:13:24Z

As you know, I believe that's true, but the current design doesn't allow you to to have fine grained control over that, you HAVE to spin up an additional runtime, even you already architected your system to use multiple runtimes.

Its also much harder to extend the Dispatcher trait than a more IO focused trait, forcing users into caring about a whole new problem space.

a10y mentioned this issue Dec 12, 2024

ci: use cargo-hack and build all valid feature combos #1653

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revisit IoDispatcher #1670

Revisit IoDispatcher #1670

a10y commented Dec 12, 2024

robert3005 commented Dec 12, 2024

AdamGS commented Dec 13, 2024

Revisit IoDispatcher #1670

Revisit IoDispatcher #1670

Comments

a10y commented Dec 12, 2024

robert3005 commented Dec 12, 2024

AdamGS commented Dec 13, 2024