You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We need to include support within PyActiveStorage for the situation where the remote server does not support ActiveStorage but the client has requested ActiveStorage support - in this situation we should fail over to calculate the operations ourselves. To do that, we need our version of reduce_chunk to grab the necessary blocks and do the operations itself, as it currently does for POSIX.
In the long-term we would hope that netcdf4-python would do this transparently, but for the moment we need to use h5netcdf to do it.
The text was updated successfully, but these errors were encountered:
This is a similar scenario to when S3 active storage is broken or too busy to handle the request.
Should activestorage.s3.reduce_chunk handle these cases transparently, or raise an error that is handled by Active which propagates the request to activestorage.storage.reduce_chunk? I lean towards the former approach, keeping all S3 interaction within the s3 module. In that case it would make sense to extract some of the Numpy operations to a common module to be shared by the storage and s3 modules.
I think we need to handle s3 independently of s3 active storage. There are going to be a lot of use cases where the dask workflow has identified a need to bring all the data back to the client whether or not there is active storage present.
We think the error needs to propagate up to PyActiveStorage so it can avoid making unnecessary repeated requests which would introduce extra latency on each block.
Context: each computational chunk in Dask has it's own PyActiveStorage instance ... they are likely to be requesting in parallel, so once a computational chunk sees a problem it should give up using active, but some may still work fine.
We need to include support within PyActiveStorage for the situation where the remote server does not support ActiveStorage but the client has requested ActiveStorage support - in this situation we should fail over to calculate the operations ourselves. To do that, we need our version of
reduce_chunk
to grab the necessary blocks and do the operations itself, as it currently does for POSIX.In the long-term we would hope that netcdf4-python would do this transparently, but for the moment we need to use
h5netcdf
to do it.The text was updated successfully, but these errors were encountered: