Specifically in the case that data is stored in object store, we should make it possible to cache data locally. It's possible/likely this is just leveraging an existing datafusion capability or library (e.g. https://github.com/apache/arrow-rs-object-store) and exposing configuration controls to the users.