Thanks for taking this survey and helping to grow Ray Data! This survey's results will be used to plan the Ray Data roadmap for v2.10+. The goal is to understand your workload needs in data loading for training.
Please describe your workload in the following questions.
If you foresee the workload changing in the future (e.g., you will require larger data scale in the future), please also note a range and an approximate timeline if possible.
What is your goal? Ex: “I want to train ResNet-50 on the ImageNet dataset and do it as (quickly|cheaply) as possible.”
What type of preprocessing do you want to do? Ex: “I want to randomly crop each image.”
What is the cluster scale? Ex: “1-4 GPUs, and I want up to 4 additional CPU-only nodes.”