Subtask 0 & 1: - Set a benchmark for evaluating the RoboticVisionPipeline. - Upload a dataset of 50 images to be used for testing and benchmarking. Requirements: - Define evaluation criteria and metrics (e.g., accuracy, precision, recall, grasp success rate, spatial relation extraction quality). - Prepare and document the dataset upload process.