Subtask 2:
- Record the results and accuracy of the RoboticVisionPipeline on the 50-image dataset.

Requirements:
- Collect the outputs for each image: labels, confidences, masks, boxes, grasps, and scene descriptions.
- Calculate and summarize accuracy, precision, recall, and other relevant statistics for each stage of the pipeline.
- Document the process and results for reproducibility.
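As a rough illustration of the statistics requirement, the sketch below computes overall accuracy and per-label precision and recall for the labeling stage, plus an intersection-over-union score that could be used to judge predicted boxes. It assumes ground-truth labels and boxes are available for each image and that boxes are given as (x1, y1, x2, y2) corners; the function names `classification_metrics` and `box_iou` are hypothetical and not part of RoboticVisionPipeline.

```python
from collections import Counter


def classification_metrics(y_true, y_pred):
    """Return overall accuracy plus per-label precision and recall.

    y_true and y_pred are equal-length sequences of label strings,
    one entry per image (an assumed layout, not the pipeline's own).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted label was wrong
            fn[truth] += 1  # true label was missed

    per_label = {}
    for label in sorted(set(y_true) | set(y_pred)):
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        per_label[label] = {
            "precision": tp[label] / predicted if predicted else 0.0,
            "recall": tp[label] / actual if actual else 0.0,
        }

    accuracy = sum(tp.values()) / len(y_true)
    return {"accuracy": accuracy, "per_label": per_label}


def box_iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    # Toy labels for illustration only, not results from the dataset.
    truth = ["cup", "cup", "box", "box"]
    preds = ["cup", "box", "box", "box"]
    print(classification_metrics(truth, preds))
    print(box_iou((0, 0, 2, 2), (1, 1, 3, 3)))
```

Writing the returned dictionary to a file alongside the dataset version and pipeline configuration would be one way to support the reproducibility requirement; mask and grasp stages would need their own stage-appropriate measures.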