Generate segmentation masks for objects in an image
Visualise outputs of VideoMAE
Image retrieval on the Food101 dataset