Spaces:
Runtime error
This demo loads the
FlaxCLIPVisionMBartforConditionlGenerationpresent in themodeldirectory of this repository. The checkpoint is loaded fromckpt/ckpt-49499which is pre-trained checkpoint with 70k steps. 100 random validation set examples are present in thereferences.tsvwith respective images in theimagesdirectory.We provide
English Translationof the generated caption and reference captions for users who are not well-acquainted with the other languages. This is done usingmtranslateto keep things flexible enough and needs internet connection as it uses the Google Translate API. We will also add the original captions soon.The sidebar contains generation parameters such as
Number of Beams,Top-P,Temperaturewhich will be used when generating the caption.Lastly, one can choose the
Languageof the caption in the dropdown below to generate a caption in that particular language.