Annotate forms by uploading images
Calculate KV cache size for language models
Extract text from images using OCR