bioclip-2-demo / README.md
egrace479's picture
Update metadata parquet column description
cf2d1c8 verified

A newer version of the Gradio SDK is available: 5.35.0

Upgrade
metadata
title: Bioclip 2 Demo
emoji: 😻
colorFrom: pink
colorTo: gray
sdk: gradio
sdk_version: 5.33.0
app_file: app.py
pinned: false
license: mit
models:
  - imageomics/bioclip-2
datasets:
  - imageomics/TreeOfLife-200M

This app is modified from the original BioCLIP Demo to run inference with BioCLIP 2 and uses pybioclip.

Due to space persistent storage limitations, embeddings are fetched from the TreeOfLife-200M repo. The images will be retrieved from an S3 bucket, as with the origin, described below.

Note that if this space is duplicated, the sample image portion will not work.

bioclip-2/metadata.parquet: metadata file for fetching TreeOfLife-200M sample images (up to 3 available per taxa) from an S3 bucket.

  • uuid: unique identifier for the image within the TreeOfLife-200M dataset.
  • eol_page_id: identifier of EOL page for the most specific taxa of the image (where available). Note that an image's association to a particular page ID may change with updates to the EOL (or image provider's) hierarchy. However, EOL taxon page IDs are stable. "https://eol.org/pages/" + eol_page_id links to the page.
  • gbif_id: GBIF identifier for the occurrence from which the image is sourced.
  • gbif_taxon_id: identifier used by GBIF for the most specific taxa of the image (where available). "https://gbif.org/species/" + gbif_taxon_id links to the page.
  • kingdom: kingdom to which the subject of the image belongs (all Animalia).
  • phylum: phylum to which the subject of the image belongs.
  • class: class to which the subject of the image belongs.
  • order: order to which the subject of the image belongs.
  • family: family to which the subject of the image belongs.
  • genus: genus to which the subject of the image belongs.
  • species: species to which the subject of the image belongs.
  • file_path: image filepath to fetch image from S3 bucket (<folder>/<uuid>.jpg, folders are first two characters of the uuid).