Copy attention tensor to CPU for streamlit caching
Streamlit tries to cache the attention tensor using numpy (I think), and
this does not work because the tensor is on the GPU. Interestingly, it
only causes issues for ZymCTRL, not for TAPE-BERT.
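The numpy angle is easy to demonstrate outside of Streamlit: PyTorch refuses to convert a CUDA tensor to a numpy array directly, which is consistent with a numpy-based cache choking on GPU tensors. A minimal repro sketch (not hexviz code):

```python
import torch

if torch.cuda.is_available():
    t = torch.rand(2, 2, device="cuda")
    try:
        t.numpy()  # raises: can't convert cuda:0 device type tensor to numpy
    except TypeError as e:
        print(e)   # suggests Tensor.cpu() to copy the tensor to host memory first
    arr = t.cpu().numpy()  # works: the copy lives in host memory
```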
hexviz/attention.py  CHANGED  (+2 -1)

@@ -89,7 +89,8 @@ def get_attention(
     else:
         raise ValueError(f"Model {model_type} not supported")
 
-    return attentions
+    # Transfer to CPU to avoid issues with streamlit caching
+    return attentions.cpu()
 
 def unidirectional_avg_filtered(attention, layer, head, threshold):
     num_layers, num_heads, seq_len, _ = attention.shape
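The hunk only shows the return statement; presumably get_attention sits behind one of Streamlit's cache decorators, which is why the returned tensor gets hashed at all. A minimal sketch of that pattern, assuming st.cache_data and placeholder model output shapes (none of this is hexviz's actual code):

```python
import streamlit as st
import torch

@st.cache_data  # caches the return value; GPU-resident tensors can trip this up
def get_attention_sketch(sequence: str) -> torch.Tensor:
    device = "cuda" if torch.cuda.is_available() else "cpu"
    # Stand-in for a model forward pass producing attentions on the GPU,
    # shaped (layers, heads, seq_len, seq_len):
    attentions = torch.rand(12, 12, len(sequence), len(sequence), device=device)
    # Move to CPU before returning so the cache layer never sees a CUDA tensor.
    return attentions.cpu()
```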