aksell committed
Commit c663b1c · 1 Parent(s): a2cfd88

Remove docs from main page and list supported models

hexviz/pages/2_📄Documentation.py CHANGED
@@ -40,7 +40,13 @@ TODO: Add examples of attention patterns
 
 Read more about attention patterns in e.g. [Revealing the dark secrets of BERT](https://arxiv.org/abs/1908.08593).
 
-# FAQ
+## Protein language models in Hexviz
+Hexviz currently supports the following models:
+1. [ProtBERT](https://huggingface.co/Rostlab/prot_bert_bfd)
+2. [ZymCTRL](https://huggingface.co/nferruz/ZymCTRL)
+3. [TapeBert](https://github.com/songlab-cal/tape/blob/master/tape/models/modeling_bert.py) - a nickname coined in BERTology meets biology for the BERT-Base model pre-trained on Pfam in [TAPE](https://www.biorxiv.org/content/10.1101/676825v1). TapeBert is used extensively in BERTology meets biology.
+
+## FAQ
 1. I can't see any attention "bars" in the visualization, what is wrong? -> Lower the `minimum attention`.
 2. How are sequences I input folded? -> Using https://esmatlas.com/resources?action=fold
 """
hexviz/🧬Attention_Visualization.py CHANGED
@@ -124,7 +124,6 @@ attention_pairs, top_residues = get_attention_pairs(
     head=head,
     threshold=min_attn,
     model_type=selected_model.name,
-    ec_class=ec_class,
     top_n=n_highest_resis,
 )
 
@@ -197,28 +196,35 @@ def get_3dview(pdb):
 xyzview = get_3dview(pdb_id)
 showmol(xyzview, height=500, width=800)
 
-st.markdown(f"""
-Visualize attention weights from protein language models on protein structures.
-Currently attention weights for PDB: [{pdb_id}](https://www.rcsb.org/structure/{pdb_id}) from layer: {layer_one}, head: {head_one} above {min_attn} from {selected_model.name.value}
-are visualized as red bars. The {n_highest_resis} residues with the highest sum of attention are labeled.
-Visualize attention weights on protein structures for the protein language models TAPE-BERT, ZymCTRL and ProtBERT.
-Pick a PDB ID, layer and head to visualize attention.
-""", unsafe_allow_html=True)
+st.markdown(
+    f"""
+    Pick a PDB ID, layer and head to visualize attention from the selected protein language model ({selected_model.name.value}).
+    """,
+    unsafe_allow_html=True,
+)
 
 chain_dict = {f"{chain.id}": chain for chain in list(structure.get_chains())}
 data = []
-for att_weight, _ , chain, resi in top_residues:
+for att_weight, _, chain, resi in top_residues:
     res = chain_dict[chain][resi]
     el = (att_weight, f"{res.resname:3}{res.id[1]}")
     data.append(el)
 
-df = pd.DataFrame(data, columns=['Total attention (disregarding direction)', 'Residue'])
-st.markdown(f"The {n_highest_resis} residues with the highest attention sum are labeled in the visualization and listed below:")
+df = pd.DataFrame(data, columns=["Total attention (disregarding direction)", "Residue"])
+st.markdown(
+    f"The {n_highest_resis} residues with the highest attention sums are labeled in the visualization and listed here:"
+)
 st.table(df)
 
-st.markdown("""Clik in to the [Identify Interesting heads](#Identify-Interesting-heads) page to get an overview of attention
-patterns across all layers and heads
-to help you find heads with interesting attention patterns to study here.""")
+st.markdown(
+    """
+    ### Check out the other pages
+    [🗺️Identify Interesting heads](Identify_Interesting_Heads) gives a bird's-eye view of attention patterns for a model,
+    which can help you pick which specific attention heads to look at for your protein.
+
+    [📄Documentation](Documentation) has information on protein language models, attention analysis and Hexviz."""
+)
+
 """
 The attention visualization is inspired by [provis](https://github.com/salesforce/provis#provis-attention-visualizer).
-"""
+"""