DeepShape
DeepShape is a deep convolutional neural network designed to predict molecular phenotypes from DNA sequences. Unlike traditional models that rely solely on one-hot encoded DNA sequences, DeepShape integrates DNA structural attributes indicative of local shape: minor groove width (MGW), helical twist (HelT), propeller twist (ProT), roll, and electrostatic potential (EP). This combination enhances the interpretability of the model and helps identify regulatory patterns that are not apparent from sequence information alone.
DeepShape is built upon DeeperDeepSEA, a PyTorch-based deep learning model originally designed to predict chromatin features from DNA sequence alone as implemented in Selene.
https://github.com/ni-lab/DeepShape
License
DeepShape is licensed under the MIT License. Portions of this software are derived from Selene, which is licensed under the Clear BSD License.
See the LICENSE file for full details.
Evaluation results
- AUROC (TF) on ENCODE + Roadmap Epigenomics (919 targets)self-reported0.948
- AUPRC (TF) on ENCODE + Roadmap Epigenomics (919 targets)self-reported0.353
- AUROC (DNase) on ENCODE + Roadmap Epigenomics (919 targets)self-reported0.907
- AUPRC (DNase) on ENCODE + Roadmap Epigenomics (919 targets)self-reported0.472
- AUROC (Histone) on ENCODE + Roadmap Epigenomics (919 targets)self-reported0.850
- AUPRC (Histone) on ENCODE + Roadmap Epigenomics (919 targets)self-reported0.367