Repository landing page
Illustrations of the TR-TS gaps of the SeqFold2D-1.4M model.
Abstract
(A) Stral-NR100 as TR and Archi-NR100 as TS. (B) Stral-NR80 as TR and Archi-Stral-NR80 as TS. The first pair of violins shows the F1 scores for the entire TR (left, tan) and TS (right, blue) set and the following pairs show the scores for each RNA family. Averaged scores are shown as dashed lines (white) and at the very top. The parentheses above show the sequence counts in numbers (for the entire set or families with 1% share). The families existing in one set only are shown as “nan” for the other set, e.g., 23S rRNA in Archi-NR100 only.</p- Image
- Figure
- Biochemistry
- Genetics
- Molecular Biology
- Infectious Diseases
- Virology
- Biological Sciences not elsewhere classified
- Information Systems not elsewhere classified
- revealed collectively across
- outperform existing methods
- div >< p
- de novo </
- common benchmark datasets
- statistical underpinning raises
- several recent dl
- sequence similarity decreases
- generalizability thus poses
- machine learning models
- deep learning models
- statistical learning
- deep learning
- dl models
- based models
- various pathways
- varied similarities
- unseen sets
- unseen sequences
- traditional algorithms
- ranging architectures
- quantitative study
- physical laws
- module architecture
- model generalizability
- minimal two
- major hurdle
- inverse correlation
- generalizability depends
- future advances
- evolutionary information
- e .,
- degrades rapidly
- crucial question