UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the
khairi abidi
khairi
AI & ML interests
Language Modeling, Protein Language Modeling, Protein Annotation
Recent Activity
updated a dataset about 2 hours ago: khairi/Uniref50-Protein-Instructions
updated a dataset about 2 hours ago: khairi/Uniref50-500K-Protein-Instructions
published a dataset about 2 hours ago: khairi/Uniref50-500K-Protein-Instructions