A systematic computational analysis of protein sequences containing known nuclear domains led to the identification of 28 novel domain families. This represents a 26% increase in the starting set of 107 known nuclear domain families used for the analysis. Most of the novel domains are present in all major eukaryotic lineages, but 3 are species specific. For about 500 of the 1200 proteins that contain these new domains, nuclear localization could be inferred, and for 700, additional features could be predicted. For example, we identified a new domain, likely to have a role downstream of the unfolded protein response; a nematode-specific signalling domain; and a widespread domain, likely to be a noncatalytic homolog of ubiquitin-conjugating enzymes.

Type

Journal article

Journal

Genome Res

Publication Date

01/2002

Volume

12

Pages

47 - 56

Keywords

Amidohydrolases; Amino Acid Motifs; Amino Acid Sequence; Animals; Caenorhabditis elegans; Caenorhabditis elegans Proteins; Cell Nucleus; Databases, Protein; Humans; Molecular Sequence Data; Multigene Family; Nuclear Proteins; Peptide-N4-(N-acetyl-beta-glucosaminyl) Asparagine Amidase; Phylogeny; Protein Structure, Tertiary; Species Specificity