17 research outputs found
New distance measure for comparing protein using cellular automata image.
One of the first steps in protein sequence analysis is comparing sequences to look for similarities. We propose an information theoretical distance to compare cellular automata representing protein sequences, and determine similarities. Our approach relies in a stationary Hamming distance for the evolution of the automata according to a properly chosen rule, and to build a pairwise similarity matrix and determine common ancestors among different species in a simpler and less computationally demanding computer codes when compared to other methods
Beta-globin protein dendrograms from SHD (a) and p-distance (b) values.
Clade A is represented by squares and Clade B by circles. Different animal groups are represented by colors.</p
Dendrograms from the ND 5 protein from (a) SHD and (b) p-distance.
The four families are well grouped in both dendrograms: Didelphidae (blue), Muridae (green), Balaenopteridae (red), and Hominidae (black).</p
Encoding of amino acids, deletions and missing protein sequence data after alignment.
Code based on molecular structure of amino acid side chains by Chaudhuri et al. [18].</p
Dendrograms from the ND 6 protein obtained from distance matrices using (a) SHD and (b) p-distance, with the indication of family groupings: Macropodidae (blue), Muridae (green), Phocidae (red), and Hominidae (black).
Dendrograms from the ND 6 protein obtained from distance matrices using (a) SHD and (b) p-distance, with the indication of family groupings: Macropodidae (blue), Muridae (green), Phocidae (red), and Hominidae (black).</p
Hamming distance between the cellular automata image for some different mammalian species and Human, as a function of the number of steps.
Hamming distance between the cellular automata image for some different mammalian species and Human, as a function of the number of steps.</p
Transferrin protein dendrograms from (a) SHD and (b) p-distance distance matrices.
Transferrin protein dendrograms from (a) SHD and (b) p-distance distance matrices.</p