
Numerical characterization of DNA sequences based on multisets of pseudo amino acids and its applications

  

  1. (College of Mathematics and Physics, Bohai University, Jinzhou 121013, China)
  • Online: 2015-07-25 Published: 2015-08-03

Abstract: Based on the mapping from codons to amino acids and the stop signal, a pseudo amino acid sequence is defined for a DNA sequence. Using the multiset of this pseudo amino acid sequence, a 21-dimensional numerical vector is then constructed for each DNA sequence. From these vectors, the similarity distance between any two DNA sequences can be calculated. Phylogenetic analyses of three datasets (the S segment of hantaviruses, complete genome sequences of Tomato yellow leaf curl virus, and complete genome sequences of human rhinovirus) demonstrate the effectiveness of the proposed method.

Key words: DNA, pseudo amino acid, multiset, phylogenetic analysis
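
The pipeline summarized in the abstract (codons to pseudo amino acids, multiset to 21-dimensional vector, vector to distance) can be illustrated with a minimal Python sketch. The abstract does not give the exact construction, so several choices here are assumptions rather than the authors' method: the standard genetic code, reading from the first base in non-overlapping triplets, skipping codons with ambiguous bases, using normalized multiplicities as vector entries, and using the Euclidean distance. The function names are hypothetical.

```python
from collections import Counter
from itertools import product
import math

BASES = "TCAG"
# Standard genetic code listed in TCAG order; '*' marks the stop signal.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# 20 amino acids plus the stop signal give 21 symbols.
SYMBOLS = sorted(set(AMINO))


def pseudo_amino_acids(dna):
    """Translate non-overlapping codons, starting at the first base (assumption).

    Codons containing characters other than A, C, G, T are skipped (assumption).
    """
    dna = dna.upper()
    out = []
    for i in range(0, len(dna) - 2, 3):
        symbol = CODON_TABLE.get(dna[i:i + 3])
        if symbol is not None:
            out.append(symbol)
    return "".join(out)


def multiset_vector(dna):
    """Return a 21-dimensional vector of normalized symbol multiplicities.

    Normalizing by sequence length is an assumption of this sketch.
    """
    counts = Counter(pseudo_amino_acids(dna))
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(SYMBOLS)
    return [counts[s] / total for s in SYMBOLS]


def similarity_distance(dna_a, dna_b):
    """Euclidean distance between the two vectors (assumed metric)."""
    va, vb = multiset_vector(dna_a), multiset_vector(dna_b)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(va, vb)))


if __name__ == "__main__":
    print(pseudo_amino_acids("ATGGCCTAA"))  # MA*
    print(len(multiset_vector("ATGGCCTAA")))  # 21
    print(similarity_distance("ATGGCCTAA", "ATGGCGTGA"))
```

A pairwise distance matrix built this way could then be passed to a standard tree-building method for phylogenetic analysis; the abstract does not state which method the authors used.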