Keywords and Synonyms
Directed binary character compatibility
Problem Definition
Let \( { S = \{s_1,s_2,\dots,s_n\} } \) be a set of elements called objects, and let \( { C = \{c_1,c_2,\dots,c_m\} } \) be a set of functions from S to \( { \{0,1\} } \) called characters. For each object \( { s_i \in S } \) and character \( { c_j \in C } \), it is said that s i has c j if \( { c_j(s_i) = 1 } \) or that s i does not have c j if \( { c_j(s_i) = 0 } \), respectively (in this sense, characters are binary). Then the set S and its relation to C can be naturally represented by a matrix M of size \( { (n \times m) } \) satisfying \( { M[i,j] = c_j(s_i) } \) for every \( { i \in \{1,2,\dots,n\} } \) and \( { j \in \{1,2,\dots,m\} } \). Such a matrix M is called a binary character state matrix.
Next, for each \( s_i \in S \), define the set \( C_{s_i} = \{c_j \in C \colon s_i {\text{\ has\ }} c_j\} \). A phylogeny for S is a tree whose leaves are bijectively labeled by S, and a directed perfect...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Note that Theorem 4 does not contradict Theorem 3; in fact, Gusfield's lower bound argument considers an input matrix consisting mostly of 1's.
- 2.
When this requirement is too strict, one can relax it to permit errors; for example, let characters be associated with more than one edge in the phylogeny (i. e., allow each character to emerge many times) but minimize the total number of associations (Camin–Sokal optimization), or keep the requirement that each character emerges only once but allow it to be lost multiple times (Dollo parsimony) [4,5]
Recommended Reading
Agarwala, R., Fernández-Baca, D., Slutzki, G.: Fast algorithms for inferring evolutionary trees. J. Comput. Biol. 2, 397–407 (1995)
Estabrook, G.F., Johnson, C.S., Jr., McMorris, F.R.: An algebraic analysis of cladistic characters. Discret. Math. 16, 141–147 (1976)
Estabrook, G.F., Johnson, C.S., Jr., McMorris, F.R.: A mathematical foundation for the analysis of cladistic character compatibility. Math. Biosci. 29, 181–187 (1976)
Felsenstein, J.: Inferring Phylogenies. Sinauer Associates, Inc., Sunderland (2004)
Fernández-Baca, D.: The Perfect Phylogeny Problem. In: Cheng, X., Du, D.-Z. (eds.) Steiner Trees in Industry, pp. 203–234. Kluwer Academic Publishers, Dordrecht (2001)
Gusfield, D.M.: Efficient algorithms for inferring evolutionary trees. Networks 21, 19–28 (1991)
Gusfield, D.M.: Algorithms on Strings, Trees, and Sequences. Cambridge University Press, New York (1997)
Hanisch, D., Zimmer, R., Lengauer, T.: ProML – the Protein Markup Language for specification of protein sequences, structures and families. In: Silico Biol. 2, 0029 (2002). http://www.bioinfo.de/isb/2002/02/0029/
Kanj, I.A., Nakhleh, L., Xia, G.: Reconstructing evolution of natural languages: Complexity and parametrized algorithms. In: Proceedings of the 12th Annual International Computing and Combinatorics Conference (COCOON 2006). Lecture Notes in Computer Science, vol. 4112, pp. 299–308. Springer, Berlin (2006)
McMorris, F.R.: On the compatibility of binary qualitative taxonomic characters. Bull. Math. Biol. 39, 133–138 (1977)
Setubal, J.C., Meidanis, J.: Introduction to Computational Molecular Biology. PWS Publishing Company, Boston (1997)
Acknowledgments
Supported in part by Kyushu University, JSPS (Japan Society for the Promotion of Science), and INRIA Lille - Nord Europe.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag
About this entry
Cite this entry
Jansson, J. (2008). Directed Perfect Phylogeny (Binary Characters). In: Kao, MY. (eds) Encyclopedia of Algorithms. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-30162-4_112
Download citation
DOI: https://doi.org/10.1007/978-0-387-30162-4_112
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-30770-1
Online ISBN: 978-0-387-30162-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering