iBet uBet web content aggregator. Adding the entire web to your favor.

Author Notes

Abstract

One of the most startling discoveries in avian molecular phylogenetics is that the volant tinamous are embedded in the flightless ratites, but this topology remains controversial because recent morphological phylogenies place tinamous as the closest relative of a monophyletic ratite clade. Here, we integrate new phylogenomic sequences from 1,448 nuclear DNA loci totaling almost 1 million bp from the extinct little bush moa, Chilean tinamou, and emu with available sequences from ostrich, elegant crested tinamou, four neognaths, and the green anole. Phylogenetic analysis using standard homogeneous models and heterogeneous models robust to common topological artifacts recovered compelling support for ratite paraphyly with the little bush moa closest to tinamous within ratites. Ratite paraphyly was further corroborated by eight independent CR1 retroposon insertions. Analysis of morphological characters reinterpreted on a 27-gene paleognath topology indicates that many characters are convergent in the ratites, probably as the result of adaptation to a cursorial life style.

phylogenomics, ratite paraphyly, CR1 retroposon insertions, little bush moa, ancient DNA, morphological convergence

Introduction

Among the most contentious issues in resolving the avian Tree of Life is the phylogenetic relationships within the Paleognathae, an ancient clade that diverged from the remaining crown-group Neognathae at the base of the avian tree (Cracraft et al. 2004; Hackett et al. 2008; Mayr 2011). Paleognaths share a paleognathous bony palate, open ilioschiatic fenestra, and rhampothecal grooves (Cracraft 1974), and include the volant tinamous of Central and South America and the large flightless ratites, which are distributed across the southern landmasses except Antarctica. Ratites lack a keeled sternum, and comprise the extant African ostrich, South American rheas, and the Australasian emu, cassowaries and kiwi, in addition to the recently extinct Madagascan elephant birds and New Zealand moa. Within this clade of charismatic species, perhaps no group has captured the imagination more than the wingless moa (Aves: Dinornithiformes) of New Zealand following the first description of a moa femur by the renowned anatomist Sir Richard Owen (Owen 1839). The relatively recent extinction of the nine species of moa following the Polynesian colonization of New Zealand in 13th century, along with a wealth of discovered moa remains, has enabled studies not only of their morphology and paleontology, but also the use of ancient DNA (aDNA) to elucidate aspects of moa evolutionary biology and life history (Allentoft and Rawlence 2012).

Morphological and molecular systematists currently agree that paleognaths constitute a monophyletic clade, but phylogenetic relationships among paleognaths are disputed (fig. 1). Most recent morphological studies support ratite monophyly, with tinamous as the closest relative of the ratites (Livezey and Zusi 2007; Bourdon et al. 2009; Worthy and Scofield 2012). Conversely, analysis of characters of the tongue apparatus and cranial osteology supports ratite paraphyly (but not a moa–tinamou clade), and suggests that these characters might better retain phylogenetic signal, whereas postcranial skeletal characters might exhibit homoplasy due to convergence (Bock and Bühler 1990; Elzanowski 1995; Johnston 2011). Most earlier molecular studies of mitochondrial DNA also supported ratite monophyly (e.g., Haddrath and Baker 2001), but there were exceptions (e.g., Braun and Kimball 2002). However, studies using nuclear DNA, but lacking sequences from moa, recovered a paraphyletic clade that nested tinamous within the ratites and placed the ostrich as the closest relative of all other paleognaths (Hackett et al. 2008; Harshman et al. 2008; [N.B. We use the term “paraphyly” to refer to a nonmonophyletic ratite clade that includes some, but not all, descendants of the most recent common ancestor of ratites and tinamous. Harshman et al. (2008) instead characterized the ratites as “polyphyletic” because it is inferred that their uniting characters (e.g., a keel-less sternum) arose via convergence]). This novel topology was supported by a reanalysis of mitochondrial genomes that accounted for a base compositional bias at third codon positions (Phillips et al. 2010), and recovered a highly supported moa–tinamou clade (fig. 1). Most recently, separate analyses of 27 nuclear loci including moa sequences (Haddrath and Baker 2012), and 40 nuclear loci that lacked moa sequences (Smith et al. 2013), both recovered tinamous nested within the ratites. Resolving paleognath relationships is essential to investigate the discord between morphological and molecular topologies, and for interpretations of phenotypic evolution and biogeography within this group. Ratite monophyly is consistent with evolution from a flightless common ancestor and a predominantly vicariant mode of speciation following the breakup of Gondwana, whereas ratite paraphyly requires that either tinamous regained flight from a flightless paleognath ancestor or, perhaps more likely, that multiple independent losses of flight have occurred across the ratites (Bock and Bühler 1990; Harshman et al. 2008; Phillips et al. 2010; Haddrath and Baker 2012; Smith et al. 2013).

Fig. 1.

Competing hypotheses of phylogenetic relationships in the Paleognathae. Ratite monophyly posits evolution from a flightless common ancestor. The ratite paraphyly hypothesis that places the volant tinamous as the closest relative of moa (Phillips et al. 2010; Haddrath and Baker 2012; Smith et al. 2013) instead posits either that ratites and tinamous evolved from a volant common ancestor and that flight has been lost multiple times in the ratites or that tinamous have regained flight. Bird drawings are modified from del Hoyo et al. (1992) with permission from Lynx Edicions, Barcelona, Spain.

Open in new tab Download slide

High-throughput sequencing (HTS) can produce large-scale phylogenomic data sets that reflect genome-wide phylogenetic signal, and can also maximize the amount of nuclear sequence recovered from endogenous aDNA, which is typically degraded into short fragments (Millar et al. 2008; Rizzi et al. 2012; Huynen et al. 2012). A major advantage to using DNA sequences to test competing phylogenetic hypotheses is the large number of phylogenetically informative characters they provide, which are key to estimating species trees under concatenation or jointly with gene trees. Discordance between gene histories and species histories can arise when incomplete lineage sorting (ILS) occurs across short internal branches, even in deep time (Edwards et al. 2005; Degnan and Rosenberg 2009; Oliver 2013). Simulations using the multispecies coalescent suggest that phylogenomic discordance can potentially mislead estimation of evolutionary relationships of taxa under concatenation because stochastic population processes are not taken into account (Oliver 2013). Conversely, concatenation is expected to yield a good estimate of species histories when ILS is rare or absent, as the most common gene tree topology should reflect the true history of species. Short internal branches associated with the diversification of neoavian orders deep in the Avian Tree of Life suggest that phylogenomic discordance arising through ILS could underpin the substantial difficulties encountered in reconstructing some higher-level avian relationships (Poe and Chubb 2004; Edwards et al. 2005; Hackett et al. 2008; Matzke et al. 2012; Oliver 2013). ILS could prove similarly problematic within the Paleognathae, where the order of deep branches is very difficult to resolve (Haddrath and Baker 2012). Therefore, a much larger number of gene histories needs to be sampled in the aDNA genome of an extinct moa and their putative closest relatives, the tinamous, to more stringently test whether tinamous are truly embedded within the ratites or whether this relationship is an artifact resulting from ILS. However, with large phylogenomic data sets, it is also imperative to ensure that systematic error from heterogeneity in base composition and exchange rates, as well as mutational saturation and long-branch attraction, do not overwhelm phylogenetic signal to produce incorrect, yet highly supported, topologies (Jeffroy et al. 2006; Philippe et al. 2011; Telford and Copley 2011).

Here, we combine high-throughput sequences from aDNA of the extinct little bush moa (Anomalopteryx didiformis), and DNA of the extant Chilean tinamou (Nothoprocta perdicaria) and emu (Dromaius novaehollandiae), with available phylogenomic sequences from two additional paleognaths as well as four representative neognath taxa and the green anole to serve as outgroups. We assembled the largest phylogenomic data set to date comprised of 854 ultraconserved elements (UCEs) and 594 single-copy protein-coding genes to more thoroughly test competing hypotheses of ratite paraphyly versus monophyly. Phylogenies constructed with Bayesian concatenation and maximum pseudolikelihood species tree approaches were checked by fitting heterogeneous models of evolution which are robust to most systematic biases (Lartillot and Philippe 2004; Foster 2004; Foster et al. 2009). Independent information was obtained from retroposon insertions to corroborate inferences drawn from sequence-based analyses. To investigate why molecular and morphological trees are at such odds in supporting different hypotheses of paleognath relationships, we reinterpreted morphological characters on the molecular topology to determine whether concerted convergence in morphology might have occurred.

Results

Support for Ratite Paraphyly from UCEs and Protein-Coding Genes

The total data set comprises 1,133,474 bp of aligned nuclear DNA. UCEs total 337,982 bp of sequence from 854 loci (supplementary data S1, Supplementary Material online), and protein-coding genes total 795,492 bp of sequence from 594 loci (supplementary data S2, Supplementary Material online). Alignment data are available from the Dryad Digital Repository: doi:10.5061/dryad.9g3cm. Notably, an HTS approach recovered sequence for 841 of 854 UCEs and 593 of 594 protein-coding genes from aDNA of the little bush moa. Moa sequence totaled 901,554 bp across all loci, or 79.5% of sites across total alignment lengths, with a mean depth of coverage of 10.85 ± 0.0008 SE and mean quality score of 36.46 ± 0.0024 SE.

Phylogenetic analyses of both UCEs and protein-coding genes under either concatenation in MrBayes or with the gene tree/species tree approach implemented in MP-EST recovered identical topologies, with the exception that only UCE sequences were available for the ostrich and elegant crested tinamou (fig. 2). In addition to the expected relationships for the four neognath representatives, this topology supports ratite paraphyly by embedding the tinamous within the ratites in a highly supported clade with the little bush moa. Successive nodes to this embedded moa + tinamou clade are formed with the emu, followed by the ostrich. Maximum support is observed at all nodes except that separating emu from (moa + tinamou) in the maximum pseudolikelihood species tree for UCEs, where multiscale bootstrap support falls to 95%.

Fig. 2.

Phylogenetic analyses of 854 UCE and 594 protein-coding gene data sets in MrBayes and MP-EST. Bayesian posterior probabilities are shown (below nodes) along with the MP-EST bootstrap percentages based on 1,000 replicates (above nodes).

Open in new tab Download slide

Graphical exploration of observed transitions and transversions plotted against corrected genetic distance shows linear trends for UCEs and for first and second codon positions of protein-coding genes, indicating that they are not saturated with superimposed substitutions (supplementary fig. S1, Supplementary Material online). This conclusion is supported by Xia et al.’s (2003) test, where the entropy-based indices of substitution saturation (Iss) estimated for UCEs and first and second codon positions are all significantly smaller than their respective estimated critical values regardless of the symmetry of the tree (results not shown). Plots of transitions and transversions for third codon positions exhibit a slight deviation from linearity in comparisons between the ingroup and outgroup taxa suggesting that multiple substitutions have occurred between more distantly related taxa at these sites (supplementary fig. S1, Supplementary Material online). However, the Iss estimate for third codon positions is 0.3750, which is significantly less than the critical value of 0.8577 for a symmetrical tree (t_[_d_f_{= 129,112}_] = 442.6, P_[_two_-tailed_] < 0.00001), and the critical value of 0.7837 for an extreme asymmetrical tree (t_[_d_f_{= 129,112}_] = 334.5, P_[_two_-tailed] < 0.00001) and indicates that these sites have not experienced substitutional saturation to a degree that would obscure the underlying phylogenetic signal.

Compositional χ² tests using posterior predictive simulations showed statistically significant compositional nonstationarity for both UCEs and protein-coding genes (supplementary table S4, Supplementary Material online; both P < 0.000001). However, observed differences in mean GC content among species were associated with small to moderate effect sizes (supplementary tables S5–S7, Supplementary Material online; maximum effect size [Cohen’s d] of 0.21 for UCEs and 0.43 for protein-coding genes). Nevertheless, phylogenies were built with heterogeneous models in P4 (Foster et al. 2009) and PhyloBayes (Lartillot and Philippe 2004; Lartillot et al. 2013) to confirm that the topologies recovered from homogeneous models in MrBayes and MP-EST did not simply reflect systematic biases in compositional or exchange rate nonstationarity. For UCEs, the best P4 heterogeneous model incorporated seven rate matrices and six compositional matrices (supplementary data S3, Supplementary Material online; model 7GTR_6C), with Bayes factor (BF) comparisons providing evidence for a strong improvement in model fit relative to the homogeneous 1GTR_1C model (2ln BF = 1,002.5). The CAT–GTR model in PhyloBayes further improved model fit, with 2ln BF = 21,355.1 relative to the best 7GTR_6C model in P4. PhyloBayes chains failed to stabilize for amino acid translations of protein-coding genes; thus, results are shown for P4 only. The best P4 model, with a single JTT rate matrix and six compositional matrices (1JTT_6C), shows strong evidence for improved model fit relative to the homogeneous 1JTT_1C model (2ln BF = 2,038.5).

There was no longer a significant departure (at α = 0.05) from the null distribution of χ² values generated from posterior predictive simulations with the addition of at least two compositional matrices for UCEs and three compositional matrices for protein-coding genes. All topologies built with heterogeneous models for both UCEs and protein-coding genes were identical to those constructed with homogeneous models. Maximal node support occurred throughout, except at the node separating the zebra finch and budgie in the PhyloBayes tree for UCEs where posterior probability (PP) was 0.77 (supplementary data S3, Supplementary Material online).

Corroboration of a Moa–Tinamou Clade from CR1 Retroposon Insertions

In silico bioinformatic approaches identified 17 candidate retroposon insertions that were shared by the Chilean tinamou and little bush moa, but were absent from the emu. Experimental validation from polymerase chain reaction (PCR) amplification and sequencing across a wider sampling of paleognaths provided unequivocal evidence for eight CR1 insertions that support a moa–tinamou node (fig. 3). All insertions were shared by additional tinamou species, but were absent from all non-moa ratites (supplementary table S3, Supplementary Material online; alignments are deposited in the Dryad Digital Repository: doi:10.5061/dryad.9g3cm). These eight loci had orthologous flanking sequences, with CR1 insertions of the same size, subfamily, orientation, and degree of truncation, in addition to recognizable target site duplications (TSDs), indicating they are homologous insertion events. The remaining nine insertion events could not be unambiguously interpreted due to missing sequences, deletions, and/or unrecognizable TSDs; however, no information contradicting ratite paraphyly was observed (supplementary table S3, Supplementary Material online).

Fig. 3.

Eight retroelement insertions shared by little bush moa and four tinamou species identified in the current study (light gray sphere with black numbering), mapped onto a published paleognath topology constructed from 27 nuclear genes and 21 retroelements (Haddrath and Baker 2012; published CR1 insertions are indicated by dark gray spheres with white numbering). Scale bar indicates time in millions of years. Asterisks mark species included in comparisons to morphological data. Bird drawings are modified from del Hoyo et al. (1992) with permission from Lynx Edicions, Barcelona, Spain.

Open in new tab Download slide

Searches to identify CR1 insertions supporting the two alternative hypotheses of relationships among moa, emu, and Chilean tinamou identified no insertions that were shared by the emu and tinamou, but were absent from moa. However, we identified one insertion (locus 7292) that was shared by emu and moa, but absent from the Chilean tinamou. Experimental validation confirmed the presence of this CR1 insertion in all investigated ratites, but its absence in the great tinamou and the Chilean tinamou (supplementary table S3, Supplementary Material online; Dryad Digital Repository: doi:10.5061/dryad.9g3cm). The probability that the observed pattern of CR1 insertions (e.g., [8 1 0]) arose by chance is P = 0.0005 (Waddell et al. 2001), providing strong support for the moa–tinamou clade.

Interpreting Morphological Characters on the Molecular Tree

The morphological characters described by Worthy and Scofield (2012) were clustered into cliques of mutually compatible characters relative to a molecular topology constructed from a 27-gene data set for eight paleognaths and an outgroup taxon (fig. 4A, part d; also see supplementary fig. S2, Supplementary Material online, for the 27-gene topology, and supplementary data S4, Supplementary Material online, for additional information regarding morphological characters). The largest clique (clique 1) groups 35 characters, or 27% of all morphological characters included in the analysis, and encompasses three predominant phylogenetic signals. In addition to 14 of 18 characters that support uncontested shallower nodes that are common to the morphological and molecular topologies (i.e., grouping emu and cassowary, or greater and lesser rhea; shown as green boxes in fig. 4A, part a), clique 1 encompasses all characters that support an inferred homoplasious grouping of moa with kiwi (n = 7, shown as gold links in fig. 4A, part f), and 11 of 14 characters where the tinamou and outgroup share a character state that differs from the ratites, in direct refutation of ratite paraphyly (drawn as blue links). Nodes that are unique to the molecular tree are supported by eight morphological characters (red boxes in fig. 4A, part b). Five of these characters unite moa and tinamou; of these, four (characters 26, 39, 45, and 114) co-occur in clique 4, whereas the fifth (character 96) is positioned within the closest relative of this clique. Inspection of character states across all 26 species scored for morphology by Worthy and Scofield (2012) revealed that two of these characters (96 and 114) would also group the elephant birds with moa and tinamou. Although this result is not inconsistent with the molecular topology, we cannot evaluate the potential for homoplasious signal within this larger clade.

Fig. 4.

Analysis of morphological characters. (A) Cliques of compatible morphological characters from UPGMA clustering of pairwise dissimilarity scores (d), with numbers assigned to cliques with n > 5 constituent members (e). Outer orbitals show characters that provide unambiguous support for nodes shared by the morphological and molecular phylogenies [(emu, cassowary); (rheas)] (a), for nodes that are unique to the molecular phylogeny [(moa, tinamou); (emu, cassowary, kiwi); (emu, cassowary, kiwi, rheas)] (b), or for nodes unique to the morphological phylogeny [(ratite monophyly); (moa, kiwi); (ostrich, rhea); (emu, cassowary, ostrich, rhea)] (c). Inner circle (f) depicts inferred homoplasies from plotting each morphological character on the 27-gene molecular topology. Outer ideograms in part (f) echo the position of each clique, and inner links join characters with the same inferred homoplasious signal (drawn in Circos [Krzywinski et al. 2009]). Characters grouping tinamou with the outgroup are drawn in blue, those placing kiwi with moa are in gold, ostrich with rheas are in teal, and (emu, cassowary, ostrich, rhea)/(kiwi, moa, tinamou, outgroup) are shown in red. Other inferred homoplasious groupings with at least two constituent members are drawn in light gray. (B) Proportion of characters from each anatomical category in the full morphological data set, in the seven largest cliques, and in support of ratite monophyly (grouping tinamou + outgroup) versus paraphyly (grouping moa + tinamou).

Open in new tab Download slide

Each of the seven largest cliques contains characters from across body compartments (fig. 4B). There is limited support that cranial characters are better able to recover nodes that are unique to the molecular topology. The three morphological characters that unambiguously support a moa–tinamou node are cranial characters, versus only 2 of 14 characters that directly refute paraphyly (Fisher’s exact test, P₍_two_-tailed) = 0.01). Cliques 2 and 4 combined, with characters that support nodes found only in the molecular tree, are also more heavily weighted toward cranial characters relative to the remaining five largest cliques, which contain no support for nodes found only on the molecular tree (fig. 4B; cliques 2+4: 13 skull vs. 11 nonskull characters; cliques 1+3+5+6+7: 25 skull vs. 56 nonskull characters; χ²_(d_f_{= 1)} = 4.35, P = 0.037). However, only 8 of the 24 characters (33%) that support any node in the molecular tree (including uncontested nodes shared with the morphological topology, green + red boxes shown in fig. 4A, parts a and b, respectively) are cranial characters, which is almost identical to the proportion of cranial characters in the total data set (42 of 129 characters, or 32.6%). Furthermore, the proportion of characters that are free from any inferred homoplasies does not differ for cranial and postcranial groupings (7 of 42 cranial characters vs. 13 of 87 postcranial; χ²_(d_f_{= 1}₎ = 0.007, P = 0.917).

Discussion

Using the first large phylogenomic nuclear data set for paleognaths to include aDNA sequences from an extinct moa from New Zealand, we recover compelling support for the ratite paraphyly hypothesis originally proposed by Bock and Bühler (1990), and supported by molecular studies (Harshman et al. 2008; Phillips et al. 2010; Haddrath and Baker 2012; Smith et al. 2013). Congruent topologies that place the little bush moa as the closest relative of tinamous, nested within the ratites, are obtained from large, independent data sets of two different types of nuclear loci (UCEs and protein-coding genes), and are supported by both Bayesian concatenation and maximum pseudolikelihood species tree analytical approaches. Systematic biases that can affect tree topology, such as compositional and exchange rate heterogeneity or long-branch attraction, can likely be ruled out. Phylogenetic analyses with heterogeneous models of evolution (Lartillot and Philippe 2004; Foster 2004; Foster et al. 2009) more robust to these artifacts recovered the same tree topology with strong support. Long-branch attraction could potentially occur if sequences are saturated by superimposed parallel substitutions; however, we demonstrated that sufficient phylogenetic signal is retained in the analyzed data sets. Furthermore, translation of protein-coding genes and use of the site-heterogeneous CAT mixture model for UCEs, which is believed to be robust to long-branch attraction (Lartillot et al. 2007), should also safeguard against erroneous inference caused by mutational saturation.

Independent corroboration from eight retroposon insertions also resolutely supports moa as the closest relative of the tinamous, and five previously published insertions (Haddrath and Baker 2012) further support ratite paraphyly. These results agree with previous phylogenetic analyses of smaller data sets that included ancient mtDNA or nuclear DNA from moa that also recovered a moa–tinamou clade (Phillips et al. 2010; Haddrath and Baker 2012), and with nuclear gene phylogenies (Harshman et al. 2008; Smith et al. 2013) that lacked moa sequence but nevertheless recovered tinamous embedded in the ratites. We therefore reject the traditional hypothesis of ratite monophyly in which tinamous are placed as the closest relative of the ratites.

In addition to the much more powerful test of the ratite paraphyly hypothesis afforded by the large number of gene histories recovered from HTS of the little bush moa, emu, and Chilean tinamou, we gained strong genomic evidence against a role for ILS as an explanation for the observed relationship between moa and tinamous. ILS would result in gene tree heterogeneity and should also produce conflicting patterns in observed retroposon insertions (Edwards et al. 2005; Matzke et al. 2012). Although the single conflicting retroposon insertion found in all the ratite taxa we sampled but absent in two tinamou species could represent an instance of ILS, it is probably instead a secondary loss that occurred by near-perfect deletion of the CR1 element in the tinamou common ancestor (refer to Dryad Digital Repository: doi:10.5061/dryad.9g3cm retroelement locus 7292 for a multiple sequence alignment showing the boundaries of tinamou flanking sequence relative to the retroelement insertion observed in ratites). Although transposable element insertions are viewed as virtually homoplasy free, rare instances of parallel insertions and precise deletions are documented (e.g., Ray et al. 2006; Han et al. 2011) and underscore the utility of genome-level approaches to more rigorously evaluate support for alternative phylogenetic hypotheses. We caution that it was not possible to search for additional retroposon insertions that might occur in other ratite genomes not sampled here, and which could be affected by ILS across very short internal branches separating the major lineages of non-ostrich paleognaths (e.g., see fig. 3 and Haddrath and Baker 2012). However, the branch connecting the moa–tinamou clade to other ratites is reasonably long. Thus, the likelihood of ILS is expected to be very low at this node unless ancestral population size was very large because there would have been approximately 10–15 Myr (Haddrath and Baker 2012) for genetic drift to fix alternative alleles in either species. The next step to resolve phylogenetic relationships of major lineages across the paleognath tree requires the construction of similarly large phylogenomic data sets for other taxa not sampled here. Heterogeneity in gene tree topologies is expected to pose significant challenges in resolving some deep divergences in the avian tree (Poe and Chubb 2004; Edwards et al. 2005; Haddrath and Baker 2012), and a large number of gene histories and rare genomic events might therefore be required to reconstruct phylogenetic relationships among major paleognath lineages.

The robust support for ratite paraphyly from phylogenomic data contrasts with several recent morphological topologies that place tinamous as the closest relative of a monophyletic ratite clade (Livezey and Zusi 2007; Bourdon et al. 2009; Worthy and Scofield 2012), and suggests that multiple independent losses of flight have likely occurred within the ratites (Bock and Bühler 1990; Harshman et al. 2008; Phillips et al. 2010; Smith et al. 2013). Flightlessness, which is documented across 15 avian orders, is commonly associated with extensive modifications to the skeleto-muscular system, with paedomorphic reduction of the pectoral apparatus and peramorphosis leading to an enlarged or more robust pelvic girdle and hind limbs (Livezey 1995; Cubo and Arthur 2001). Our reinterpretation of morphological data identified several characters that are consistent with the molecular topology, especially at shallower nodes that are shared with the morphological tree. However, we also identified many inferred homoplasies that are likely to have arisen through convergence associated with independent losses of flight and adaptation to a cursorial lifestyle within ratites. We note that a full interpretation of signal present in the morphological data set was not possible due to the large number of extinct forms surveyed (11 of 23 ingroup taxa). However, comparisons to the 27-gene molecular topology did include at least one representative from all extant paleognath lineages, in addition to the extinct little bush moa.

Beyond inferring homoplasies at individual characters, the morphological data set is clustered into cliques of mutually compatible characters that could reflect concerted convergence across suites of characters in response to shared selective pressures or developmental processes (Holland et al. 2010; Blanke et al. 2013). Similar to a previous analysis in cormorants and shags (Holland et al. 2010), cliques encompass characters from across body compartments including the skull, and direct functional links are not necessarily apparent among clique members. Cranial and tongue characters, which might be less affected by convergence due to the modularity of cranial evolution relative to postcranial structures (Bhullar et al. 2012), have previously recovered ratite paraphyly, although not a moa–tinamou clade (Bock and Bühler 1990; Johnston 2011). However, Cubo and Arthur (2001) found significant associations between peramorphosis of the pelvic apparatus and of the skull among lineages of flightless birds. The loss of flight might lessen selective pressures for a lightweight skull and associated musculature, and a concomitant increase in muscle insertion areas (Gussekloo and Cubo 2013). We found limited support that cranial characters can better recover nodes consistent with the molecular topology, but did not find that cranial characters in general are less affected by inferred homoplasies. These observations suggest that the use of skull characters alone might not offer a panacea to resolve the ongoing disconnect between paleognath phylogenies estimated from molecular and morphological data. However, investigating the underlying basis for shared phylogenetic signals within character cliques could lead to a better understanding of the functional and ontogenetic interrelationships of their constituent members.

Materials and Methods

Sample Information

Source information for all specimens associated with new sequence data is provided in supplementary table S1, Supplementary Material online.

HTS of Paleognaths

Phylogenomic sequence data were retrieved from draft genome assemblies built from 454 and Illumina paired-end reads for the emu (D. novaehollandiae), and Illumina paired-end reads for the Chilean tinamou (N. perdicaria; Baker AJ, unpublished data). Additionally, unassembled 101 bp paired-end reads were obtained from two lanes of an Illumina HiSeq 2500 platform for the extinct little bush moa (A. didiformis).

Ultraconserved Genomic Elements (UCEs)

Sequences for 854 previously published avian UCEs were obtained from the Dryad Digital Repository doi:10.5061/dryad.64dv0tg1 (Faircloth et al. 2012) for chicken (Gallus gallus), emu, ostrich (Struthio camelus), elegant crested tinamou (Eudromia elegans), and green anole (Anolis carolinensis [reptilian outgroup]; supplementary data S1, Supplementary Material online). Published emu sequences were compared with the draft genome assembly and degenerate bases were introduced at variable sites. BLAST (Altschul et al. 1997) searches were used to retrieve each UCE from the publicly available genome sequences for three additional species: turkey (Meleagris gallopavo, using chicken query sequences), zebra finch (Taeniopygia guttata), and budgerigar (Melopsittacus undulatus, both queried with the banded pitta, Pitta guajana [Faircloth et al. 2012]), as well as from the draft Chilean tinamou assembly (queried with elegant crested tinamou). Moa UCEs were assembled by mapping reads to emu reference sequences with Stampy v. 1.0.17 (Lunter and Goodson 2011), and moa–emu alignments were visualized with Integrative Genomics Viewer v. 2.2 (Robinson et al. 2011). Moa consensus sequences were constructed using SAMtools v. 0.1.18 (Li et al. 2009) and custom Perl scripts, and required a minimum coverage depth of 2 and Phred quality score Q ≥ 13 at each position.

Protein-Coding Genes

Ensembl BioMart (Kinsella et al. 2011) was used to identify one-to-one orthologs with conserved exon–intron structure in the chicken, turkey, zebra finch, and green anole for which there also existed homology-based annotations in the draft emu and Chilean tinamou genomes (n = 594; supplementary data S2, Supplementary Material online). Computationally derived gene models for emu and tinamou were curated manually in Apollo Genome Annotation Curation Tool v. 1.11.6 (Lewis et al. 2002), and sequences were retrieved as cDNAs. Budgerigar sequences were identified with BLASTn searches of zebra finch queries against the set of predicted budgie cDNAs available in Ensembl Pre! (ftp://ftp.ensembl.org/pub/pre/fasta/cdna/melopsittacus_undulatus/, last accessed April 24, 2014). Moa consensus sequences were generated from alignment to emu references as described above.

Assessment of Mutational Saturation

Mutational saturation was assessed by graphical exploration of patterns of transitions and transversions as well as using the Xia et al. (2003) information entropy-based index of substitution saturation (Iss). Both approaches were performed in DAMBE v. 5.2.57 (Xia 2013). Observed transitional and transversional substitutions were plotted against corrected genetic distances estimated using a GTR model for the UCE data set and for each of the three codon positions in the protein-coding gene data set. The extent of substitutional saturation for each data partition was then formally tested by comparing the Iss to Iss.c, the critical value at which saturation is sufficient to cause sequences to begin to fail to recover the true topology (Xia et al. 2003).

Isolation of CR1 Retroelements

Two in silico approaches were employed to isolate shared CR1 retroelement insertions that support a moa–tinamou clade observed in phylogenies reconstructed from UCEs and protein-coding genes. First, CR1 repeats were identified in emu and Chilean tinamou scaffolds with RepeatMasker v. 3.3.0 (Smit et al. 2010), and sequences were retrieved for each CR1 plus up to 500 bp of nonrepetitive flanking DNA in both directions. Tinamou CR1-containing regions were used as queries in BLASTn searches of the emu genome, and candidates were retained when both tinamou flanking sequences could be aligned with emu up to the CR1 boundary, but where emu lacked the intervening repeat. BLAST searches against the data set of moa sequencing reads flagged potential candidates when a moa read overlapped the CR1-flank boundary or when one member of a moa read pair fell within the flanking sequence and the second aligned within the CR1 repeat (in both cases, supporting a shared CR1 in tinamou and moa that is absent from the emu).

Second, we inverted the search strategy by repeat-masking the raw moa reads and searching for read pairs where one partner was annotated as a CR1 repeat and its partner was nonrepetitive. The nonrepetitive partner was used to query the tinamou draft genome, after which the CR1-containing read was used in a bl2seq search against the tinamou hit sequence extended by up to 2 kb in the expected direction based on moa read pair orientation. Candidate tinamou sequences that matched both members of a moa read pair were used to query the emu genome, and were retained when the corresponding genomic region in emu lacked a CR1 insertion.

Similar bioinformatic methods were used to search for CR1 insertions supporting the two alternative hypotheses (i.e., grouping emu and moa to the exclusion of tinamou, or emu and tinamou to the exclusion of moa). For the former hypothesis, candidate CR1-containing regions in emu that lacked an insert in tinamou were flagged if a moa read overlapped the CR1-flank boundary or if members of a moa read pair aligned within the flank and the CR1 repeat. For the latter hypothesis, candidate regions containing CR1 insertions in both the emu and tinamou were flagged if a single moa read was split across the insertion site and aligned to the flanking sequence on either side of the repeat (i.e., indicating that moa lacks a CR1 insertion that is shared by emu and tinamou).

Primers were designed in FastPCR v. 5.1.68 beta1 (Kalendar et al. 2009) within regions flanking each candidate CR1 insertion using consensus sequences from emu, tinamou, and moa. In vitro screening for the presence or absence of homologous insertions was performed for ostrich, Southern cassowary, lesser rhea, all five kiwi species, and three additional tinamou species (supplementary table S1, Supplementary Material online). PCR amplifications followed a touchdown protocol (Haddrath and Baker 2012), and all amplified products were recovered (Dean and Greenwald 1995), sequenced with BigDye v. 3.1 chemistry following standard protocols, and resolved on an ABI 3730 Genetic Analyzer (Applied Biosystems, Foster City, CA). Queries against the RepeatMasker database identified which amplified products contained CR1 insertions.

Phylogenomic Analysis

Sequences for phylogeny reconstruction were aligned with Muscle v. 3.8.31 (Edgar 2004) and inspected in BioEdit v. 7.1.1 (Hall 1999). Best-fitting models of nucleotide substitution for UCEs were chosen according to the corrected Akaike information criterion (AICc) in jModelTest v. 2.1.1 (Posada 2008), and single-locus maximum-likelihood trees were built in PhyML v. 20120412 (Guindon and Gascuel 2003) from 1,000 bootstrap replicates. For protein-coding genes, the best partitioning scheme was determined with PartitionFinder v. 1.0.1 (Lanfear et al. 2012). Gene trees were estimated from 1,000 bootstrap replicates in RaxML v. 7.2.8 (Stamatakis et al. 2005). For both UCEs and protein-coding genes, species trees were estimated from the set of individual gene trees with MP-EST v. 1.2 (Liu et al. 2010).

Analyses were also conducted on separate concatenated data sets for UCEs and protein-coding genes using Bayesian inference as implemented in MrBayes v. 3.1.2 (Huelsenbeck and Ronquist 2001). The concatenated UCE data set was partitioned to apply the best-fitting models of nucleotide substitution. The protein-coding gene data set was partitioned according to codon position with the best-fitting model for each partition determined using AICc in jModelTest. MrBayes runs were performed with four chains per run, set for 50 million generations and sampling every 5,000 generations. All other run parameters were set as default and a 25% burnin was applied.

MrBayes runs were performed on the CIPRES Science Gateway portal (Miller et al. 2010); all other phylogenetic analyses, including those detailed below, were conducted on the GPC supercluster at the SciNet HPC consortium (Loken et al. 2010).

Bayesian Phylogenetic Reconstructions Using Heterogeneous Models

Bayesian inference incorporating heterogeneous models of evolution was used to investigate the influence of nonstationarity in base composition and exchange rate upon the topologies recovered from MrBayes concatenated and MP-EST species tree approaches. For computational tractability, data sets were simplified through the removal of constant sites and singletons and by translation of protein-coding genes, which produced alignments of 10,424 bp for UCEs and 16,849 amino acid residues for protein-coding genes. PhyloBayes MPI v. 1.4 (Lartillot et al. 2013) was used to build topologies with CAT profile mixture models, which attempt to account for heterogeneity in the substitution process through the use of multiple site classes associated with unique substitution matrices, and with the total number of classes itself estimated as a free parameter. Parameter estimates and consensus trees were derived from three independent runs per data set run for a minimum of 50,000 cycles using a CAT–GTR substitution scheme.

A second approach to modeling heterogeneity in the evolutionary process was performed in P4 (Foster et al. 2009), which incorporates tree-heterogeneous models that allow multiple compositional and/or rate matrices across the topology. Parameters for “base” homogeneous models (i.e., with single compositional and rate matrices) were estimated with jModelTest for UCEs and with ProtTest v. 3.3 (Darriba et al. 2011) for the amino acid data set. Consensus estimates were obtained from a minimum of three chains per model that were run for a minimum of 2 million generations each, with sampling every 100 generations and with rate and composition matrices modeled as free parameters and initially assigned randomly upon the starting tree. Adequacy of P4 models to account for compositional nonstationarity was assessed with compositional χ² tests where the observed χ² value was compared with a null distribution generated from posterior predictive simulations from the post burnin sample for each investigated model (Foster 2004).

Postrun diagnostics and examination of raw trace data were performed in Tracer v. 1.5 (Rambaut and Drummond 2009). Convergence among runs was assessed with the bpcomp subprogram in PhyloBayes and from the calculation of average standard deviation of split frequencies (ASDOSS) in P4. Log marginal likelihoods were calculated using eq. (16) of Newton and Raftery (1994) in P4 and were used in turn to compute BFs for model comparisons. Following Kass and Raftery (1995), values of 2ln BF >6 were interpreted as providing strong evidence for the rejection of one model in favor of an alternative.

Comparison of Molecular and Morphological Data

We used the morphological data set of Worthy and Scofield (2012) to investigate the discordance between topologies constructed from morphological and molecular data. Of the 26 taxa scored for morphology, published sequence data were available from 27 nuclear genes in nine species (including 7 of the 12 extant paleognaths scored for morphology): elegant crested tinamou, little bush moa, ostrich, emu, Southern cassowary (Casuarius casuarius), lesser rhea (Pterocnemia pennata), greater rhea (Rhea americana), Southern brown kiwi (Apteryx australis), and the outgroup magpie goose (Anseranas semipalmata) (Haddrath and Baker 2012). A molecular phylogeny was estimated in MrBayes from the concatenated data set of 31,013 bp of aligned DNA, with gene sequences consisting entirely of exonic nucleotides partitioned by codon position and sequences spanning introns partitioned by gene. All run parameters follow those for MrBayes analyses detailed above.

Excess and retention indices were calculated with PAUP* v. 4.0b10 (Swofford 2001) for each morphological character (N.B. only 129 of 179 morphological characters remained informative with reduced taxon sampling). Median values are reported from calculations using 1,000 trees drawn from the PP distribution from the Bayesian analysis of molecular data after chains reached convergence. These indices assess the fit of each morphological character on the molecular topology, with high excess and low retention values, respectively, indicating poor agreement (Holland et al. 2010).

Cliques of mutually compatible morphological characters, which could be indicative of concerted convergence in response to similar selective regimes, were identified following Blanke et al. (2013). Pairwise excess indices, which measure the dissimilarity between characters as the additional number of tree steps relative to the minimum number needed to map individual characters onto the molecular topology, were calculated for each pair of morphological characters using PAUP* and custom Perl scripts. UPGMA clustering of the pairwise dissimilarity matrix in PAUP* produced character cliques of compatible members whose pairwise dissimilarity equals zero.

Acknowledgments

This work was supported by the Natural Science and Engineering Research Council of Canada to A.J.B. and the Royal Ontario Museum Governors Fund to A.J.B. Sequencing was performed in Toronto, Canada, at the Ontario Institute for Cancer Research (OICR) and The Centre for Applied Genomics (TCAG) at Sick Kids Hospital. Computations were performed on the GPC supercomputer at the SciNet HPC Consortium and the CIPRES Science Gateway Portal. SciNet is funded by the Canada Foundation for Innovation under the auspices of Compute Canada, the Government of Ontario, Ontario Research Fund—Research Excellence, and the University of Toronto.

References

Allentoft

Rawlence

Moa’s ark or volant ghosts of Gondwana? Insights from nineteen years of ancient DNA research on the extinct moa (Aves: Dinornithiformes) of New Zealand

Ann Anat.

2012

, vol.

194

(pg.

)

Altschul

Madden

Schäffer

Zhang

Miller

Lipman

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

Nucleic Acids Res.

1997

, vol.

(pg.

3389

3402

)

Bhullar

B-AS

Marugán-Lobón

Racimo

Bever

Rowe

Norell

Abzhanov

Birds have paedomorphic dinosaur skulls

Nature

2012

, vol.

487

(pg.

223

226

)

Blanke

Greve

Wipfler

Beutel

Holland

Misof

The identification of concerted convergence in insect heads corroborates Palaeoptera

Syst Biol.

2013

, vol.

(pg.

250

263

)

Bock

Bühler

Schuchmann

K-L

Schmidt-Keonig

van den Elsen

The evolution and biogeographical history of the palaeognathous birds

Current topics in avian biology

1990

Proceedings of the International Centennial Meeting of the Deutsche Ornithologen Gesellschaft; 1998. Garmisch-Partenkirchen (Germany): Verlag der Deutschen Ornithologen-Gesellschaft. p. 31–36

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Bourdon

de Ricqles

Cubo

A new transantarctic relationship: morphological evidence for a Rheidae–Dromaiidae–Casuariidae clade (Aves, Palaeognathae, Ratitae)

Zool J Linn Soc Lond.

2009

, vol.

156

(pg.

641

663

)

Google Scholar

Crossref

WorldCat

Braun

Kimball

Examining basal avian divergences with mitochondrial sequences: model complexity, taxon sampling, and sequence length

Syst Biol.

2002

, vol.

(pg.

614

625

)

Cracraft

Phylogeny and evolution of the ratite birds

Ibis

1974

, vol.

116

(pg.

494

521

)

Google Scholar

Crossref

WorldCat

Cracraft

Barker

Braun

Harshman

Dyke

Feinstein

Stanley

Cibois

Schikler

Beresford

et al.

Cracraft

Donoghue

Phylogenetic relationships among modern birds (Neornithes): toward an avian tree of life

Assembling the tree of life

2004

New York

Oxford University Press

(pg.

468

489

)

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Cubo

Arthur

Patterns of correlated character evolution in flightless birds: a phylogenetic approach

Evol Ecol.

2001

, vol.

(pg.

693

702

)

Google Scholar

Crossref

WorldCat

Darriba

Taboada

Doallo

Posada

ProtTest 3: fast selection of best-fit models of protein evolution

Bioinformatics

2011

, vol.

(pg.

1164

1165

)

Dean

Greenwald

Use of filtered pipette tips to elute DNA from agarose gels

BioTechniques

1995

, vol.

pg.

980

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

Degnan

Rosenberg

Gene tree discordance, phylogenetic inference and the multispecies coalescent

Trends Ecol Evol.

2009

, vol.

(pg.

332

340

)

del Hoyo

Elliot

Sargatal

Handbook of the birds of the world

Barcelona

1992

pg.

Lynx Edicions

Google Scholar

OpenURL Placeholder Text

WorldCat

Edgar

MUSCLE: multiple sequence alignment with high accuracy and high throughput

Nucleic Acids Res.

2004

, vol.

(pg.

1792

1797

)

Edwards

Jennings

Shedlock

Phylogenetics of modern birds in the era of genomics

Proc R Soc B Biol Sci.

2005

, vol.

272

(pg.

979

992

)

Google Scholar

Crossref

WorldCat

Elzanowski

Cretaceous birds and avian phylogeny

Cour Forsch Inst Senckenberg.

1995

, vol.

181

(pg.

)

Google Scholar

OpenURL Placeholder Text

WorldCat

Faircloth

McCormack

Crawford

Harvey

Brumfield

Glenn

Ultraconserved elements anchor thousands of genetic markers spanning multiple evolutionary timescales

Syst Biol.

2012

, vol.

(pg.

717

726

)

Foster

Modeling compositional heterogeneity

Syst Biol.

2004

, vol.

(pg.

485

495

)

Foster

Cox

Embley

The primary divisions of life: a phylogenomic approach employing composition-heterogeneous methods

Philos Trans R Soc B Biol Sci.

2009

, vol.

364

(pg.

2197

2207

)

Google Scholar

Crossref

WorldCat

Guindon

Gascuel

A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood

Syst Biol.

2003

, vol.

(pg.

696

704

)

Gussekloo

SWS

Cubo

Flightlessness affects cranial morphology in birds

Zoology

2013

, vol.

116

(pg.

)

Hackett

Kimball

Reddy

Bowie

RCK

Braun

Chojnowski

Cox

Han

K-L

Harshman

et al.

A phylogenomic study of birds reveals their evolutionary history

Science

2008

, vol.

320

(pg.

1763

1768

)

Haddrath

Baker

Complete mitochondrial DNA genome sequences of extinct birds: ratite phylogenetics and the vicariance biogeography hypothesis

Proc R Soc B Biol Sci.

2001

, vol.

268

(pg.

939

945

)

Google Scholar

Crossref

WorldCat

Haddrath

Baker

Multiple nuclear genes and retroposons support vicariance and dispersal of the palaeognaths, and an early Cretaceous origin of modern birds

Proc R Soc B Biol Sci.

2012

, vol.

279

(pg.

4617

4625

)

Google Scholar

Crossref

WorldCat

Hall

BioEdit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT

Nucleic Acids Symp.

1999

, vol.

(pg.

)

Google Scholar

OpenURL Placeholder Text

WorldCat

Han

K-L

Braun

Kimball

Reddy

Bowie

RCK

Braun

Chojnowski

Hackett

Harshman

Huddleston

et al.

Are transposable element insertions homoplasy free? An examination using the avian tree of life

Syst Biol.

2011

, vol.

(pg.

375

386

)

Harshman

Braun

Huddleston

Bowie

RCK

Chojnowski

Hackett

Han

K-L

Kimball

Marks

et al.

Phylogenomic evidence for multiple losses of flight in ratite birds

Proc Natl Acad Sci U S A.

2008

, vol.

105

(pg.

13462

13467

)

Holland

Spencer

Worthy

Kennedy

Identifying cliques of convergent characters: concerted evolution in the cormorants and shags

Syst Biol.

2010

, vol.

(pg.

433

445

)

Huelsenbeck

Ronquist

MRBAYES: Bayesian inference of phylogenetic trees

Bioinformatics

2001

, vol.

(pg.

754

755

)

Huynen

Millar

Lambert

Resurrecting ancient animal genomes: the extinct moa and more

BioEssays

2012

, vol.

(pg.

661

669

)

Jeffroy

Brinkmann

Delsuc

Philippe

Phylogenomics: the beginning of incongruence?

Trends Genet.

2006

, vol.

(pg.

225

231

)

Johnston

New morphological evidence supports congruent phylogenies and Gondwana vicariance for palaeognathous birds

Zool J Linn Soc.

2011

, vol.

163

(pg.

959

982

)

Google Scholar

Crossref

WorldCat

Kalendar

Lee

Schulman

FastPCR software for PCR primer and probe design and repeat search

Genes Genom Genomics.

2009

, vol.

(pg.

)

Google Scholar

OpenURL Placeholder Text

WorldCat

Kass

Raftery

Bayes factors

J Am Stat Assoc.

1995

, vol.

(pg.

773

795

)

Google Scholar

Crossref

WorldCat

Kinsella

Kähäri

Haider

Zamora

Proctor

Spudich

Almeida-King

Staines

Derwent

Kerhornou

et al.

Ensembl BioMarts: a hub for data retrieval across taxonomic space

Database

2011

, vol.

2011

pg.

bar030

Krzywinski

Schein

Birol

Connors

Gascoyne

Horsman

Jones

Marra

Circos: an information aesthetic for comparative genomics

Genome Res.

2009

, vol.

(pg.

1639

1645

)

Lanfear

Calcott

SYW

Guindon

PartitionFinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses

Mol Biol Evol.

2012

, vol.

(pg.

1695

1701

)

Lartillot

Brinkmann

Philippe

Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model

BMC Evol Biol.

2007

, vol.

pg.

Lartillot

Philippe

A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process

Mol Biol Evol.

2004

, vol.

(pg.

1095

1109

)

Lartillot

Rodrigue

Stubbs

Richer

PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures profiles in a parallel environment

Syst Biol.

2013

, vol.

(pg.

611

615

)

Lewis

Searle

SMJ

Harris

Gibson

Iyer

Richter

Wiel

Bayraktaroglu

Birney

Crosby

et al.

Apollo: a sequence annotation editor

Genome Biol.

2002

, vol.

(pg.

RESEARCH0082.1

0082.14

)

Google Scholar

Crossref

WorldCat

Handsaker

Wysoker

Fennell

Ruan

Homer

Marth

Abecasis

Durbin

The sequence alignment/map format and SAMtools

Bioinformatics

2009

, vol.

(pg.

2078

2079

)

Liu

Edwards

A maximum pseudo-likelihood approach for estimating species trees under the coalescent model

BMC Evol Biol.

2010

, vol.

pg.

302

Livezey

McNamara

Heterochrony and the evolution of avian flightlessness

Evolutionary change and heterochrony

1995

New York

Wiley

(pg.

169

193

)

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Livezey

Zusi

Higher-order phylogeny of modern birds (Theropoda, Aves: Neornithes) based on comparative anatomy. II. Analysis and discussion

Zool J Linn Soc.

2007

, vol.

149

(pg.

)

Loken

Gruner

Groer

Peltier

Bunn

Craig

Henriques

Dempsey

C-H

Chen

et al.

SciNet: lessons learned from building a power-efficient top-20 system and data centre

J Phys: Conf Ser.

2010

, vol.

256

pg.

012026

Google Scholar

Crossref

WorldCat

Lunter

Goodson

Stampy: a statistical algorithm for sensitive and fast mapping of Illumina sequence reads

Genome Res.

2011

, vol.

(pg.

936

939

)

Matzke

Churakov

Berkes

Arms

Kelsey

Brosius

Kriegs

Schmitz

Retroposon insertion patterns of neoavian birds: strong evidence for an extensive incomplete lineage sorting era

Mol Biol Evol.

2012

, vol.

(pg.

1497

1501

)

Mayr

Metaves, Mirandornithes, Strisores and other novelties—a critical review of the higher-level phylogeny of neornithine birds

J Zool Syst Evol Res.

2011

, vol.

(pg.

)

Google Scholar

Crossref

WorldCat

Millar

Huynen

Subramanian

Mohandesan

Lambert

New developments in ancient genomics

Trends Ecol Evol.

2008

, vol.

(pg.

386

393

)

Miller

Pfeiffer

Schwartz

Creating the CIPRES Science Gateway for inference of large phylogenetic trees

2010

Proceedings of the Gateway Computing Environments Workshop (GCE); New Orleans (LA). Nov 14, 2010. Institute of Electrical and Electronics Engineers. p. 1–8

Google Scholar

Newton

Raftery

Approximate Bayesian inference with the weighted likelihood bootstrap

J R Stat Soc B Methodol.

1994

, vol.

(pg.

)

Google Scholar

OpenURL Placeholder Text

WorldCat

Oliver

Microevolutionary processes generate phylogenomic discordance at ancient divergences

Evolution

2013

, vol.

(pg.

1823

1830

)

Owen

On the bone of an unknown struthious bird from New Zealand

Proc Zool Soc Lond.

1839

, vol.

(pg.

169

171

)

Google Scholar

OpenURL Placeholder Text

WorldCat

Philippe

Brinkmann

Lavrov

Littlewood

DTJ

Manuel

Wörheide

Baurain

Resolving difficult phylogenetic questions: why more sequences are not enough

PLoS Biol.

2011

, vol.

pg.

e1000602

Phillips

Gibb

Crimp

Penny

Tinamous and moa flock together: mitochondrial genome sequence analysis reveals independent losses of flight among ratites

Syst Biol.

2010

, vol.

(pg.

107

)

Poe

Chubb

Birds in a bush: five genes indicate explosive evolution of avian orders

Evolution

2004

, vol.

(pg.

404

415

)

Posada

jModelTest: phylogenetic model averaging

Mol Biol Evol.

2008

, vol.

(pg.

1253

1256

)

Rambaut

Drummond

2009

Tracer v1.5. [cited 2014 Apr 24]. Available from: http://tree.bio.ed.ac.uk/software/tracer/

Ray

Xing

Salem

A-H

Batzer

SINEs of a nearly perfect character

Syst Biol.

2006

, vol.

(pg.

928

935

)

Rizzi

Lari

Gigli

De Bellis

Caramelli

Ancient DNA studies: new perspectives on old samples

Genet Sel Evol.

2012

, vol.

pg.

Robinson

Thorvaldsdóttir

Winckler

Guttman

Lander

Getz

Mesirov

Integrative genomics viewer

Nat Biotechnol.

2011

, vol.

(pg.

)

Smit

AFA

Hubley

Green

2010

RepeatMasker Open-3.0. [cited 2014 Apr 24]. Available from: http://www.repeatmasker.org

Smith

Braun

Kimball

Ratite nonmonophyly: independent evidence from 40 novel loci

Syst Biol.

2013

, vol.

(pg.

)

Stamatakis

Ludwig

Meier

RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees

Bioinformatics

2005

, vol.

(pg.

456

463

)

Swofford

2001

PAUP*. Phylogenetic analysis using parsimony (*and other methods). Version 4. Sunderland (MA): Sinauer Associates

Telford

Copley

Improving animal phylogenies with genomic data

Trends Genet.

2011

, vol.

(pg.

186

195

)

Waddell

Kishino

Ota

A phylogenetic foundation for comparative mammalian genomics

Genome Inform.

2001

, vol.

(pg.

141

154

)

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

Worthy

Scofield

Twenty-first century advances in knowledge of the biology of moa (Aves: Dinornithiformes): a new morphological analysis and moa diagnoses revised

N Z J Zool.

2012

, vol.

(pg.

153

)

Google Scholar

Crossref

WorldCat

Xia

DAMBE5: a comprehensive software package for data analysis in molecular biology and evolution

Mol Biol Evol.

2013

, vol.

(pg.

1720

1728

)

Xia

Xie

Salemi

Chen

Wang

An index of substitution saturation and its application

Mol Phylogenet Evol.

2003

, vol.

(pg.

)

Author notes

Associate editor: Nicolas Vidal

Download all slides

Month:	Total Views:
January 2017	2
February 2017	41
March 2017	40
April 2017	42
May 2017	17
June 2017	19
July 2017	14
August 2017	50
September 2017	33
October 2017	55
November 2017	26
December 2017	77
January 2018	76
February 2018	67
March 2018	138
April 2018	181
May 2018	209
June 2018	122
July 2018	163
August 2018	70
September 2018	86
October 2018	80
November 2018	93
December 2018	69
January 2019	44
February 2019	87
March 2019	110
April 2019	308
May 2019	226
June 2019	145
July 2019	201
August 2019	202
September 2019	172
October 2019	107
November 2019	97
December 2019	92
January 2020	87
February 2020	108
March 2020	89
April 2020	105
May 2020	58
June 2020	90
July 2020	50
August 2020	69
September 2020	71
October 2020	86
November 2020	88
December 2020	137
January 2021	79
February 2021	76
March 2021	108
April 2021	90
May 2021	68
June 2021	29
July 2021	49
August 2021	49
September 2021	28
October 2021	69
November 2021	87
December 2021	71
January 2022	109
February 2022	68
March 2022	80
April 2022	66
May 2022	66
June 2022	37
July 2022	39
August 2022	47
September 2022	87
October 2022	85
November 2022	89
December 2022	111
January 2023	52
February 2023	69
March 2023	46
April 2023	62
May 2023	54
June 2023	27
July 2023	51
August 2023	60
September 2023	53
October 2023	81
November 2023	61
December 2023	81
January 2024	64
February 2024	52
March 2024	35
April 2024	87
May 2024	43
June 2024	57
July 2024	54
August 2024	31
September 2024	76
October 2024	73
November 2024	80

Article Contents

Genomic Support for a Moa–Tinamou Clade and Adaptive Morphological Convergence in Flightless Ratites

Abstract

Introduction

Results

Support for Ratite Paraphyly from UCEs and Protein-Coding Genes

Corroboration of a Moa–Tinamou Clade from CR1 Retroposon Insertions

Interpreting Morphological Characters on the Molecular Tree

Discussion

Materials and Methods

Sample Information

HTS of Paleognaths

Ultraconserved Genomic Elements (UCEs)

Protein-Coding Genes

Assessment of Mutational Saturation

Isolation of CR1 Retroelements

Phylogenomic Analysis

Bayesian Phylogenetic Reconstructions Using Heterogeneous Models

Comparison of Molecular and Morphological Data

Acknowledgments

References

Author notes

Supplementary data

Citations

Views

Altmetric

Email alerts

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Article Contents

Genomic Support for a Moa–Tinamou Clade and Adaptive Morphological Convergence in Flightless Ratites

Abstract

Introduction

Results

Support for Ratite Paraphyly from UCEs and Protein-Coding Genes

Corroboration of a Moa–Tinamou Clade from CR1 Retroposon Insertions

Interpreting Morphological Characters on the Molecular Tree

Discussion

Materials and Methods

Sample Information

HTS of Paleognaths

Ultraconserved Genomic Elements (UCEs)

Protein-Coding Genes

Assessment of Mutational Saturation

Isolation of CR1 Retroelements

Phylogenomic Analysis

Bayesian Phylogenetic Reconstructions Using Heterogeneous Models

Comparison of Molecular and Morphological Data

Acknowledgments

References

Author notes

Supplementary data

Citations

Views

Altmetric

Email alerts

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only