Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The effects of fossil taxa, hypothetical predicted ancestors, and a molecular scaffold on pseudoextinction analyses of extant placental orders

Abstract

Pseudoextinction analyses, which simulate extinction in extant taxa, use molecular phylogenetics to assess the accuracy of morphological phylogenetics. Previous pseudoextinction analyses have shown a failure of morphological phylogenetics to place some individual placental orders in the correct superordinal clade. Recent work suggests that the inclusion of hypothetical ancestors of extant placental clades, estimated by ancestral state reconstructions of morphological characters, may increase the accuracy of morphological phylogenetic analyses. However, these studies reconstructed direct hypothetical ancestors for each extant taxon based on a well-corroborated molecular phylogeny, which is not possible for extinct taxa that lack molecular data. It remains to be determined if pseudoextinct taxa, and by proxy extinct taxa, can be accurately placed when their immediate hypothetical ancestors are unknown. To investigate this, we employed molecular scaffolds with the largest available morphological data set for placental mammals. Each placental order was sequentially treated as pseudoextinct by exempting it from the molecular scaffold and recoding soft morphological characters as missing for all its constituent species. For each pseudoextinct data set, we omitted the pseudoextinct taxon and performed a parsimony ancestral state reconstruction to obtain hypothetical predicted ancestors. Each pseudoextinct order was then evaluated in seven parsimony analyses that employed combinations of fossil taxa, hypothetical predicted ancestors, and a molecular scaffold. In treatments that included fossils, hypothetical predicted ancestors, and a molecular scaffold, only 8 of 19 pseudoextinct placental orders (42%) retained the same interordinal placement as on the molecular scaffold. In treatments that included hypothetical predicted ancestors but not fossils or a scaffold, only four placental orders (21%) were recovered in positions that are congruent with the scaffold. These results indicate that hypothetical predicted ancestors do not increase the accuracy of pseudoextinct taxon placement when the immediate hypothetical ancestor of the taxon is unknown. Hypothetical predicted ancestors are not a panacea for morphological phylogenetics.

Introduction

Charles Darwin [1] provided the world with its first glimpse of a modern phylogeny with contemporary taxa deployed at its tips and ancestors at internal nodes. Ever since, evolutionary biologists have attempted to decipher the relationships between living and extinct members of the Tree of Life. Morphological characters were the main source of data for phylogenetic analyses for much of the 20th century [2, 3], but since the 1960s systematists have increasingly relied on different types of genetic data and now genomic data with the advent of massively parallel DNA sequencing [3, 4]. Molecular data are preferred by most systematists due to large sample size [2, 3, 5] and greater phylogenetic reliability [6]. Whereas molecular data are readily available for many extant taxa, the procurement of DNA sequences has proven much more challenging for recently extinct species and impossible for long extinct species [7]. The dearth of molecular data for most extinct species, which comprise the vast majority of species that have lived on Earth, mandates the continued use of morphological data to study the phylogenetic relationships of extinct taxa to each other and to extant taxa [5].

Despite this demonstrated need, morphological characters are now rarely collected for phylogenetic studies of extant taxa [8]. Thus, many contemporary phylogenies are built using only molecular data, although fossils remain critical for time-scaling these phylogenies [3]. Additionally, the inclusion of fossil taxa in phylogenetic analyses can be critical for elucidating temporal sequences of character evolution because extinct taxa may preserve morphological character combinations that are not present in extant taxa [9, 10]. To include extinct species in phylogenetic analyses we need to continue to collect morphological data [8, 11]. At the same time, we must also assess the accuracy of morphological phylogenetic methods given potential problems with correlated homoplasy that can result in morphological support for groups that are polyphyletic based on molecular data [1215].

In the case of the 19 extant orders of placental mammals, there are both congruent and incongruent clades that have emerged from molecular and morphological studies of interordinal relationships. Both types of data provide robust support for Glires and Paenungulata, but there are far more cases of incongruence [13]. Molecular data sets provide incontrovertible support for four superorders of placental mammals: Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires [1621]. However, only Xenarthra has been recovered by traditional morphological data sets [12]. Instead, morphological phylogenies have recovered groups such as Edentata (Xenarthra + Pholidota), Lipotyphla, Ungulata, and Volitantia [2226], all of which are incompatible with one or more of the four molecular superorders. These problems with the phylogenetic placement of extant placental orders based on morphological phylogenetics have raised concerns pertaining to the reliability of morphology to place extinct taxa in the eutherian tree [12, 15, 27].

Previous studies have examined the placement of extinct taxa in phylogenies by simulating extinction in extant taxa (also called ‘pseudoextinction’) for which phylogenetic relationships are robustly supported. This is achieved by coding molecular characters and soft tissue characters as missing for each pseudoextinct taxon prior to phylogenetic analysis [12, 15, 2830]. These techniques have been applied to species within Tenrecidae [28], placental mammal orders [12, 15], groups of vertebrates and invertebrates [30], and species within the order Primates [29]. Asher and Hofreiter [28] and Pattinson et al. [29] concluded that morphological data sets were reliable for phylogenetic reconstruction whereas Springer et al. [12, 15] and Sansom & Wills [30] found that morphological data sets were inadequate for reconstructing the relationships of pseudoextinct taxa. While morphology may be able to place extinct species within extant families or orders [28, 29], the accuracy of morphology for placing more inclusive pseudoextinct groups is poor [12, 15, 30]. Thus, morphology appears to reconstruct fossil relationships more accurately when a pseudoextinct species has an extant, close relative that can be placed using molecular data.

Small sample size was noted as one potential reason for the poor performance of morphology in reconstructing higher-level relationships among placental mammals [12]. All but one of the aforementioned pseudoextinction studies [15] were based on an average of <500 morphological characters for each pseudoextinct group.

The large data set of O’Leary et al. [31], which contains >4500 characters, allowed pseudoextinction analyses to be revisited with a significantly increased morphological sample size [15]. Thirteen of the 18 placental orders (72%) moved to an interordinal position that violated the well-supported molecular scaffold when they were treated as pseudoextinct (Xenarthra [Cingulata + Pilosa] was treated as a single order in [15]). Three of these orders (Afroscoricida, Eulipotyphla, Cetartiodactyla) were also rendered polyphyletic. Only Lagomorpha, Rodentia, Proboscidea, Sirenia, and Hyracoidea retained their correct interordinal positions [15]. These results showcase that even large morphological data sets are subject to the problems of homoplasy.

Beck and Baillie [32] assessed the accuracy of morphological phylogenetics when hypothetical predicted ancestors (HPAs) were added to O’Leary et al.’s [31] morphological data set. These authors enforced a molecular scaffold from Meredith et al. [33] for 46 extant mammals and reconstructed ancestral states for each internal node. All of these predicted ancestors were then added to O’Leary et al.’s [31] morphological matrix and their impact on phylogeny reconstruction was assessed in unconstrained phylogenetic analyses. Beck and Baillie [32] recovered a morphological phylogeny that is largely congruent with Meredith et al.’s [33] DNA-based phylogeny including the correct placement of each placental order in its well-supported superorder (i.e., Afrotheria, Xenarthra, Euarchontoglires, Laurasiatheria). An important conclusion from this study is that predicted ancestors have the potential to reconcile estimates of phylogeny that are inferred from morphological and molecular data sets. Indeed, application of Beck and Baillie’s [32] method resulted in most of the same clades that were present in the molecular scaffold that was used to reconstruct ancestral morphological characters. Similar results were obtained by Asher et al. [34] when these methods were applied to a 219-character morphological data set for Glires.

It is important to note that Beck and Baillie [32] and Asher et al. [34] used hypothetical predicted ancestors for different purposes. Beck and Baillie’s [32] analyses with hypothetical predicted ancestors represented an ideal scenario for finding real fossils with character combinations that can link morphologically disparate mammalian taxa without the need for molecular data. As noted by these authors, if "genuine fossil taxa exhibit these character combinations, and are sufficiently well preserved, then their discovery and inclusion in phylogenetic analyses might be sufficient to largely resolve the current conflict between molecular and morphological analyses…" Asher et al. [34] viewed Beck and Baillie’s [32] method as a novel approach to incorporate the phylogenomic signal from modern taxa into a phylogenetic analysis without "having to sample non-fossilizable data, such as DNA." However, it remains unclear if the inclusion of predicted ancestors for extant taxa will result in more accurate phylogenies for extinct taxa for which direct ancestors cannot be predicted with Beck and Baillie’s [32] method. This is because most extinct taxa lack molecular data and cannot be included as tips on a molecular phylogeny that is used to reconstruct ancestral states.

Here we address this issue with O’Leary et al.’s [31] large morphological data set. To do this, we combined pseudoextinction [12, 28] and HPAs [32, 34] to assess if inclusion of the latter results in the accurate placement of pseudoextinct taxa for which the immediate ancestor cannot be reconstructed. We also examine the impact of including different combinations of molecular scaffolds, predicted ancestors, and real fossils on recovering accurate phylogenies for pseudoextinct taxa.

Methods

Morphological data set

Morphological data were obtained from Morphobank project 773 [31]. The unaltered data set contained a total of 4,541 unordered morphological characters that were scored across 86 mammalian taxa, 40 of which are extinct. Specifically, we employed the data set that was used for maximum likelihood analyses in RAxML by O’Leary et al. [31]. This matrix differs from the original matrix by containing no letters for any coded morphological character. This means that the total number of possible character states for any given character was 10 (0–9) instead of 19 (0–9 and: E, F, I, J, L, O, P, Q, X). O’Leary et al. [29] recoded character states that were above 9 as missing (?) while any character states coded as 0–9 were left intact. This change affected twelve morphological characters. This matrix also differs from the original matrix in that all polymorphic character scorings were recoded as missing (?). These changes increased the number of constant characters from 403 to 407.

Each extant order of placental mammals was represented by at least one taxon. Nine of these orders were represented by two or more species. For several of these analyses, the extinct taxa were removed from the matrix, leaving only the 46 extant taxa. Forty-two of these are placentals, two are marsupials, and two are monotremes. Following Beck and Baillie [32] we updated Thomashuxleya with morphological characters from a more recent study [35] and removed 407 characters that were constant across all 86 taxa.

Molecular scaffold

We employed a well-supported molecular scaffold [36] based on Meredith et al.’s [33] phylogenetic analyses of 33,030 base pairs (bp) of protein coding DNA (21 gene segments) and 2,573 bp of DNA from untranslated regions (five gene segments). Clades that were included in our molecular scaffold are also well supported by other molecular studies [16, 3740]. Our molecular scaffold is illustrated in Fig 1 and included all 46 extant taxa in O’Leary et al.’s [31] data set. This scaffold is fully bifurcating with the exception of the following polytomies: 1. ((Dermoptera + Primates), Scandentia, Glires); 2. (Talpa, Erinaceus, Sorex); 3. ((Saccopteryx + Nycteris), Myotis, Pteronotus); 4. ((Pholidota + Carnivora), Perissodactyla, Cetartiodactyla, Chiroptera); 5. (Hyracoidea, Proboscidea, Sirenia); and 6. (Xenarthra, Afrotheria, Boreoeutheria). We retained these polytomies in all of our scaffold analyses because these nodes were resolved differently in DNA versus amino acid analyses or were not strongly supported (bootstrap support < 90%; Bayesian posterior probabilities < 0.95) [33]. The morphological data of the extant taxa from O’Leary et al. [31] was analyzed with parsimony (see below) and our molecular scaffold to obtain a fully bifurcating molecular scaffold species tree (MSST) (see Fig 1) that was used for ancestral state reconstructions and tree-to-tree comparisons with Robinson and Foulds [41] distances (see below). The molecular scaffold was also employed as a backbone constraint (i.e., containing only a subset of the included taxa) in analyses that included extinct taxa. Finally, extant taxa for each extant placental order were dropped from the molecular scaffold in analyses that treated these orders as pseudoextinct.

thumbnail
Fig 1. Molecular scaffold species tree.

Our fully bifurcating molecular scaffold species tree (20588 steps) that was used for ancestral state reconstructions. The tree was rooted with the monotreme Ornithorhynchus anatinus (platypus). Each internal node has a designated number in red. This number corresponds with the number assigned to each hypothetical predicted ancestor (HPA) in our matrices. These matrices are available in figshare (10.6084/m9.figshare.15183258). Internal nodes that were constrained in the molecular scaffold are denoted with blue circles.

https://doi.org/10.1371/journal.pone.0257338.g001

Pseudoextinct matrices

O’Leary et al.’s [31] phenomic matrix includes both osteological (1–3,660) and non-osteological (3,661–4,541; total of 881) characters. The latter category includes both soft tissue and behavioral characters. For pseudoextinct taxa, only osteological characters were retained for parsimony analyses. Non-osteological characters were scored as missing (?) for pseudoextinct taxa to mimic the usual absence of these characters in extinct taxa that are only known from fossilized remains. Matrix alterations were performed in Mesquite [42]. After the removal of the 407 constant characters, 4,134 characters remained (3,371 of which were osteological).

Ancestral state reconstructions

The MSST constructed with O’Leary et al.’s [31] morphological data set and our molecular scaffold was loaded into Mesquite. Ancestral state reconstructions (ASR) for each node on our fully bifurcating species tree (Fig 1) were performed with parsimony using the ‘Trace all characters’ command. This procedure was replicated for each pseudoextinct data set, i.e., the order in question was removed from the MSST prior to parsimony ancestral state reconstruction, and ASRs were constructed for a tree that did not include the pseudoextinct order. These reconstructions were exported into Excel where all ambiguously reconstructed characters (i.e., with two or more possible states) were removed and replaced with a ‘?’. Following these alterations, each reconstructed data set was run through the same R script used by Beck and Baillie [32] to ensure that the hypothetical reconstructed ancestors were not coded for any characters that are inapplicable given the reconstructed state at another character. We did not alter this script except to change the name of the input (line 49) and output (line 85) file names. This code is available in figshare (10.6084/m9.figshare.15183372).

We did not explore maximum likelihood ASRs because Beck and Baillie [32] found that the two optimality criterion used in their study (maximum parsimony and maximum likelihood) both fit the Meredith et al. [33] topology equally well when measured by Robinson-Foulds distances [41]. Additionally, in analyses that included all of the hypothetical predicted ancestors, maximum parsimony analyses were more accurate than maximum likelihood analyses in reconstructing monophyletic superordinal clades.

Phylogenetic reconstructions

Each matrix was analyzed in TNT [43] using a traditional search and TBR branch swapping. Morganucodon oehleri was used as the outgroup for all analyses that included fossil taxa. Ornithorhynchus anatinus was used as the outgroup in analyses that omitted fossil taxa. We used maximum parsimony with stepwise addition and 10,000 random taxon addition sequences to search for the shortest tree in each analysis. We did not perform maximum likelihood analyses because there is evidence that parsimony performs better than other optimality criterion for morphological phylogenetic analyses [4446]. Furthermore, there are concerns about the appropriateness of likelihood models for morphological data since many of these models were designed to fit molecular data [3, 47, 48]. Seven different sets of analyses ((Scaffold + Fossils + HPA); (Scaffold + Fossils); (Scaffold + HPA); (Fossils + HPA); Scaffold; Fossils; HPA) were conducted using different combinations of fossil taxa, HPAs, and our molecular scaffold. In each analysis, all extant taxa were included. Trees were saved as unrooted. For analyses that were performed with the combination of a molecular scaffold + HPAs, we set the maximum number of saved trees in TNT to 100,000 (10,000 random addition sequences x 10 trees per addition sequence). With the exception of Eulipotyphla (99,520 trees), all of the analyses with a molecular scaffold + HPAs resulted in the maximum number of saved trees. The large number of saved trees in the molecular scaffold + HPAs analysis resulted from each HPA having between one and three most parsimonious placements. For example, the HPA for node 65 (Fig 1) between Homo sapiens and Saimiri sciureus has two most parsimonious placements, as a sister taxon to either S. sciureus or (H. sapiens + S. sciureus). However, after pruning all HPAs from these Scaffold + HPA trees, all trees for each pseudoextinct order condensed to a single topology for the 46 extant taxa.

Tree comparisons

Following our analyses, all trees were exported in NEXUS format and imported into PAUP 4.0a [49]. Once in PAUP, any HPAs and/or fossil taxa were pruned from the tree and any duplicate topologies that resulted from this pruning were discarded. All remaining most parsimonious trees, each of which included 46 extant taxa, were then compared to the MSST using Robinson-Foulds (RF) distances [41]. RF distances measure the number of splits (internal branches) that are unique to one of the two trees. All tree comparisons were based on unrooted trees.

Results

Phylogenetic trees based on seven different analyses with O’Leary et al.’s [31] morphological data set were compared to our MSST using RF distances. The results of these comparisons are summarized in Table 1. Table 2 summarizes the misplacement of pseudoextinct taxa relative to the full molecular scaffold for all seven analyses. The results of these analyses are also summarized in Figs 25 and S1S3 Figs. Phylogenetic trees with pseudoextinct taxa, before and after pruning fossils and/or HPAs, are provided in Newick format in figshare (10.6084/m9.figshare.15183294 and 10.6084/m9.figshare.15183345).

thumbnail
Fig 2. Summary of molecular scaffold + hypothetical predicted ancestors + fossil taxa analyses.

Summary of the pseudoextinction results for the O’Leary et al. [31] data set for one of our seven sets of analyses (molecular scaffold + hypothetical predicted ancestors + fossil taxa). Orders that were recovered in phylogenetic positions that conflict with the molecular scaffold are indicated by arrows. All analyses recovered a single most parsimonious tree except for Macroscelidea, which had three most parsimonious reconstructions. Shifts of individual species within orders are also indicated by arrows. Only eight orders (42%) (Proboscidea, Sirenia, Lagomorpha, Perissodactyla, Cingulata, Pilosa, Rodentia, Cetartiodactyla) remained in the correct interodinal position when pseudoextinct.

https://doi.org/10.1371/journal.pone.0257338.g002

thumbnail
Fig 3. Summary of molecular scaffold + fossil taxa analyses.

Summary of the pseudoextinction results for the O’Leary et al. [31] data set for one of our seven sets of analyses (molecular scaffold + fossil taxa). Orders that were recovered in phylogenetic positions that conflict with the molecular scaffold are indicated by arrows. Any shifts of individual species are also indicated by arrows. All analyses found a single most parsimonious tree (MPT) except for Eulipotyphla and Sirenia, each of which had two most parsimonious reconstructions. One MPT for Eulipotyphla supported Epitheria (Afrotheria + Boreoeutheria), as shown above, while the second MPT supported Exafroplacentalia (Xenarthra + Boreoeutheria). One MPT for Sirenia supported Epitheria (Afrotheria + Boreoeutheria), as shown above, while the second MPT supported Atlantogenata (Xenarthra + Afrotheria). In both cases the relationships for the pseudoextinct Eulipotyphla and Sirenia were unchanged between their two MPTs. Only six orders (32%) (Proboscidea, Sirenia, Lagomorpha, Hyracoidea, Cingulata, Pilosa) remained in the correct interordinal position when pseudoextinct. * = MPT supports Atlantogenata, ß = MPT supports Chiroptera as sister to Ferae (Carnivora + Pholidota), Ω = MPT supports Chiroptera as sister to Pholidota.

https://doi.org/10.1371/journal.pone.0257338.g003

thumbnail
Fig 4. Summary of molecular scaffold analyses.

Summary of the pseudoextinction results for the O’Leary et al. [31] data set for one of our seven sets of analyses (molecular scaffold only). Orders that were recovered in phylogenetic positions that conflict with the molecular scaffold are indicated by arrows. Any shifts of individual species are also indicated by arrows. All analyses recovered a single most parsimonious tree (MPT). Only seven orders (37%) (Proboscidea, Sirenia, Lagomorpha, Hyracoidea, Cingulata, Pilosa, Rodentia) remained in the correct interordinal position when pseudoextinct. Ψ = MPT supports Chiroptera as sister to (Perissodacyla + Cetartiodactyla), Ω = MPT supports Chiroptera as sister to Pholidota.

https://doi.org/10.1371/journal.pone.0257338.g004

thumbnail
Fig 5. Summary of molecular scaffold + hypothetical predicted ancestors analyses.

Summary of the pseudoextinction results for the O’Leary et al. [31] data set for one of our seven sets of analyses (molecular scaffold + hypothetical predicted ancestors). Orders that were recovered in phylogenetic positions that conflict with the molecular scaffold are indicated by arrows. Any shifts of individual species are also indicated by arrows. All analyses recovered a single most parsimonious tree (MPT). Only seven orders (37%) (Proboscidea, Sirenia, Lagomorpha, Hyracoidea, Cingulata, Pilosa, Rodentia) remained in the correct interordinal position when pseudoextinct.

https://doi.org/10.1371/journal.pone.0257338.g005

thumbnail
Table 2. Summary of misplacements of pseudoextinct taxa relative to the molecular scaffold.

https://doi.org/10.1371/journal.pone.0257338.t002

Pseudoextinction analyses with the molecular scaffold by itself recovered the highest number of splits on the MSST and therefore had the lowest RF distances (mean = 5.58) (Table 1). However, mean RF distances were only slightly higher in other analyses that incorporated a molecular scaffold (scaffold + fossils = 5.79, scaffold + fossils + HPA = 6.21, scaffold + HPA = 6). The analyses that incorporated a molecular scaffold and HPAs had the lowest median RF distance (2) whereas the three other analyses that incorporated a molecular scaffold had the same median RF distance (4) (Table 1 and Figs 25). By contrast, mean/median RF distances were much higher for analyses that lacked a molecular scaffold (fossils +HPA = 27.49/30, fossils = 44.53/44, HPA = 36.16/34) (Table 1). Among the 19 placental orders that were treated as pseudoextinct, mean RF distances across seven different analyses were lowest for Sirenia (14.14), Dermoptera (14.86), Lagomorpha (14.86), Primates (15.14), and Scandentia (15.57). Median RF distances across seven different analyses were lowest for Proboscidea (0), Sirenia (1), Cingulata (2), Dermoptera (2), Lagomorpha (2), Pilosa (2), Primates (2) and Scandentia (2). Pseudoextinct placental orders with the highest mean RF distances, in turn, were Afrosoricida (25.43), Chiroptera (25.14), Eulipotyphla (24.43), Pholidota (23.43), Cetartiodactyla (22.86) and Tubulidentata (21.9), and pseudoextinct placental orders with the highest median RF distances were Cetartiodactyla (24), Eulipotyphla (19), Afrosoricida (18), Carnivora (16), Chiroptera (16), Pholidota (14), and Tubulidentata (14).

Table 2 summarizes the extent to which pseudoextinct and other taxa were recovered in positions that violate the molecular scaffold. These misplacements are limited to pseudoextinct taxa in analyses that incorporated a scaffold, but can potentially affect any taxon in analyses that did not employ a scaffold. In analyses that incorporated a scaffold, 11 to 12 of the 19 placental orders (58–63%) were recovered in an incorrect position. Further, in five to eight instances (26–42%) the correct superordinal group (Afrotheria, Euarchontoglires, Laurasiatheria, Xenarthra) for the pseudoextinct order was not recovered as monophyletic. One order (5%) was recovered as polyphyletic or paraphyletic in analyses with the molecular scaffold and HPAs. Three orders (16%) were recovered as polyphyletic or paraphyletic in analyses with the molecular scaffold by itself, and this increased to four orders (21%) in analyses with the scaffold and fossils (Table 2). However, the combination of scaffold + HPAs + fossils resulted in zero pseudoextinct orders that were recovered as polyphyletic or paraphyletic (Table 2). One to three pseudoextinct orders (5–16%) were recovered as monophyletic but with intraordinal relationships that conflicted with the scaffold in analyses that incorporated a scaffold for the non-pseudoextinct orders (Table 2). In the three analyses that did not incorporate a molecular scaffold, placental orders were recovered in an incorrect position in 12 to 15 cases (63–79%), at least one superordinal group was not recovered as monophyletic in 13 to 19 cases (68–100%), zero to three orders (0–16%) were recovered as polyphyletic or paraphyletic, and two to three of the orders (11–16%) with three or more taxa exhibited altered intraordinal relationships that are incongruent with the scaffold.

Among individual placental orders, ten orders (53%) were always estimated in an incorrect interordinal position that violates the molecular scaffold. In five of these cases (Afrosoricida, Chiroptera, Eulipotyphla, Pholidota, Tubulidentata) the pseudoextinct order was misplaced outside of its correct superordinal group. By contrast, only two orders (11%) (Proboscidea, Sirenia) were always estimated with the correct sister-group relationship. However, even in these cases the larger superordinal group (Afrotheria) containing these orders was not always recovered. Three orders (16%) (Afrosoricida, Cetartiodactyla, Eulipotyphla) were estimated as polyphyletic or paraphyletic in at least three of the analyses. Notably, the inclusion of HPAs increased the monophyly of pseudoextinct orders. Indeed, two of the four analyses with HPAs (scaffold + fossils + HPAs, fossils + HPAs) yielded monophyly for every pseudoextinct order. Intraordinal relationships for monophyletic orders were always incorrectly estimated when Chiroptera was pseudoextinct. Specifically, the Old World fruitbat (Pteropus giganteus) was always estimated as the sister group to all of the other bats rather than as the sister group to the other yinpterochiropteran (Rhinopoma hardwickii). Other orders with incorrect intraordinal relationships when the order was monophyletic included Rodentia (five analyses), Cetartiodactyla (three analyses), and Eulipotyphla (two analyses).

Discussion

The accuracy of morphological phylogenetics

The accurate placement of extinct taxa in the Tree of Life is a fundamental challenge in phylogenetics. Because the placement of most extinct taxa cannot be vetted with molecular data, the accuracy of morphological phylogenetics must be assessed with other approaches such as pseudoextinction analyses that are performed upon extant taxa. Previous pseudoextinction analyses using O’Leary et al.’s [31] morphological data set suggest that morphology alone is insufficient for accurate phylogenetic reconstructions of the majority of pseudoextinct orders [15]. Recently, Beck and Baillie [32] employed a molecular scaffold from Meredith et al. [33] and reconstructed hypothetical predicted ancestors for morphological characters in O’Leary et al.’s [31] data set. These authors concluded that the inclusion of HPAs and fossils in a phylogenetic analysis was sufficient to accurately place all 19 placental orders into their correct superordinal group (Xenarthra, Afrotheria, Laurasiatheria, Euarchontoglires) using morphological data alone. Asher et al. [34] obtained similar results for Glires and found that the inclusion of HPAs nearly tripled the number of well-corroborated groups that were recovered in their morphological phylogenetic analysis of this clade. Importantly, both of these studies estimated and included the immediate HPAs of all extant taxa in their morphological phylogenetic analyses [32, 34]. However, the immediate HPAs of most extinct taxa cannot be reconstructed in scaffold-based analyses because molecular data cannot be procured for these taxa. In this study, we combined HPAs with pseudoextinction analyses to determine if HPAs elsewhere in the tree are sufficient to determine the accurate placement of each ‘extinct’ placental order when its immediate HPA is unknown. We also compared the general performance of molecular scaffolds, fossil taxa, HPAs, and combinations thereof for determining the accurate placement of pseudoextinct taxa.

Our results corroborate the results of Springer et al. [15] and show morphological characters by themselves cannot be relied upon to accurately place placental orders when they are pseudoextinct. Indeed, all seven sets of analyses failed to place the majority of 19 pseudoextinct placental orders next to their correct sister group. Analyses with the molecular scaffold (11 to 12 or 58–63% incorrect sister groups) performed slightly better than analyses that excluded the molecular scaffold (12 to 15 or 63–79% incorrect sister groups). Moreover, at least one of the four superordinal groups was not recovered as monophyletic in 11 to 17 cases (58–89%) when placental orders were treated as pseudoextinct (Table 2). These results stand in sharp contrast to the analysis of Asher et al. [34] that employed HPAs but no pseudoextinction. If pseudoextinct orders are a proxy for extinct orders, the inclusion of HPAs will not result in the accurate placement of pseudoextinct orders when the immediate hypothetical ancestor is unknown. Nevertheless, the inclusion of HPAs, with or without fossils, resulted in greater congruence with well-established clades than analyses with extant taxa only [15] or with the inclusion of fossils but not HPAs (also see [34]).

The challenge of placing insectivores and myrmecophages

Ten placental orders (53%) (Afrosoricida, Carnivora, Chiroptera, Dermoptera, Eulipotyphla, Macroscelidea, Pholidota, Primates, Scandentia, Tubulidentata) were recovered in incorrect phylogenetic positions in all seven analyses. Five of these orders (Afrosoricida, Dermoptera, Eulipotyphla, Macroscelidea, Scandentia) are comprised of taxa that Wagner [50] included in his Insectivora. The polyphyletic origins of Wagner’s Insectivora were recognized by morphologists who removed menotyphlan insectivores from this group and placed them in the orders Dermoptera, Macroscelidea, and Scandentia [51, 52]. However, morphologists continued to support the monophyly of Lipotyphla (solenodons, moles, shrews, hedgehogs, tenrecs, golden moles) [51, 5355] until this group was shown to be polyphyletic based on molecular data and divided into Afrosoricida and Eulipotyphla [5659]. Even with the general acceptance of lipotyphlan polyphyly, cladistic analyses of morphology-only data sets have continued to support the monophyly of this clade [31]. Our analyses confirm the difficulty of correctly placing “insectivores” in cladistic analyses with morphological data. Indeed, Afrosoricida and Eulipotyphla always join each other when one of these groups is pseudoextinct.

Beyond the various “insectivore” groups (Insectivora, Menotyphla, Lipotyphla) that have subsequently been shown to be polyphyletic, Edentata is a polyphyletic taxon that is historically comprised of the myrmecophagous orders Xenarthra, Pholidota [2224] and sometimes Tubulidentata [31]. Two of these orders (Pholidota, Tubulidentata) are among those that were always misplaced in our pseudoextinction analyses. Pholidotans were always the sister taxon to Xenarthra or Pilosa when pseudoextinct. Tubulidentata, in turn, were usually the sister taxon to other myrmecophagous taxa (i.e., Pholidota, Xenarthra, Pholidota + Xenarthra) although in two cases (Scaffold, Scaffold + HPAs) were the sister taxon to Euungulata (Cetartiodactyla + Perissodactyla). With the exception of Tubulidentata + Euungulata, these results represent different incarnations of Edentata and underscore the challenges that are associated with accurately placing myrmecophagous taxa in morphological phylogenetic analyses. The order Carnivora is represented by Canis familiaris (dog) and Felis silvestris (European wildcat) in O’Leary et al.’s [29] data set. With a few exceptions (e.g., Proteles cristata), the members of Carnivora are not insectivorous/myrmecophagous. However, the sister taxon to Carnivora is an edentulous, myrmecophagous order (Pholidota) [33] and was misplaced in all seven analyses when pseudoextinct. This result further highlights the difficulty of reconstructing clades such as Ferae (Carnivora + Pholidota) that include myrmecophagous orders.

By contrast with our results for Pholidota and Tubulidentata, the myrmecophagous Tamandua tetradactyla (Order Pilosa) and largely myrmecophagous Dasypus novemcinctus (Order Cingulata) were often recovered as sister groups to each other when pseudoextinct (Cingulata in 6 of 7 analyses; Pilosa in 5 of 7 analyses; see Table 2). Previously Springer et al. [15] performed a pseudoextinction analysis with O’Leary et al.’s [31] data set and treated Cingulata and Pilosa as a single order (Xenarthra). Pseudoextinction analyses with Xenarthra resulted in this taxon jumping inside of Afrotheria and joining with Tubulidentata [15]. Similarly, Springer et al. [12] performed pseudoextinction analyses with Xenarthra and a much smaller data set (185 osteological characters; [60]), and recovered Cingulata as sister to Tubulidentata and Pilosa as sister to Pholidota. These results suggest that morphological data will recover more accurate positions for extinct taxa when a suitably close living relative is also included in the analysis.

Effect of HPAs

The inclusion of the HPAs resulted in a greater number of pseudoextinct orders that were recovered as monophyletic. For two analyses, Scaffold + Fossils + HPA and Fossils + HPA, all pseudoextinct orders were recovered as monophyletic. In the Scaffold + HPA and HPA only analyses only one pseudoextinct order (5%) (Cetartiodactyla) was not monophyletic. In the HPA only analyses Cetacea was recovered outside of an ungulate clade that included Perissodactyla and terrestrial Cetartiodactyla. In the Scaffold + HPA analyses, Cetacea was recovered as the sister group to Sirenia. The increase in the number of monophyletic (pseudoextinct) orders that resulted from the inclusion of HPAs suggests that increased fossil sampling may also increase the accuracy of morphological phylogenetic analyses if a sufficient number of transitional fossils with appropriate combinations of primitive and derived characters are included in the analysis.

However, HPAs may not replicate extinct taxa in the fossil record [34]. Thus, new fossil discoveries may not have the same positive effect as HPAs when they are included in a cladistic analysis. Additionally, the fossil record is highly incomplete and many gaps in the fossil record will likely be permanent due to environmental factors that did not favor fossilization [61]. Thus, even with additional fossil discoveries, it is unlikely that all ancestors for a large group will ever be known. Our results show that the absence of even a single ancestor can be severely detrimental to the accurate phylogenetic reconstruction of some pseudoextinct groups. We acknowledge that the analyses of Beck and Baillie [32] represent a best-case scenario for future fossil discoveries. It remains to be tested if close relatives that are not ancestral would be as beneficial as ancestral species to improve the performance of morphological phylogenetic analyses.

Future directions

Fossil species comprise the majority of taxa in the mammalian Tree of Life. The discovery of fossil species and the accurate placement of these fossils in phylogenetic trees is crucial for both molecular dating analyses and reconstructing temporal sequences of character acquisition. The inclusion of hypothetical predicted ancestors is not a panacea for morphological phylogenetic analyses with extinct taxa, but the inclusion of HPAs may increase the accuracy of morphological phylogenetic analyses relative to analyses that lack these ancestors. It is crucial to continue using fossil taxa to elucidate the evolutionary history of eutherian mammals. Despite our results, it is possible that additional fossil taxa will increase the accuracy of morphological phylogenetics. O’Leary et al.’s [31] data set includes far more morphological characters (>4500) than any other data set for mammals, but only includes 40 fossil taxa that span a vast amount of evolutionary time (from ~200 million years ago to the Recent). The inclusion of additional fossil taxa may break up long branches that are prone to long-branch attraction and improve the accuracy of morphological phylogenetics in estimating the higher-level phylogeny of mammals. Future work should aim to increase the diversity of fossil taxa that are used alongside extant taxa in morphological or total-evidence analyses.

Supporting information

S1 Fig. Fossil taxa + hypothetical predicted ancestors analyses.

All pruned trees for one of our seven sets of analyses (fossil taxa + hypothetical predicted ancestors) including the trees that resulted from an analysis where no taxa were pseudoextinct. In agreement with Beck and Baillie [32], both most parsimonious trees (MPTs) that were recovered when no taxa were pseudoextinct reconstructed all four placental superorders (Afrotheria, Xenarthra, Laurasiatheria, Euarchontoglires) as monophyletic. All pseudoextinction analyses recovered a single MPT except for Carnivora (2 MPTs), Eulipotyphla (2 MPTs), Macroscelidea (2 MPTs), Scandentia (2 MPTs) and Tubulidentata (3 MPTs).

https://doi.org/10.1371/journal.pone.0257338.s001

(PDF)

S2 Fig. Fossil taxa analyses.

All pruned trees for one of our seven sets of analyses (fossil taxa only) including the tree that resulted from an analysis where no taxa were pseudoextinct. The most parsimonious tree (MPT) found when no taxa were pseudoextinct failed to recover Afrotheria, Laurasiatheria, and Euarchontoglires as monophyletic. Xenarthra, while monophyletic, was recovered within a larger “edentate” group including Pholidota and Tubulidentata. All pseudoextinction analyses found a single MPT except for Carnivora and Hyracoidea, which each had two MPTs.

https://doi.org/10.1371/journal.pone.0257338.s002

(PDF)

S3 Fig. Hypothetical predicted ancestors analyses.

All pruned trees for one of our seven sets of analyses (hypothetical predicted ancestors only) including the trees that resulted from an analysis where no taxa were pseudoextinct. The most parsimonious trees (MPTs) found when no taxa were pseudoextinct failed to recover Afrotheria, Xenarthra, and Laurasiatheria as monophyletic. Euarchontoglires was recovered as monophyletic in all 3 MPTs that lacked pseudoextinct taxa, although in each case the interordinal relationships within Euarchontoglires and intraordinal relationships within Rodentia conflicted with the molecular scaffold. All pseudoextinction analyses found a single MPT exception for Cetartiodacyla (2 MPTs), Proboscidea (3 MPTs), Rodentia (2 MPTs), Sirenia (3 MPTs), and Tubulidentata (2 MPTs).

https://doi.org/10.1371/journal.pone.0257338.s003

(PDF)

Acknowledgments

We thank the Willi Hennig Society for providing open access to TNT and Robin Beck for his R script. We thank Robert Asher and Robin Beck for helpful comments on earlier versions of this manuscript.

References

  1. 1. Darwin C. On the origin of species. England: John Murray; 1859.
  2. 2. Giribet G. Morphology should not be forgotten in the era of genomics–a phylogenetic perspective. Zool. Anz. 2015;256:96–103. pmid:26212884
  3. 3. Lee MS, Palci A. Morphological phylogenetics in the genomic age. Curr. Biol. 2015;25(19):R922–9. pmid:26439355
  4. 4. Murphy WJ, Foley NM, Bredemeyer KR, Gatesy J, Springer MS. Phylogenomics and the genetic architecture of the placental mammal radiation. Annu. Rev. Anim. Biosci. 2020;9. pmid:33228377
  5. 5. Hillis DM. Molecular versus morphological approaches to systematics. Annu. Rev. Ecol. Syst. 1987;18(1):23–42.
  6. 6. Zou Z, Zhang J. Morphological and molecular convergences in mammalian phylogenetics. Nat. Commun. 2016;7:12758. pmid:27585543
  7. 7. Hofreiter M, Serre D, Poinar HN, Kuch M, Pääbo S. Ancient DNA. Nat. Rev. Genet. 2001;2(5):353. pmid:11331901
  8. 8. Guillerme T, Cooper N. Assessment of available anatomical characters for linking living mammals to fossil taxa in phylogenetic analyses. Biol. Lett. 2016;12(5):20151003. pmid:27146442
  9. 9. Koch NM, Parry LA. Death is on our side: paleontological data drastically modify phylogenetic hypotheses. Syst. Biol. 2020;69(6):1052–1067 pmid:32208492
  10. 10. Mongiardino Koch N, Garwood RJ, Parry LA. Fossils improve phylogenetic analyses of morphological characters. Proc. Biol. Sci. 2021;288(1950):20210044. pmid:33947239
  11. 11. Wiens JJ. The role of morphological data in phylogeny reconstruction. Syst. Biol. 2004;53(4):653–61. pmid:15371253
  12. 12. Springer MS, Burk-Herrick A, Meredith R, Eizirik E, Teeling E, O’Brien SJ, et al. The adequacy of morphology for reconstructing the early history of placental mammals. Syst. Biol. 2007;56(4):673–84. pmid:17661234
  13. 13. Springer MS, Meredith R, Teeling E, Murphy W. Technical comment on “The placental mammal ancestor and the post–K-Pg radiation of placentals.” Science. 2013;341(6146):613.
  14. 14. Springer MS, Emerling CA, Meredith RW, Janečka JE, Eizirik E, Murphy WJ. Waking the undead: Implications of a soft explosive model for the timing of placental mammal diversification. Mol. Phylogenet. Evol. 2017;106:86–102. pmid:27659724
  15. 15. Springer MS, Foley N, Brady P, Gatesy J, Murphy W. Evolutionary models for the diversification of placental mammals across the KPg boundary. Front. Genet. 2019;10:1241. pmid:31850081
  16. 16. Foley NM, Springer MS, Teeling EC. Mammal madness: is the mammal tree of life not yet resolved? Philos. Trans. R. Soc. Lond. B. Biol. Sci. 2016;371(1699):20150140. pmid:27325836
  17. 17. Madsen O, Scally M, Douady CJ, Kao DJ, DeBry RW, Adkins R, et al. Parallel adaptive radiations in two major clades of placental mammals. Nature. 2001;409(6820):610. pmid:11214318
  18. 18. Murphy WJ, Eizirik E, Johnson WE, Zhang YP, Ryder OA, O’Brien SJ. Molecular phylogenetics and the origins of placental mammals. Nature. 2001;409(6820):614–8. pmid:11214319
  19. 19. Murphy WJ, Eizirik E, O’Brien SJ, Madsen O, Scally M, Douady CJ, et al. Resolution of the early placental mammal radiation using Bayesian phylogenetics. Science. 2001;294(5550):2348–51. pmid:11743200
  20. 20. Springer MS, Murphy WJ, Eizirik E, O’Brien SJ. Placental mammal diversification and the Cretaceous–Tertiary boundary. Proc. Natl. Acad. Sci. 2003;100(3):1056–61. pmid:12552136
  21. 21. Springer MS, Murphy WJ, Eizirik E, O’Brien SJ. Molecular evidence for major placental clades. In: The Rise of Placental Mammals. 2005. p. 45–64.
  22. 22. Novacek MJ. Fossils, topologies, missing data, and the higher level phylogeny of eutherian mammals. Syst. Biol. 1992;41(1):58–73.
  23. 23. Novacek MJ. Mammalian phylogeny: shaking the tree. Nature. 1992;356(6365):121–5. pmid:1545862
  24. 24. Novacek MJ, Wyss AR. Higher-level relationships of the recent eutherian orders: morphological evidence. Cladistics. 1986;2(4):257–87.
  25. 25. Shoshani J, McKenna MC. Higher taxonomic relationships among extant mammals based on morphology, with selected comparisons of results from molecular data. Mol. Phylogenet. Evol. 1998;9(3):572–84. pmid:9668007
  26. 26. Szalay FS, Lucas SG. Cranioskeletal morphology of archontans, and diagnoses of Chiroptera, Volitantia, and Archonta. In: Primates and their relatives in phylogenetic perspective. Springer; 1993. p. 187–226.
  27. 27. Springer MS, Meredith RW, Eizirik E, Teeling E, Murphy WJ. Morphology and placental mammal phylogeny. Syst. Biol. 2008;57(3):499–503. pmid:18570041
  28. 28. Asher RJ, Hofreiter M. Tenrec phylogeny and the noninvasive extraction of nuclear DNA. Syst. Biol. 2006;55(2):181–94. pmid:16522569
  29. 29. Pattinson DJ, Thompson RS, Piotrowski AK, Asher RJ. Phylogeny, paleontology, and primates: do incomplete fossils bias the tree of life? Syst. Biol. 2014;64(2):169–86. pmid:25239212
  30. 30. Sansom RS, Wills MA. Fossilization causes organisms to appear erroneously primitive by distorting evolutionary trees. Sci. Rep. 2013;3:2545. pmid:23985991
  31. 31. O’Leary MA, Bloch JI, Flynn JJ, Gaudin TJ, Giallombardo A, Giannini NP, et al. The placental mammal ancestor and the post–K-Pg radiation of placentals. Science. 2013;339(6120):662–7. pmid:23393258
  32. 32. Beck RM, Baillie C. Improvements in the fossil record may largely resolve current conflicts between morphological and molecular estimates of mammal phylogeny. Proc. Biol. Sci. 2018;285(1893):20181632. pmid:30963896
  33. 33. Meredith RW, Janečka JE, Gatesy J, Ryder OA, Fisher CA, Teeling EC, et al. Impacts of the Cretaceous Terrestrial Revolution and KPg extinction on mammal diversification. Science. 2011;334(6055):521–4. pmid:21940861
  34. 34. Asher R, Smith MR, Rankin A, Emry R. Congruence, fossils, and the evolutionary tree of rodents and lagomorphs. R. Soc. Open Sci. 2019;6(7):190387. pmid:31417738
  35. 35. Carillo JD, Asher RJ. An exceptionally well-preserved skeleton of Thomashuxleya externa (Mammalia, Notoungulata), from the Eocene of Patagonia, Argentina. Palaeontol. Electronica. 2017;online.
  36. 36. Springer MS, Teeling EC, Madsen O, Stanhope MJ, de Jong WW. Integrated fossil and molecular data reconstruct bat echolocation. Proc. Natl. Acad. Sci. USA. 2001;98(11):6241–6. pmid:11353869
  37. 37. Emerling CA, Huynh HT, Nguyen MA, Meredith RW, Springer MS. Spectral shifts of mammalian ultraviolet-sensitive pigments (short wavelength-sensitive opsin 1) are associated with eye length and photic niche evolution. Proc. Biol. Sci. 2015;282(1819):20151817. pmid:26582021
  38. 38. Esselstyn JA, Oliveros CH, Swanson MT, Faircloth BC. Investigating difficult nodes in the placental mammal tree with expanded taxon sampling and thousands of ultraconserved elements. Genome Biol. Evol. 2017;9(9):2308–21. pmid:28934378
  39. 39. dos Reis M, Inoue J, Hasegawa M, Asher RJ, Donoghue PC, Yang Z. Phylogenomic datasets provide both precision and accuracy in estimating the timescale of placental mammal phylogeny. Proc. Biol. Sci. 2012;279(1742):3491–500. pmid:22628470
  40. 40. Tarver JE, Dos Reis M, Mirarab S, Moran RJ, Parker S, O’Reilly JE, et al. The interrelationships of placental mammals and the limits of phylogenetic inference. Genome Biol. Evol. 2016;8(2):330–44. pmid:26733575
  41. 41. Robinson DF, Foulds LR. Comparison of phylogenetic trees. Math. Biosci. 1981;53(1–2):131–47.
  42. 42. Maddison WP, Maddison DR. Mesquite: a modeular system for evolutionary analysis [Internet]. 2018. Available from: http://www.mesquiteproject.org
  43. 43. Goloboff PA, Catalano SA. TNT version 1.5, including a full implementation of phylogenetic morphometrics. Cladistics. 2016;32(3):221–38.
  44. 44. Goloboff PA, Torres A, Arias JS. Weighted parsimony outperforms other methods of phylogenetic inference under models appropriate for morphology. Cladistics. 2018;34(4):407–37.
  45. 45. Sansom RS, Choate PG, Keating JN, Randle E. Parsimony, not Bayesian analysis, recovers more stratigraphically congruent phylogenetic trees. Biol. Lett. 2018;14(6):20180263. pmid:29925561
  46. 46. Schrago CG, Aguiar BO, Mello B. Comparative evaluation of maximum parsimony and Bayesian phylogenetic reconstruction using empirical morphological data. Journal of evolutionary biology. 2018;31(10):1477–84. pmid:29957887
  47. 47. Parins-Fukuchi C. Use of continuous traits can improve morphological phylogenetics. Syst. Biol. 2017;67(2):328–39.
  48. 48. Goloboff PA, Pittman M, Pol D, Xu X. Morphological data sets fit a common mechanism much more poorly than DNA sequences and call into question the Mkv model. Syst. Biol. 2019;68(3):494–504. pmid:30445627
  49. 49. Swofford DL. PAUP*: Phylogenetic analysis using parsimony (*and other methods). Sunderland, Massachusetts; 2003.
  50. 50. Wagner J A. Die Säugtiere in Abbildungen nach der Natur. Leipzig; 1855.
  51. 51. Butler PM. The problem of insectivore classification. In: Studies in vertebrate evolution. Oliver and Boyd; 1972. p. 253–65.
  52. 52. Leche W. Über die Säugethiergattung Galeopithecus: eine morphologische untersuchung. Vol. 21. Kungl. Svenska vetenskapsakademiens handlingar.; 1885. 1–92 p.
  53. 53. Butler PM. The skull of Ictops and the classification of the Insectivora. In: Proceedings of the Zoological Society of London. Wiley Online Library; 1956. p. 453–81.
  54. 54. Butler PM. Phylogeny of the insectivores. In: The phylogeny and classification of the tetrapods. Oxford: Clarendon Press; 1988. p. 117–41.
  55. 55. MacPhee RDE, Novacek MJ. Definition and relationships of Lipotyphla. In: Mammal phylogeny: Placentals. New York, Springer; 1993. p. 13–31.
  56. 56. Springer MS, Cleven GC, Madsen O, de Jong WW, Waddell VG, Amrine HM, et al. Endemic African mammals shake the phylogenetic tree. Nature. 1997;388(6637):61–4. pmid:9214502
  57. 57. Stanhope MJ, Madsen O, Waddell VG, Cleven GC, de Jong WW, Springer MS. Highly congruent molecular support for a diverse superordinal clade of endemic African mammals. Mol. Phylogenet. Evol. 1998;9(3):501–8. pmid:9667998
  58. 58. Stanhope MJ, Waddell VG, Madsen O, De Jong W, Hedges SB, Cleven GC, et al. Molecular evidence for multiple origins of Insectivora and for a new order of endemic African insectivore mammals. Proc. Natl. Acad. Sci. USA. 1998;95(17):9967–72. pmid:9707584
  59. 59. Waddell PJ, Okada N, Hasegawa M. Towards resolving the interordinal relationships of placental mammals. Syst. Biol. 1999;48(1):1–5. pmid:12078634
  60. 60. Asher RJ, Novacek MJ, Geisler JH. Relationships of endemic African mammals and their fossil relatives based on morphological and molecular evidence. J. Mamm. Evol. 2003;10(1–2):131–94.
  61. 61. Walker FM, Dunhill AM, Benton MJ. Variable preservation potential and richness in the fossil record of vertebrates. Palaeontology. 2019;63(2):313–29.