Nested Clades, The Consistency Index, and Affirming the Consequent

EricMH · October 17, 2020, 6:06pm

Perfect trees always have a consistency index (CI) score of 1. A CI score less than 1 means a non-tree DAG fits the data better than a tree. If you look at the Klassen graph, almost all the studies are less than 1, and many are much less than 1, especially as you go into higher taxa. If you subtract CI from 1 you get another metric called the homoplasy index (HI), which is a measure of how untreelike the data is, i.e. how often the same character shows up in completely different clades.

The homoplasy index (HI) is simply 1 − CI.

So, if we flip the Klassen 1991 graph on its head, we get a graph of HI vs taxa count. As you see, the results actually show the more data we get, the better it is fit by an untreelike DAG, i.e. much closer to what we see in design scenarios. In fact, the ‘phylogenetic signal’ would be more aptly named the ‘design signal’. This is also why I am finding it so easy to generate ‘high’ CI scores from graph structures that don’t look anything like an evolutionary tree.

paleomalacologist · October 18, 2020, 1:40am

A tree with no convergence would have a consistency index of 1. How you calculate CI will determine whether you get a high consistency index or not with unresolved branches - how are you handling the lack of resolution? Basically, it’s an apples and oranges problem - a CI value for a tree that is not completely bifurcating is not measuring the same thing as a tree that is completely bifurcating, just as comparing tree statistics generated by different data sets is not generally very meaningful. What version of PAUP are you using? In the current PAUP* 4.0a168 (beta testing), if you scroll down the analysis menu, bootstrap/jackknife gives you ways to test the strength of groupings in the tree through different resampling strategies. Note that likelihood analyses are not too fast to begin with and the bootstrap or jackknife involves repeated analyses, so you need to allow a fair amount of time for it to give reasonable numbers. If you want to try parsimony analyses, TNT analyzes the data faster but is not as user-friendly - you’d want to build your file in PAUP and then analyze it with TNT.

Note that running PAUP*4, using your data set of
#NEXUS
Begin data;
Dimensions ntax=4 nchar=2;
Format datatype=DNA gap=- missing=X;
Matrix
taxon1 GA
taxon2 AT
taxon3 TC
taxon4 CG;
End;

a branch and bound search gives three trees with a CI of 1 (describe trees) but also it lists a CI excluding uninformative characters, and that CI is zero. The CI excluding uninformative characters is what is meaningful for assessing the type of question you are asking. Any data set where none of the examples actually share any features will automatically give a CI of 1, but none of those features tell any sort of phylogenetic analysis anything meaningful - there needs to be some commonalities to work with.

An appropriate comparison would be analyzing a data set made of several long random strings of characters - for convenience, say A, G, T, and C, and see how the results for that data set compares with a set of actual DNA sequences that are appropriate for a particular group of organisms. (By appropriate, I mean that it needs to fall somewhere between having essentially no change, which again would give high CI but be highly uninformative and having so much change that it is essentially randomized. As DNA has only A, G, T, and C to choose from, a sequence that mutates enough will have random matches with other sequences.)

The actual patterns observed with analyzing data for real organisms closely matches the expectations of an evolutionary pattern, which is to have pretty good nested clades but also some convergence, random variation, and other “noise” [from the point of view of someone trying to figure out the evolutionary relationships, those are noise, but not necessarily to someone asking other types of questions.]

paleomalacologist · October 23, 2020, 3:44pm

As previously pointed out, the paper by Farris available at https://onlinelibrary.wiley.com/doi/epdf/10.1111/j.1096-0031.1989.tb00573.x discusses limits of the consistency index and why some modifications can be advantageous to highlight where the consistency index alone is not very informative. For data from real organisms, a high consistency index suggests that the analysis is doing a reasonable job of matching the actual evolutionary pattern. However, there are likely to be a huge number of trees that differ very little in CI. A data set with very few potential synapomorphies will not give a very meaningful CI. For example, the sample data sets largely had each taxon unique, with no shared similarities. A data set where almost all taxa had nearly identical sequences would also not give a very meaningful CI. The CI would, however, distinguish between a set of features that originated by an evolutionary pattern versus one that originated from a “mix and match” approach. In principle, an intelligent designer could give bats and birds matching genes for wings and bats and cats matching genes for reproduction, etc., so that the similarities do not follow any consistent pattern. Such a scenario would yield a low CI. Of course, a design hypothesis cannot be tested unless it is specified how that design took place; a designer would not have to follow such an approach, but it is similar to a popular type of ID model. Conversely, bacteria can pick up all sorts of random DNA, so there is a good deal of mixing and matching without intelligent intervention.

EricMH · October 26, 2020, 2:06pm

I don’t know if you’ve been tracking the later results in this thread, but I can generate datasets that score highly in all the metrics people have mentioned. I can do this at the nucleotide level, as well as with binary characteristics. At this point I don’t know of any metrics that can distinguish my DAG datasets from real world datasets. So, as far as I can tell a DAG is a better fit for real world datasets than an evolutionary tree. I’ve posted links to my generators a couple times in this thread if you want to try it for yourself.

paleomalacologist · October 26, 2020, 3:12pm

But the metrics are not designed to distinguish between artificial data sets and real ones; they are designed to measure how well a tree fits a particular criterion. Again, this means that, for example, a CI of 0.91 for a molecular analysis on snails and a CI of .95 for a morphological analysis on bivalves doesn’t indicate that the morphological analysis was better. All that comparison tells us is that the morphological analysis had fewer examples of convergence. If two trees generated in analyzing the same data set show different CIs, then the higher CI is a tree with less convergence. If we believe that convergence is rare in this context, then we would think that the one with higher CI is better. But each metric is designed to measure a particular thing. Of course, you can design data sets to score well on any particular metric. But you need to consider whether your approach to generating a data set might mimic a possible evolutionary process, as well as what all the other options might be.

Analyzing the various types of molecular and morphological data that I study, I find that they fit very well with evolutionary models. I get reasonably consistent results when I use different analytical techniques; inconsistent results generally reflect cases where the data are not adequate.

EricMH · October 26, 2020, 5:12pm

Yes, that’s exactly my point. The metrics don’t really justify inference of an evolutionary tree, since that’s the presupposition. E.g. Ewert’s dependency graph is an alternate model which fits much better than an evolutionary tree. As you point out, these metrics don’t allow us to preclude such a model. In which case ‘nested clades’ is not a good piece of evidence for evolution. I rest my case.

Chris_Falter · October 26, 2020, 6:56pm

Hi Eric,

Take this with a grain of salt because I am not a biologist…

One of @paleomalacologist 's key points is that your artificial dataset is too unlike naturally occurring datasets to provide any useful insights.

I would encourage you to go back and deal with this substance rather than resting on supposed laurels gained in a battle of rhetoric.

Best,
Chris

system · November 2, 2020, 3:56pm

This topic was automatically closed 6 days after the last reply. New replies are no longer allowed.