Adam, Eve and Population Genetics: A Reply to Dr. Richard Buggs (Part 1)

Chris_Falter · December 26, 2017, 6:06pm

Question: Why is this an accurate simulation for haplotype distributions? A Gaussian distribution is essentially pure noise except for the mean and the variance. @DennisVenema is claiming that the existence of TMRCA values > mya is a legitimate signal of ancestry, not just noise. If the observed PDF across the genome is the result of a stochastic process that makes some regions seem younger than the ground truth (as measured by TMRCA), then the noise should not be interpreted as nullifying the signal from regions at extrema that point to a very ancient TMRCA.

Does that make sense?

glipsnort · December 26, 2017, 6:15pm

I don’t know what you mean by correcting for interbreeding here. If many of us are descendants of ancient interbreeding between lineages that diverged at least half a million years ago (which we are), then we are not a product of a bottleneck of size two within the last half million years. To take seriously a recent tight bottleneck, you don’t have to correct for introgression – you have to ignore it.

Swamidass · December 27, 2017, 9:59pm

I’ll post more later. This is just regarding @DennisVenema’s claim re: Homo sapiens. If we care about measuring Homo sapien population size, we have to correct for interbreeding.

The good news is that I found exactly the data we need to do this study. It includes a scan of phylogenies constructed across the entire autosomal genome. Each phylogeny is computed for non-recombining blocks, which corrects for recombination introduced artifacts. From this we can compute TMRCA and T4MRCA across the whole genome, by looking at the times of the first and third coalescent nodes. Once we rescale to years with the mutation rates, we will have the distribution across the whole genome.

We can test to see how well TMRCA / 4 = TMR4A.
We can see the distribution of TMR4A across the genome.
We can identify the outlier areas, or subset on any region of the genome.

The bad news is that the source data is 424 GB of compressed data. I got a plan to handle it though. If that is a good analysis to do?

What is everyone’s thoughts? (especially @glipsnort, @DennisVenema, and @RichardBuggs) Not expecting a response till Jan of course.

glipsnort · December 28, 2017, 1:38pm

I have to admit that I haven’t looked at the precise wording of Dennis’s statements lately, but my impression was that his claims were not about “organisms arbitrarily labelled as Homo sapiens” but about “us”, which would correspond to contemporary Homo sapiens. Contemporary Homo sapiens did not go through a bottleneck of size two 200,000 years ago. My ancestors did not go through a tight bottleneck 200,000 years ago. At that time, some of them were living in Africa and some of them were living in Europe. I still don’t see why the demographic history of one branch of the structured population I descend from has anything other than purely academic interest to anyone. The fact that many (but not all) biologists have slapped different names on the different branches doesn’t strike me as relevant at all.

tallen_1 · December 28, 2017, 3:02pm

Josh, would you agree with this statement based on your understanding of the evidence?

Jon_Garvey · December 28, 2017, 3:18pm

“Sir, Is it on your grandfather’s or your grandmother’s side that you claim descent from a tight bottleneck 200,000 years ago?”

gbrooks9 · December 28, 2017, 4:59pm

I once pointed out in another group that if we go back far enough, both sides of all human lineage is African!

I thought that was a pretty clever turn of the phrase. But wouldn’t you know I was soon confronted by a Texan, amateur anthropologist and white supremacist, who insisted that some human races have no ancestors from Africa.

I turned him off and it was blissfully quiet.

glipsnort · December 28, 2017, 5:35pm

Excuse me, but in my family we prefer not to talk about “those” relatives.

Lynn_Munter · December 28, 2017, 6:12pm

Are you implying, sir, that it has become onerous to scroll the length of this thread?

I recently shared the fruits of such scrolling to answer this same question in comment 361, to save others the trouble.

glipsnort · December 28, 2017, 6:32pm

Yeah, that was the quotation I was basing my response on; I exploited your effort without citing it (or even liking it – bad glipsnort). What I don’t know is whether Dennis wrote other things that phrased the claim in different ways, and to which this discussion would actually be relevant.

Lynn_Munter · December 28, 2017, 8:57pm

I went through the first 290 posts looking for any relevant statements by @DennisVenema. I don’t see any that can be construed as saying that a bottleneck to two in Homo Sapiens (as opposed to “our lineage” including interbreeding) is ruled out with certainty comparable to heliocentrism. This is getting pretty nitpicky, but if it aids the discussion then good.

DennisVenema:

A few comments: the paper that suggests Homo sapiens back at ~300,000 years ago came out after AatG was published, but I don’t see how it really affects things. I don’t see any evidence of a bottleneck in our lineage for the last 1 MYA or more. Even Zhao (2000) has a range of TMRCA/4 values (if we want to go with that) that easily pushes back past 300,000.

(Of course, we need to look at the distribution of TMRCA values across the genome as a whole - not just at one 10kb region, as you’ve noted.)

Also, those fossils are not uniformly accepted as Homo sapiens. They have some features that are decidedly different than Homo sapiens at 200,000 years ago. This is to be expected. Our species becomes a species as a gradient biologically. With a “perfect” fossil record over the last 2MY, we presumably would have a smooth gradient between early H. erectus all the way to modern humans in the present day. As such, it will become more and more difficult to “draw the line on the gradient” - as I discuss in the book- as we get a more complete picture of the fossil record. We’re a chronospecies.

DennisVenema:

Swamidass:

DennisVenema:

Is this positive evidence for the claim I make in Adam and the Genome? Yes. I can’t see how we can squeeze this variation into 2 people at 200,000 years ago (or even 300,000 years ago, if one was to accept Homo sapiens at this time). Can you, or can @RichardBuggs?
[/quote]It seems like you are limiting your claims to merely Homo sapiens, is that correct? I thought we were asking about bottlenecks in our distant past going back to the origin of Homo about 2 million years ago. By limiting your claims this way, are you suggesting that you think it is possible for there to have been a single couple bottleneck, say, 2 million years ago?

In the book I make the claim that humans evolve as a population, and I define human as “our species” - Homo sapiens. (That’s not to say that other species might not have had the image of God, and so on.) In the book I place H. sapiens at 200 KYA, as was the consensus at the time. I think the evidence is solid that there is not a bottleneck to 2 in our ancestry for at least the last 500,000 years, but I don’t defend that in the book per se. From there it gets progressively more challenging to make a strong claim, but I think that even 1,000,000 years is reasonable to exclude a bottleneck. 2MYA? By now we’re at the limit of PSMC and other similar modelling approaches.

Now, is there positive evidence for a bottleneck to 2 (or even 20 or 200 or 2000?) at any time in the last 3,000,000 years or more? Nope. At best one could claim that our present-day methods cannot exclude the possibility between 1-3 MYA.

ETA: Adding some more past 290:

DennisVenema:

The heliocentric quote, which I thought was the object of your concern, is about humans (Homo sapiens). When I’m speaking about our lineage leading up to humans at 200KYA I use “lineage” or similar.

The other two quotes remain valid. Does “it seems” sound like I’m saying this is as certain as heliocentrism? That would be quite the understatement. That is a summary statement of all the lines of evidence in the literature to date that do not provide support for a bottleneck below ~10,000 at any time in the last 18MY (which remains the case).

"All methods employed to date agree that the human lineage has not dipped below several thousand individuals for the last 3 million years or more – long before our lineage was even remotely called “human”.

This quote also remains valid. There are no studies in the literature that support a lower bottleneck, and several that support large Ne values over this timeframe (PSMC and LD studies, for example). If there were (perhaps if I had missed one somewhere?) I’m sure you would point it out if you were aware of it.

So: “heliocentric certain”: humans. Pretty darn certain: lineage leading to humans over the last several hundred thousand years (say back to ~500,000 years ago). Confident but not as definitive: lineage over the last few million years. Survey of literature to date: no evidence of a bottleneck greater than thousands anywhere, regardless of time.

DennisVenema:

It’s up the thread ^ . “Heliocentrism certain” = humans. 200KYA in the book. I’m ok with stretching that to 300KYA if needed.

Also including @Swamidass’ restatement which has become the jumping-off point for much subsequent discussion:[quote=“Swamidass, post:322, topic:37039”]
Dennis,

Very interesting papers. I’ve read the first one already, but not the second. However, I am not sure how they help your case. As I understand it, you are trying to demonstrate…

Homo sapiens specifically do not dip down to a single couple in 300 kya to the confidence we have in heliocentrism.

Our ancestors as a whole do not dip down to a single couple between 300 kya and 3 mya with very high confidence, but maybe not as high.

As you put it…

DennisVenema: The heliocentric quote, which I thought was the object of your concern, is about humans (Homo sapiens). When I’m speaking about our lineage leading up to humans at 200KYA I use “lineage” or similar.

DennisVenema: So: “heliocentric certain”: humans. Pretty darn certain: lineage leading to humans over the last several hundred thousand years (say back to ~500,000 years ago). Confident but not as definitive: lineage over the last few million years.

Swamidass: As I understand it, the first claim appears to be novel, and I cannot find it in the literature anywhere. It would be help to see a paper that estimates population size of Homo sapiens specifically. I have not been able to find one. Have you?

Charles_Alexandre · December 29, 2017, 2:36pm

I fully agree with you and I’ve raised similar points about people’s tone in other threads, only to be told otherwise by moderators. Obviously, it’s nearly impossible to police for tone leaving only overtly hostile or crude comments as objectively inappropriate. However, my guess is that moderators who frequently see and censor that sort of dialogue end up with overly low expectations of what civil dialogue looks like. Unfortunately, to non-experts reading these sorts of exchanges, tone plays a large role because it’s one of the few elements that they can actually assess. So while it’s not really realistic to moderate, I think that we could at least admit that it could often be better and that as Christian brothers and sisters, we should strive for the most gracious dialogue possible. I think that Dennis (and all Biologos writers, really), are completely above board on this and I honestly think that that is one of the most persuasive aspects of their position.

jpm · December 29, 2017, 3:56pm

While I have trouble what post of tallen_1 you are referring to, agree with your comments on tone. As a suggestion, if you comment on a post that is way back in the thread, highlight and quote it to allow others to easily see what you are referring to.
Part of the problem is that some seem to see these discussions as competitions to be won by the knights carrying their colors, when they are more an effort to reach understanding of the relative positions.

tallen_1 · December 30, 2017, 3:32pm

It’s a bit odd to see this dug back up, but my concern now is simply one of basic fairness rather than tone. By now we’re all accustomed to Richard’s inquisitorial conversation style. It is what it is. What I do want to see happen, and what I think Richard owes his conversational partners on this thread, is to abide by his same standards of accountability in openly and honestly answering questions posed to one about what the evidence does or does not support. As long as he plays by the same rules as he holds others to, then I think we’re good. But if he thinks the only important questions to answer are the ones he pushes, then I don’t think we are. He has some unanswered pertinent questions put to him by Dennis and others. Let’s see what he does with them.

GJDS · December 31, 2017, 12:12am

I have not read every single post, but I have read with interest the various expert opinions, and for what it is worth, I give my impression.

The certainty that is displayed by one side seems to be the absence of modelling that would support a “bottleneck” of two (I find it hard to believe two creatures are a bottleneck? but this is the terminology).

The questions that are asked however point to a “lack of similar certainty” regarding what I think are important parameters in the modelling.

I (as a non-participant) am at a loss as to the physical evidence that is supposed regarding this bottleneck of a few hundred or whatever number is discussed. Am I missing something? Has someone discovered remains at a particular location to show such a group existed at some point in time? If not, is this not an inference derived from the modelling. If there is direct physical evidence, why is this not discussed at length, as it would be relevant to the certainty/uncertainty aspect of the discussion?

Jon_Garvey · December 31, 2017, 9:21am

Yes - one can use considerable expertise and ingenuity to argue whether the population genetics model supports a particular contention or not. That’s the nature of this thread.

But the question of whether the model itself is adequately valid over such timescales, given its known limitations and the state of flux of theories of large-scale evolution, is a significant one.

“All models are wrong - some are useful”. But their utility is only measurable by the ability to validate them by independent observation under the situation for which they are being used - in this case the origin of humanity defined, at least, as our species or even across hominin species by some protagonists. That’s very different from studying the evolution of Y-chromosomes in the living population.

In this case, validation would seem to require counting fossils that are as rare as hens’ teeth - in the absence of physical evidence, the population genetics model seems to validate itself in a circular manner.

Chris_Falter · December 31, 2017, 1:49pm

Jon_Garvey:

But their utility is only measurable by the ability to validate them by independent observation under the situation for which they are being used - in this case the origin of humanity defined, at least, as our species or even across hominin species by some protagonists. [/quote]

The models are estimating the minimum size of an ancestral population at various points on the past. The models can be validated by examining their predictions for the current state of the genome(s). Also, their assumptions regarding the initial state and regarding rates of various kinds of mutations can be supported or disproven by experimental evidence regarding current states and rates.

That said, if this modeling were easy, we non-biologists would not have had the opportunity to grab some popcorn and enjoy the show.

[quote=“Jon_Garvey, post:383, topic:37039”]That’s very different from studying the evolution of Y-chromosomes in the living population.

AFAIK, it’s basically the same thing, but on a longer time scale and across more of the genome than typical Y-chromosome studies.

Perhaps one of our biologist friends like @Swamidass, @glipsnort, @DennisVenema or @RichardBuggs would be able to shed more light.

GJDS · December 31, 2017, 11:06pm

Perhaps you would elaborate; what predictions can any model make on what seems to be current data used to set up the model itself?

Chris_Falter · January 1, 2018, 1:31am

Good question, George. Assuming I have understood this thread, the data being predicted/compared with current state and the data used in rate calculations are not the same. The predictions/observations of current state focus on the distribution of genomic features, whereas the assumptions of rates of change are based on known history.

GJDS · January 1, 2018, 8:40am

Hi Chris,

Validation of models, as I have practiced (and is commonly understood) requires a result from the model to be similar to an observation/measurement of a system independent of the model data base. Within this I have a difficult time noting such validation of the models discussed here. Since a major point is the size and time of a bottleneck, for example, validation would be considered by using physical data of a real bottleneck. There may be other ways, and if you can identify them I would be interested to know.

This is not a question on the technicalities of the modelling proceure.