<meta http-equiv="Content-Type" content="application/xhtml+xml; charset=UTF-8"/> <link rel="stylesheet" href="LaTeXML.css" type="text/css"/> <link rel="stylesheet" href="ltx-article.css" type="text/css"/> <link rel="stylesheet" href="latexmliness/plr-style.css" type="text/css"/> <script src="latexmliness/LaTeXML-maybeMathjax.js" type="text/javascript"/> <script src="latexmliness/adjust-svg.js" type="text/javascript"/> </head> <body> <div class="ltx_page_main"> <div class="ltx_page_content"> <div class="ltx_document"> <div id="S1" class="ltx_section"> <h1 class="ltx_title ltx_title_section"><span class="ltx_tag ltx_tag_section">1 </span>Summary statistics</h1> <div id="S1.p1" class="ltx_para"> <p class="ltx_p">First, we fix some notation. For a sample of individuals indexed by some set <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p1.m1" class="ltx_Math" display="inline" alttext="A"><semantics><mi>A</mi><annotation encoding="application/x-tex">A</annotation></semantics></math>, genotyped at a set of genomic positions indexed by <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p1.m2" class="ltx_Math" alttext="S" display="inline"><semantics><mi>S</mi><annotation encoding="application/x-tex">S</annotation></semantics></math>, the data are <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p1.m3" class="ltx_Math" display="inline" alttext="\{G_{ijk}\;:\;i\in A,\;j\in S,\;k\in\{m,p\}\}"><semantics><mrow><mo>{</mo><mrow><mpadded width="+2.777778pt"><msub><mi>G</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi><mo>⁢</mo><mi>k</mi></mrow></msub></mpadded><mo separator="true">: </mo><mrow><mrow><mi>i</mi><mo>∈</mo><mi>A</mi></mrow><mo separator="true">, </mo><mrow><mrow><mi>j</mi><mo>∈</mo><mi>S</mi></mrow><mo separator="true">, </mo><mrow><mi>k</mi><mo>∈</mo><mrow><mo>{</mo><mrow><mi>m</mi><mo>,</mo><mi>p</mi></mrow><mo>}</mo></mrow></mrow></mrow></mrow></mrow><mo>}</mo></mrow><annotation encoding="application/x-tex">\{G_{ijk}\;:\;i\in A,\;j\in 
S,\;k\in\{m,p\}\}</annotation></semantics></math>, i.e. <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p1.m4" class="ltx_Math" display="inline" alttext="G_{ijm}"><semantics><msub><mi>G</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi><mo>⁢</mo><mi>m</mi></mrow></msub><annotation encoding="application/x-tex">G_{ijm}</annotation></semantics></math> is the allele that the <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p1.m5" class="ltx_Math" alttext="i^{\mathrm{th}}" display="inline"><semantics><msup><mi>i</mi><mi>th</mi></msup><annotation encoding="application/x-tex">i^{\mathrm{th}}</annotation></semantics></math> individual inherited at the <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p1.m6" class="ltx_Math" alttext="j^{\mathrm{th}}" display="inline"><semantics><msup><mi>j</mi><mi>th</mi></msup><annotation encoding="application/x-tex">j^{\mathrm{th}}</annotation></semantics></math> position from her mother, and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p1.m7" class="ltx_Math" display="inline" alttext="G_{ijp}"><semantics><msub><mi>G</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi><mo>⁢</mo><mi>p</mi></mrow></msub><annotation encoding="application/x-tex">G_{ijp}</annotation></semantics></math> is the corresponding allele inherited from her father.</p> </div> <div id="S1.p2" class="ltx_para"> <p class="ltx_p">Regardless of the process that has generated <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p2.m1" class="ltx_Math" display="inline" alttext="G"><semantics><mi>G</mi><annotation encoding="application/x-tex">G</annotation></semantics></math>, it makes sense to think about the sampling distribution of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p2.m2" class="ltx_Math" display="inline" alttext="G"><semantics><mi>G</mi><annotation encoding="application/x-tex">G</annotation></semantics></math>, and associated statistics – i.e. 
the distribution of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p2.m3" class="ltx_Math" alttext="G" display="inline"><semantics><mi>G</mi><annotation encoding="application/x-tex">G</annotation></semantics></math> induced by some sort of random sampling of the individuals. Often, we can actually obtain from <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p2.m4" class="ltx_Math" alttext="G" display="inline"><semantics><mi>G</mi><annotation encoding="application/x-tex">G</annotation></semantics></math> a good estimate of the entire sampling distribution. For instance, we can estimate the distribution of the number of nucleotide differences between two individuals in a 100bp region, taken across all such regions and all pairs of sampled individuals, as long as <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p2.m5" class="ltx_Math" alttext="G" display="inline"><semantics><mi>G</mi><annotation encoding="application/x-tex">G</annotation></semantics></math> can be reasonably regarded as a random sample from some population. We can further estimate conditional sampling distributions, e.g. the number of such differences as a function of the geographical distance between the two individuals, or in protein coding regions. 
</p> </div> <div id="S1.p3" class="ltx_para"> <p class="ltx_p">Here we relate the sampling distributions of a number of statistics easily computable from <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.p3.m1" class="ltx_Math" alttext="G" display="inline"><semantics><mi>G</mi><annotation encoding="application/x-tex">G</annotation></semantics></math> to sampling distributions of properties of the pedigree with recombination.</p> </div> <div id="S1.p4" class="ltx_para"> <p class="ltx_p">We will study aspects of the distributions of these statistics under various “levels of sampling”:</p> <ol id="I1" class="ltx_enumerate"> <li id="I1.i1" class="ltx_item" style="list-style-type:none;"><span class="ltx_tag ltx_tag_enumerate">1.</span> <div id="I1.i1.p1" class="ltx_para"> <p class="ltx_p">Unconditional, including averaging over the population process.</p> </div></li> <li id="I1.i2" class="ltx_item" style="list-style-type:none;"><span class="ltx_tag ltx_tag_enumerate">2.</span> <div id="I1.i2.p1" class="ltx_para"> <p class="ltx_p">Conditional on the population pedigree (<math xmlns="http://www.w3.org/1998/Math/MathML" id="I1.i2.p1.m1" class="ltx_Math" alttext="\mathcal{P}" display="inline"><semantics><mi>𝒫</mi><annotation encoding="application/x-tex">\mathcal{P}</annotation></semantics></math>), and averaging over recombinations, segregation, and mutations.</p> </div></li> <li id="I1.i3" class="ltx_item" style="list-style-type:none;"><span class="ltx_tag ltx_tag_enumerate">3.</span> <div id="I1.i3.p1" class="ltx_para"> <p class="ltx_p">Conditional on the population ARG (<math xmlns="http://www.w3.org/1998/Math/MathML" id="I1.i3.p1.m1" class="ltx_Math" alttext="\mathcal{M}" display="inline"><semantics><mi>ℳ</mi><annotation encoding="application/x-tex">\mathcal{M}</annotation></semantics></math>), averaging over choices of individuals.</p> </div></li> <li id="I1.i4" class="ltx_item" style="list-style-type:none;"><span class="ltx_tag ltx_tag_enumerate">4.</span> <div 
id="I1.i4.p1" class="ltx_para"> <p class="ltx_p">Conditional on the population ARG (<math xmlns="http://www.w3.org/1998/Math/MathML" id="I1.i4.p1.m1" class="ltx_Math" display="inline" alttext="\mathcal{M}"><semantics><mi>ℳ</mi><annotation encoding="application/x-tex">\mathcal{M}</annotation></semantics></math>), averaging over choices of locus.</p> </div></li> </ol> <p class="ltx_p">Only the first needs a population model. The second is clearly fictitious, but can be useful (as we see below). The latter two can be interpreted as empirical distributions. Often, in practice, we have an empirical distribution obtained by averaging across many loci (as in the last point), and compare it to a theoretical distribution for a single locus under a population model. This is in principle wrong, since it ignores correlations between loci introduced by the pedigree, but in practice it seems to be a good approximation <cite class="ltx_cite">(<a href="#bib.bib7" title="Gene genealogies within a fixed pedigree, and the robustness of Kingman’s coalescent" class="ltx_ref">2</a>)</cite>.</p> </div> <div id="S1.SS1" class="ltx_subsection"> <h2 class="ltx_title ltx_title_subsection"><span class="ltx_tag ltx_tag_subsection">1.1 </span>Number of segregating sites</h2> <div id="S1.SS1.p1" class="ltx_para"> <p class="ltx_p">As a first example, we can relate the mean and variance of the number of segregating sites in a sample to the distribution of total time in the sample tree, similar to the calculation of the number of mutations in <cite class="ltx_cite">(<a href="#bib.bib11" title="Gene genealogies and the coalescent process" class="ltx_ref">1</a>)</cite>. 
The number of segregating sites, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p1.m1" class="ltx_Math" alttext="A_{0}" display="inline"><semantics><msub><mi>A</mi><mn>0</mn></msub><annotation encoding="application/x-tex">A_{0}</annotation></semantics></math>, in a sample of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p1.m2" class="ltx_Math" alttext="k" display="inline"><semantics><mi>k</mi><annotation encoding="application/x-tex">k</annotation></semantics></math> chromosomes at <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p1.m3" class="ltx_Math" alttext="|S|" display="inline"><semantics><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow><annotation encoding="application/x-tex">|S|</annotation></semantics></math> loci, is the number of sites at which at least two of the sampled chromosomes differ. For this to happen, there must have been a mutation at that site somewhere on the tree that relates the samples.</p> </div> <div id="S1.SS1.p2" class="ltx_para"> <p class="ltx_p">Concretely, suppose that we have sampled <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p2.m1" class="ltx_Math" alttext="k" display="inline"><semantics><mi>k</mi><annotation encoding="application/x-tex">k</annotation></semantics></math> chromosomes, and that (as above) the tree relating these samples at genomic position <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p2.m2" class="ltx_Math" display="inline" alttext="x"><semantics><mi>x</mi><annotation encoding="application/x-tex">x</annotation></semantics></math> is <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p2.m3" class="ltx_Math" display="inline" alttext="T_{x}"><semantics><msub><mi>T</mi><mi>x</mi></msub><annotation encoding="application/x-tex">T_{x}</annotation></semantics></math>. 
We measure the “length” of the tree in meioses, and denote by <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p2.m4" class="ltx_Math" display="inline" alttext="|T_{x}|"><semantics><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>x</mi></msub><mo fence="true">|</mo></mrow><annotation encoding="application/x-tex">|T_{x}|</annotation></semantics></math> the total number of meioses in the tree, back to the most recent common ancestor. Also, let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p2.m5" class="ltx_Math" display="inline" alttext="N_{x}"><semantics><msub><mi>N</mi><mi>x</mi></msub><annotation encoding="application/x-tex">N_{x}</annotation></semantics></math> denote the total number of mutations that have occurred at site <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p2.m6" class="ltx_Math" alttext="x" display="inline"><semantics><mi>x</mi><annotation encoding="application/x-tex">x</annotation></semantics></math> during any of the meioses anywhere in <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p2.m7" class="ltx_Math" display="inline" alttext="T_{x}"><semantics><msub><mi>T</mi><mi>x</mi></msub><annotation encoding="application/x-tex">T_{x}</annotation></semantics></math>. 
Under the usual assumption on mutations, this number is Poisson distributed with mean <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p2.m8" class="ltx_Math" alttext="\mu|T_{x}|" display="inline"><semantics><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>x</mi></msub><mo fence="true">|</mo></mrow></mrow><annotation encoding="application/x-tex">\mu|T_{x}|</annotation></semantics></math>.</p> </div> <div id="S1.SS1.p3" class="ltx_para"> <p class="ltx_p">Now let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p3.m1" class="ltx_Math" display="inline" alttext="X"><semantics><mi>X</mi><annotation encoding="application/x-tex">X</annotation></semantics></math> denote a randomly chosen locus, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p3.m2" class="ltx_Math" display="inline" alttext="T=|T_{X}|"><semantics><mrow><mi>T</mi><mo>=</mo><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>X</mi></msub><mo fence="true">|</mo></mrow></mrow><annotation encoding="application/x-tex">T=|T_{X}|</annotation></semantics></math>, and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p3.m3" class="ltx_Math" display="inline" alttext="N=N_{X}"><semantics><mrow><mi>N</mi><mo>=</mo><msub><mi>N</mi><mi>X</mi></msub></mrow><annotation encoding="application/x-tex">N=N_{X}</annotation></semantics></math>. 
It is straightforward that</p> <table id="S1.EGx1" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E1" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E1.m1" class="ltx_Math" display="inline" alttext="\displaystyle\mathbb{E}[N]=\mu\mathbb{E}[T]"><semantics><mrow><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mi>N</mi><mo>]</mo></mrow></mrow><mo>=</mo><mrow><mi>μ</mi><mo>⁢</mo><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mi>T</mi><mo>]</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\mathbb{E}[N]=\mu\mathbb{E}[T]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(1)</span></td></tr> </table> <p class="ltx_p">and using the formula for conditional partitioning of variance, </p> <table id="S1.EGx2" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E2" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E2.m1" class="ltx_Math" alttext="\displaystyle\var[N]" display="inline"><semantics><mrow><mi>var</mi><mo>⁡</mo><mrow><mo>[</mo><mi>N</mi><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\var[N]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E2.m2" class="ltx_Math" alttext="\displaystyle=\mathbb{E}[\var[N|T]]+\var[\mathbb{E}[N|T]]" display="inline"><semantics><mrow><mo>=</mo><mi>𝔼</mi><mrow><mo>[</mo><mi>var</mi><mrow><mo>[</mo><mi>N</mi><mo>|</mo><mi>T</mi><mo>]</mo></mrow><mo>]</mo></mrow><mo>+</mo><mi>var</mi><mrow><mo>[</mo><mi>𝔼</mi><mrow><mo>[</mo><mi>N</mi><mo>|</mo><mi>T</mi><mo>]</mo></mrow><mo>]</mo></mrow></mrow><annotation 
encoding="application/x-tex">\displaystyle=\mathbb{E}[\var[N|T]]+\var[\mathbb{E}[N|T]]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(2)</span></td></tr> <tr id="S1.E3" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E3.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\mathbb{E}[\mu T]+\var[\mu T]."><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mi>μ</mi><mo>⁢</mo><mi>T</mi></mrow><mo>]</mo></mrow></mrow><mo>+</mo><mrow><mi>var</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><mi>μ</mi><mo>⁢</mo><mi>T</mi></mrow><mo>]</mo></mrow></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}[\mu T]+\var[\mu T].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(3)</span></td></tr> </table> <p class="ltx_p">Note that this remains true if <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p3.m4" class="ltx_Math" alttext="\mu" display="inline"><semantics><mi>μ</mi><annotation encoding="application/x-tex">\mu</annotation></semantics></math> depends on <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p3.m5" class="ltx_Math" display="inline" alttext="x"><semantics><mi>x</mi><annotation encoding="application/x-tex">x</annotation></semantics></math>.</p> </div> <div id="S1.SS1.p4" class="ltx_para"> <p class="ltx_p">Now, by linearity of expectation,</p> <table id="S1.EGx3" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E4" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E4.m1" class="ltx_Math" 
alttext="\displaystyle\mathbb{E}[A_{0}]=\mathbb{E}[\sum_{x}N_{x}]=|S|\mathbb{E}[\mu T]." display="inline"><semantics><mrow><mrow><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msub><mi>A</mi><mn>0</mn></msub><mo>]</mo></mrow></mrow><mo>=</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mi>x</mi></munder></mstyle><msub><mi>N</mi><mi>x</mi></msub></mrow><mo>]</mo></mrow></mrow><mo>=</mo><mrow><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow><mo>⁢</mo><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mi>μ</mi><mo>⁢</mo><mi>T</mi></mrow><mo>]</mo></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle\mathbb{E}[A_{0}]=\mathbb{E}[\sum_{x}N_{x}]=|S|\mathbb{E}[\mu T].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(4)</span></td></tr> </table> <p class="ltx_p">Similarly, by conditioning on the collection of trees <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p4.m1" class="ltx_Math" display="inline" alttext="\mathcal{T}=\{T_{x}\colon x\in S\}"><semantics><mrow><mi>𝒯</mi><mo>=</mo><mrow><mo>{</mo><mrow><msub><mi>T</mi><mi>x</mi></msub><mo separator="true">:</mo><mrow><mi>x</mi><mo>∈</mo><mi>S</mi></mrow></mrow><mo>}</mo></mrow></mrow><annotation encoding="application/x-tex">\mathcal{T}=\{T_{x}\colon x\in S\}</annotation></semantics></math>, since <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p4.m2" class="ltx_Math" display="inline" alttext="N_{x}"><semantics><msub><mi>N</mi><mi>x</mi></msub><annotation encoding="application/x-tex">N_{x}</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p4.m3" class="ltx_Math" display="inline" alttext="N_{y}"><semantics><msub><mi>N</mi><mi>y</mi></msub><annotation encoding="application/x-tex">N_{y}</annotation></semantics></math> are 
conditionally independent given <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p4.m4" class="ltx_Math" alttext="T_{x}" display="inline"><semantics><msub><mi>T</mi><mi>x</mi></msub><annotation encoding="application/x-tex">T_{x}</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS1.p4.m5" class="ltx_Math" alttext="T_{y}" display="inline"><semantics><msub><mi>T</mi><mi>y</mi></msub><annotation encoding="application/x-tex">T_{y}</annotation></semantics></math>,</p> <table id="S1.EGx4" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E5" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E5.m1" class="ltx_Math" display="inline" alttext="\displaystyle\var[A_{0}]"><semantics><mrow><mi>var</mi><mo>⁡</mo><mrow><mo>[</mo><msub><mi>A</mi><mn>0</mn></msub><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\var[A_{0}]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E5.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\mathbb{E}[\var[A_{0}|\mathcal{T}]]+\var\left[\mathbb{E}[A_{0}|% \mathcal{T}]\right]"><semantics><mrow><mo>=</mo><mi>𝔼</mi><mrow><mo>[</mo><mi>var</mi><mrow><mo>[</mo><msub><mi>A</mi><mn>0</mn></msub><mo>|</mo><mi>𝒯</mi><mo>]</mo></mrow><mo>]</mo></mrow><mo>+</mo><mi>var</mi><mrow><mo>[</mo><mi>𝔼</mi><mrow><mo>[</mo><msub><mi>A</mi><mn>0</mn></msub><mo>|</mo><mi>𝒯</mi><mo>]</mo></mrow><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}[\var[A_{0}|\mathcal{T}]]+\var\left[\mathbb{E}[A_{0}|% \mathcal{T}]\right]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(5)</span></td></tr> <tr id="S1.E6" class="ltx_equation ltx_align_baseline"> <td 
class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E6.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\mathbb{E}\left[\sum_{x,y\in S}\cov[N_{x},N_{y}|\mathcal{T}]% \right]+\var\left[\sum_{x\in S}\mathbb{E}[N_{x}|\mathcal{T}]\right]"><semantics><mrow><mo>=</mo><mi>𝔼</mi><mrow><mo>[</mo><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><mrow><mi>x</mi><mo>,</mo><mi>y</mi></mrow><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mi>cov</mi><mrow><mo>[</mo><msub><mi>N</mi><mi>x</mi></msub><mo>,</mo><msub><mi>N</mi><mi>y</mi></msub><mo>|</mo><mi>𝒯</mi><mo>]</mo></mrow><mo>]</mo></mrow><mo>+</mo><mi>var</mi><mrow><mo>[</mo><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><mi>x</mi><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mi>𝔼</mi><mrow><mo>[</mo><msub><mi>N</mi><mi>x</mi></msub><mo>|</mo><mi>𝒯</mi><mo>]</mo></mrow><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}\left[\sum_{x,y\in S}\cov[N_{x},N_{y}|\mathcal{T}]% \right]+\var\left[\sum_{x\in S}\mathbb{E}[N_{x}|\mathcal{T}]\right]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(6)</span></td></tr> <tr id="S1.E7" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E7.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\mathbb{E}\left[\sum_{x\in S}\var[N_{x}|\mathcal{T}]\right]+\var% \left[\sum_{x\in S}\mathbb{E}[N_{x}|\mathcal{T}]\right]"><semantics><mrow><mo>=</mo><mi>𝔼</mi><mrow><mo>[</mo><mstyle displaystyle="true"><munder><mo 
movablelimits="false">∑</mo><mrow><mi>x</mi><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mi>var</mi><mrow><mo>[</mo><msub><mi>N</mi><mi>x</mi></msub><mo>|</mo><mi>𝒯</mi><mo>]</mo></mrow><mo>]</mo></mrow><mo>+</mo><mi>var</mi><mrow><mo>[</mo><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><mi>x</mi><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mi>𝔼</mi><mrow><mo>[</mo><msub><mi>N</mi><mi>x</mi></msub><mo>|</mo><mi>𝒯</mi><mo>]</mo></mrow><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}\left[\sum_{x\in S}\var[N_{x}|\mathcal{T}]\right]+\var% \left[\sum_{x\in S}\mathbb{E}[N_{x}|\mathcal{T}]\right]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(7)</span></td></tr> <tr id="S1.E8" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E8.m2" class="ltx_Math" alttext="\displaystyle=\mathbb{E}\left[\sum_{x\in S}\mu|T_{x}|\right]+\var\left[\sum_{x% \in S}\mu|T_{x}|\right]" display="inline"><semantics><mrow><mi/><mo>=</mo><mrow><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><mi>x</mi><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>x</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>]</mo></mrow></mrow><mo>+</mo><mrow><mi>var</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><mi>x</mi><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>x</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>]</mo></mrow></mrow></mrow></mrow><annotation 
encoding="application/x-tex">\displaystyle=\mathbb{E}\left[\sum_{x\in S}\mu|T_{x}|\right]+\var\left[\sum_{x% \in S}\mu|T_{x}|\right]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(8)</span></td></tr> <tr id="S1.E9" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E9.m2" class="ltx_Math" alttext="\displaystyle=\mathbb{E}\left[\sum_{x\in S}\mu|T_{x}|\right]+\sum_{x,y\in S}% \cov\left[\mu|T_{x}|,\mu|T_{y}|\right]" display="inline"><semantics><mrow><mi/><mo>=</mo><mrow><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><mi>x</mi><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>x</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>]</mo></mrow></mrow><mo>+</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><mrow><mi>x</mi><mo>,</mo><mi>y</mi></mrow><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>x</mi></msub><mo fence="true">|</mo></mrow></mrow><mo>,</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>y</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>]</mo></mrow></mrow></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}\left[\sum_{x\in S}\mu|T_{x}|\right]+\sum_{x,y\in S}% \cov\left[\mu|T_{x}|,\mu|T_{y}|\right]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(9)</span></td></tr> <tr id="S1.E10" class="ltx_equation ltx_align_baseline"> <td 
class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E10.m2" class="ltx_Math" alttext="\displaystyle=\mu|S|\mathbb{E}[T]+\mu^{2}|S|\var[T]+\sum_{x\neq y\in S}\cov% \left[\mu|T_{x}|,\mu|T_{y}|\right]." display="inline"><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow><mo>⁢</mo><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mi>T</mi><mo>]</mo></mrow></mrow><mo>+</mo><mrow><msup><mi>μ</mi><mn>2</mn></msup><mo>⁢</mo><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow><mo>⁢</mo><mrow><mi>var</mi><mo>⁡</mo><mrow><mo>[</mo><mi>T</mi><mo>]</mo></mrow></mrow></mrow><mo>+</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><mi>x</mi><mo>≠</mo><mi>y</mi><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>x</mi></msub><mo fence="true">|</mo></mrow></mrow><mo>,</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>T</mi><mi>y</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>]</mo></mrow></mrow></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\mu|S|\mathbb{E}[T]+\mu^{2}|S|\var[T]+\sum_{x\neq y\in S}\cov% \left[\mu|T_{x}|,\mu|T_{y}|\right].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(10)</span></td></tr> </table> <p class="ltx_p">This agrees with <cite class="ltx_cite">Hudson (<a href="#bib.bib11" title="Gene genealogies and the coalescent process" class="ltx_ref">1</a>)</cite> if we are restricted to a single, nonrecombining locus.</p> </div> </div> <div id="S1.SS2" class="ltx_subsection"> <h2 class="ltx_title ltx_title_subsection"><span class="ltx_tag 
ltx_tag_subsection">1.2 </span>Heterozygosity</h2> <div id="S1.SS2.p1" class="ltx_para"> <p class="ltx_p">The “observed heterozygosity” in a group of individuals in a genomic region is the probability that a randomly chosen individual is heterozygous at a randomly chosen nucleotide, or</p> <table id="S1.EGx5" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E11" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E11.m1" class="ltx_Math" alttext="\displaystyle H_{O}=\frac{\#\left\{(i,j)\colon i\in A,\;j\in S,\;G_{ijp}\neq G% _{ijm}\right\}}{|S|\,|A|}," display="inline"><semantics><mrow><mrow><msub><mi>H</mi><mi>O</mi></msub><mo>=</mo><mstyle displaystyle="true"><mfrac><mrow><mi mathvariant="normal">#</mi><mo>⁢</mo><mrow><mo>{</mo><mrow><mrow><mo>(</mo><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow><mo>)</mo></mrow><mo separator="true">:</mo><mrow><mrow><mi>i</mi><mo>∈</mo><mi>A</mi></mrow><mo separator="true">, </mo><mrow><mrow><mi>j</mi><mo>∈</mo><mi>S</mi></mrow><mo separator="true">, </mo><mrow><msub><mi>G</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi><mo>⁢</mo><mi>p</mi></mrow></msub><mo>≠</mo><msub><mi>G</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi><mo>⁢</mo><mi>m</mi></mrow></msub></mrow></mrow></mrow></mrow><mo>}</mo></mrow></mrow><mrow><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow><mo>⁢</mo><mrow><mo fence="true">|</mo><mi>A</mi><mo fence="true">|</mo></mrow></mrow></mfrac></mstyle></mrow><mo>,</mo></mrow><annotation encoding="application/x-tex">\displaystyle H_{O}=\frac{\#\left\{(i,j)\colon i\in A,\;j\in S,\;G_{ijp}\neq G% _{ijm}\right\}}{|S|\,|A|},</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(11)</span></td></tr> </table> <p class="ltx_p">where <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p1.m1" class="ltx_Math" 
alttext="|S|" display="inline"><semantics><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow><annotation encoding="application/x-tex">|S|</annotation></semantics></math> denotes the total number of loci and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p1.m2" class="ltx_Math" alttext="|A|" display="inline"><semantics><mrow><mo fence="true">|</mo><mi>A</mi><mo fence="true">|</mo></mrow><annotation encoding="application/x-tex">|A|</annotation></semantics></math> denotes the total number of individuals.</p> </div> <div id="S1.SS2.p2" class="ltx_para"> <p class="ltx_p">In other words, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m1" class="ltx_Math" alttext="H_{O}" display="inline"><semantics><msub><mi>H</mi><mi>O</mi></msub><annotation encoding="application/x-tex">H_{O}</annotation></semantics></math> is the proportion of homologous alleles that differ from each other, across <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m2" class="ltx_Math" display="inline" alttext="S"><semantics><mi>S</mi><annotation encoding="application/x-tex">S</annotation></semantics></math> and across <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m3" class="ltx_Math" display="inline" alttext="A"><semantics><mi>A</mi><annotation encoding="application/x-tex">A</annotation></semantics></math>. By calling them “homologous” we assume they share a common ancestor; so if they differ there must have occurred a mutation since that common ancestor. 
Take a single individual <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m4" class="ltx_Math" alttext="i" display="inline"><semantics><mi>i</mi><annotation encoding="application/x-tex">i</annotation></semantics></math>, and suppose that the chance of a mutation occurring at site <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m5" class="ltx_Math" display="inline" alttext="j"><semantics><mi>j</mi><annotation encoding="application/x-tex">j</annotation></semantics></math> in a particular meiosis is <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m6" class="ltx_Math" display="inline" alttext="\mu_{d}"><semantics><msub><mi>μ</mi><mi>d</mi></msub><annotation encoding="application/x-tex">\mu_{d}</annotation></semantics></math>, and that there have been <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m7" class="ltx_Math" display="inline" alttext="\tau_{ij}"><semantics><msub><mi>τ</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi></mrow></msub><annotation encoding="application/x-tex">\tau_{ij}</annotation></semantics></math> generations since the common ancestor of the maternal and paternal copies. 
The probability that there have been no mutations since that time is <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m8" class="ltx_Math" alttext="(1-\mu_{d})^{2\tau_{ij}}" display="inline"><semantics><msup><mrow><mo>(</mo><mrow><mn>1</mn><mo>-</mo><msub><mi>μ</mi><mi>d</mi></msub></mrow><mo>)</mo></mrow><mrow><mn>2</mn><mo>⁢</mo><msub><mi>τ</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi></mrow></msub></mrow></msup><annotation encoding="application/x-tex">(1-\mu_{d})^{2\tau_{ij}}</annotation></semantics></math>, since there are <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m9" class="ltx_Math" alttext="2\tau_{ij}" display="inline"><semantics><mrow><mn>2</mn><mo>⁢</mo><msub><mi>τ</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi></mrow></msub></mrow><annotation encoding="application/x-tex">2\tau_{ij}</annotation></semantics></math> meioses separating the two. The proportion of heterozygous sites is determined by the empirical distribution of times back to the most recent common ancestor of paired homologous sites, averaged across sites and across individuals. Let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m10" class="ltx_Math" alttext="\tau_{H}" display="inline"><semantics><msub><mi>τ</mi><mi>H</mi></msub><annotation encoding="application/x-tex">\tau_{H}</annotation></semantics></math> denote a random variable with this distribution, i.e. 
<math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m11" class="ltx_Math" alttext="\mathbb{P}\{\tau_{H}=t\}=\#\{(i,j)\colon\tau_{ij}=t\}/|S||A|" display="inline"><semantics><mrow><mi>ℙ</mi><mrow><mo>{</mo><msub><mi>τ</mi><mi>H</mi></msub><mo>=</mo><mi>t</mi><mo>}</mo></mrow><mo>=</mo><mi mathvariant="normal">#</mi><mrow><mo>{</mo><mrow><mo>(</mo><mi>i</mi><mo>,</mo><mi>j</mi><mo>)</mo></mrow><mo>:</mo><msub><mi>τ</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi></mrow></msub><mo>=</mo><mi>t</mi><mo>}</mo></mrow><mo>/</mo><mo>|</mo><mi>S</mi><mo>|</mo><mo>|</mo><mi>A</mi><mo>|</mo></mrow><annotation encoding="application/x-tex">\mathbb{P}\{\tau_{H}=t\}=\#\{(i,j)\colon\tau_{ij}=t\}/|S||A|</annotation></semantics></math>. If, as assumed, there is no back mutation, then,</p> <table id="S1.EGx6" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E12" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E12.m1" class="ltx_Math" display="inline" alttext="\displaystyle\mathbb{E}[H_{O}|\mathcal{M}]"><semantics><mrow><mi>𝔼</mi><mrow><mo>[</mo><msub><mi>H</mi><mi>O</mi></msub><mo>|</mo><mi>ℳ</mi><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\mathbb{E}[H_{O}|\mathcal{M}]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E12.m2" class="ltx_Math" alttext="\displaystyle=\frac{1}{|S|\,|A|}\sum_{i\in A}\sum_{j\in S}\left(1-(1-\mu_{d})^% {2\tau_{ij}}\right)" display="inline"><semantics><mrow><mi/><mo>=</mo><mrow><mstyle displaystyle="true"><mfrac><mn>1</mn><mrow><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow><mo>⁢</mo><mrow><mo fence="true">|</mo><mi>A</mi><mo fence="true">|</mo></mrow></mrow></mfrac></mstyle><mo>⁢</mo><mrow><mstyle displaystyle="true"><munder><mo 
movablelimits="false">∑</mo><mrow><mi>i</mi><mo>∈</mo><mi>A</mi></mrow></munder></mstyle><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><mi>j</mi><mo>∈</mo><mi>S</mi></mrow></munder></mstyle><mrow><mo>(</mo><mrow><mn>1</mn><mo>-</mo><msup><mrow><mo>(</mo><mrow><mn>1</mn><mo>-</mo><msub><mi>μ</mi><mi>d</mi></msub></mrow><mo>)</mo></mrow><mrow><mn>2</mn><mo>⁢</mo><msub><mi>τ</mi><mrow><mi>i</mi><mo>⁢</mo><mi>j</mi></mrow></msub></mrow></msup></mrow><mo>)</mo></mrow></mrow></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\frac{1}{|S|\,|A|}\sum_{i\in A}\sum_{j\in S}\left(1-(1-\mu_{d})^% {2\tau_{ij}}\right)</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(12)</span></td></tr> <tr id="S1.E13" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E13.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\mathbb{E}\left[1-(1-\mu_{d})^{2\tau_{H}}\right]"><semantics><mrow><mi/><mo>=</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mn>1</mn><mo>-</mo><msup><mrow><mo>(</mo><mrow><mn>1</mn><mo>-</mo><msub><mi>μ</mi><mi>d</mi></msub></mrow><mo>)</mo></mrow><mrow><mn>2</mn><mo>⁢</mo><msub><mi>τ</mi><mi>H</mi></msub></mrow></msup></mrow><mo>]</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}\left[1-(1-\mu_{d})^{2\tau_{H}}\right]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(13)</span></td></tr> <tr id="S1.E14" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E14.m2" 
class="ltx_Math" alttext="\displaystyle=\mathbb{E}\left[1-e^{-2\mu\tau_{H}}\right]." display="inline"><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mn>1</mn><mo>-</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mi>H</mi></msub></mrow></mrow></msup></mrow><mo>]</mo></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}\left[1-e^{-2\mu\tau_{H}}\right].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(14)</span></td></tr> </table> <p class="ltx_p">Note that <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m12" class="ltx_Math" display="inline" alttext="\tau_{H}"><semantics><msub><mi>τ</mi><mi>H</mi></msub><annotation encoding="application/x-tex">\tau_{H}</annotation></semantics></math>, if it was observable, would be a good <em class="ltx_emph">summary statistic</em> (albeit complicated) of the pedigree, and depends implicitly on the choice of individuals <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m13" class="ltx_Math" display="inline" alttext="A"><semantics><mi>A</mi><annotation encoding="application/x-tex">A</annotation></semantics></math> and the choice of genomic region <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m14" class="ltx_Math" display="inline" alttext="S"><semantics><mi>S</mi><annotation encoding="application/x-tex">S</annotation></semantics></math>. 
If <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m15" class="ltx_Math" display="inline" alttext="\mu"><semantics><mi>μ</mi><annotation encoding="application/x-tex">\mu</annotation></semantics></math> is small, then <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p2.m16" class="ltx_Math" display="inline" alttext="H_{O}\approx 2\mu\mathbb{E}[\tau_{H}]"><semantics><mrow><msub><mi>H</mi><mi>O</mi></msub><mo>≈</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msub><mi>τ</mi><mi>H</mi></msub><mo>]</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">H_{O}\approx 2\mu\mathbb{E}[\tau_{H}]</annotation></semantics></math>, i.e. the proportion of sites at which an individual is heterozygous is approximately twice the mutation rate multiplied by the average time back to the common ancestor of the maternal and paternal chromosomes.</p> </div> <div id="S1.SS2.p3" class="ltx_para"> <p class="ltx_p">Similarly,</p> <table id="S1.EGx7" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E15" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E15.m1" class="ltx_Math" alttext="\displaystyle\var[H_{O}|\mathcal{M}]" display="inline"><semantics><mrow><mi>var</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><msub><mi>H</mi><mi>O</mi></msub><mo separator="true">|</mo><mi>ℳ</mi></mrow><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\var[H_{O}|\mathcal{M}]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E15.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\frac{1}{|S|\,|A|}\mathbb{E}\left[e^{-2\mu\tau_{H}}\left(1-e^{-2\mu\tau_{H}}\right)\right]."><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mstyle displaystyle="true"><mfrac><mn>1</mn><mrow><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow><mo>⁢</mo><mrow><mo fence="true">|</mo><mi>A</mi><mo fence="true">|</mo></mrow></mrow></mfrac></mstyle><mo>⁢</mo><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mi>H</mi></msub></mrow></mrow></msup><mo>⁢</mo><mrow><mo>(</mo><mrow><mn>1</mn><mo>-</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mi>H</mi></msub></mrow></mrow></msup></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\frac{1}{|S|\,|A|}\mathbb{E}\left[e^{-2\mu\tau_{H}}\left(1-e^{-2\mu\tau_{H}}\right)\right].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(15)</span></td></tr> </table> <p class="ltx_p">This implies that the observed heterozygosity <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p3.m1" class="ltx_Math" display="inline" alttext="H_{O}"><semantics><msub><mi>H</mi><mi>O</mi></msub><annotation encoding="application/x-tex">H_{O}</annotation></semantics></math>, after dividing by <math xmlns="http://www.w3.org/1998/Math/MathML" class="ltx_Math" display="inline" alttext="2\mu"><semantics><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi></mrow><annotation encoding="application/x-tex">2\mu</annotation></semantics></math>, is an estimator of the statistic <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p3.m2" class="ltx_Math" alttext="\mathbb{E}[\tau_{H}]" display="inline"><semantics><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msub><mi>τ</mi><mi>H</mi></msub><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\mathbb{E}[\tau_{H}]</annotation></semantics></math> of the empirical distribution of pairwise coalescence times, and that we can put some explicit bounds on how good this estimator is.</p> </div> <div id="S1.SS2.p4" class="ltx_para"> <p class="ltx_p">As stated, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p4.m1" class="ltx_Math" display="inline" alttext="H_{O}"><semantics><msub><mi>H</mi><mi>O</mi></msub><annotation encoding="application/x-tex">H_{O}</annotation></semantics></math> is a single 
number, the chance that a randomly chosen homologous pair of alleles differs. This averages over levels of relatedness of different individuals, as well as mutation rates and depths of relatedness that may differ systematically across loci. If we know local mutation rates, and partition sites accordingly, then we can estimate <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p4.m2" class="ltx_Math" display="inline" alttext="H_{O}(\mu)=1-\mathbb{E}\left[e^{-2\mu\tau_{H}}\right]"><semantics><mrow><mrow><msub><mi>H</mi><mi>O</mi></msub><mo>⁢</mo><mrow><mo>(</mo><mi>μ</mi><mo>)</mo></mrow></mrow><mo>=</mo><mrow><mn>1</mn><mo>-</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mi>H</mi></msub></mrow></mrow></msup><mo>]</mo></mrow></mrow></mrow></mrow><annotation encoding="application/x-tex">H_{O}(\mu)=1-\mathbb{E}\left[e^{-2\mu\tau_{H}}\right]</annotation></semantics></math> as a function of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p4.m3" class="ltx_Math" display="inline" alttext="\mu"><semantics><mi>μ</mi><annotation encoding="application/x-tex">\mu</annotation></semantics></math>, obtaining (after subtracting from one) an estimate of the Laplace transform of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS2.p4.m4" class="ltx_Math" alttext="\tau_{H}" display="inline"><semantics><msub><mi>τ</mi><mi>H</mi></msub><annotation encoding="application/x-tex">\tau_{H}</annotation></semantics></math>. 
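</p> <p class="ltx_p">The quantities in this subsection can be checked numerically. The following is a minimal Python sketch, not code from the source: it computes the observed heterozygosity of equation (11) from a small table of genotypes, and compares the right-hand side of equation (12) with its linearization for a small mutation rate. The function names, the toy genotypes, the coalescence times in <span class="ltx_text ltx_font_typewriter">tau</span>, and the rate <span class="ltx_text ltx_font_typewriter">mu_d = 1e-6</span> are all hypothetical values chosen for illustration; every individual is assumed to be genotyped at the same sites.</p>
<pre class="ltx_verbatim ltx_font_typewriter">
```python
def observed_heterozygosity(G):
    """Observed heterozygosity as in equation (11).

    G[i][j] is a pair (maternal allele, paternal allele) for
    individual i at site j; all individuals share the same sites.
    """
    n_pairs = sum(len(sites) for sites in G)
    n_het = sum(1 for sites in G for (m, p) in sites if m != p)
    return n_het / n_pairs


def expected_heterozygosity(tau, mu_d):
    """Right-hand side of equation (12) for a table of times tau[i][j]."""
    terms = [1.0 - (1.0 - mu_d) ** (2 * t) for row in tau for t in row]
    return sum(terms) / len(terms)


# Toy genotypes: 2 individuals, 4 sites (made-up values).
G = [
    [(0, 0), (0, 1), (1, 1), (1, 0)],
    [(0, 0), (0, 0), (1, 1), (0, 1)],
]
h_obs = observed_heterozygosity(G)  # 3 of the 8 pairs differ, so 0.375

# Hypothetical coalescence times (in generations) and per-meiosis rate.
tau = [[100, 400, 250, 1000], [50, 50, 250, 800]]
mu_d = 1e-6
h_exp = expected_heterozygosity(tau, mu_d)

# Linearization for a small rate: twice the rate times the mean time.
mean_tau = sum(t for row in tau for t in row) / 8
h_lin = 2.0 * mu_d * mean_tau

print(h_obs, h_exp, h_lin)
```
</pre>
<p class="ltx_p">With these made-up inputs the exact expectation and the linearization agree closely, because the product of the rate and each coalescence time is far below one. 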
</p> </div> </div> <div id="S1.SS3" class="ltx_subsection"> <h2 class="ltx_title ltx_title_subsection"><span class="ltx_tag ltx_tag_subsection">1.3 </span>Mean number of pairwise differences</h2> <div id="S1.SS3.p1" class="ltx_para"> <p class="ltx_p">Also known as “expected heterozygosity”, this is the chance that two randomly chosen alleles from <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS3.p1.m1" class="ltx_Math" alttext="A" display="inline"><semantics><mi>A</mi><annotation encoding="application/x-tex">A</annotation></semantics></math> at a random site in <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS3.p1.m2" class="ltx_Math" display="inline" alttext="S"><semantics><mi>S</mi><annotation encoding="application/x-tex">S</annotation></semantics></math> differ:</p> <table id="S1.EGx8" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E16" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E16.m1" class="ltx_Math" display="inline" alttext="\displaystyle H_{E}"><semantics><msub><mi>H</mi><mi>E</mi></msub><annotation encoding="application/x-tex">\displaystyle H_{E}</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E16.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\frac{\#\{(j,i_{1},i_{2},k_{1},k_{2})\colon G_{i_{1}jk_{1}}\neq G% _{i_{2}jk_{2}}\}}{2|S|\,|A|(|A|-1)}."><semantics><mrow><mrow><mi/><mo>=</mo><mstyle displaystyle="true"><mfrac><mrow><mi mathvariant="normal">#</mi><mo>⁢</mo><mrow><mo>{</mo><mrow><mrow><mo>(</mo><mrow><mi>j</mi><mo>,</mo><msub><mi>i</mi><mn>1</mn></msub><mo>,</mo><msub><mi>i</mi><mn>2</mn></msub><mo>,</mo><msub><mi>k</mi><mn>1</mn></msub><mo>,</mo><msub><mi>k</mi><mn>2</mn></msub></mrow><mo>)</mo></mrow><mo 
separator="true">:</mo><mrow><msub><mi>G</mi><mrow><msub><mi>i</mi><mn>1</mn></msub><mo>⁢</mo><mi>j</mi><mo>⁢</mo><msub><mi>k</mi><mn>1</mn></msub></mrow></msub><mo>≠</mo><msub><mi>G</mi><mrow><msub><mi>i</mi><mn>2</mn></msub><mo>⁢</mo><mi>j</mi><mo>⁢</mo><msub><mi>k</mi><mn>2</mn></msub></mrow></msub></mrow></mrow><mo>}</mo></mrow></mrow><mrow><mn>2</mn><mo>⁢</mo><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow><mo>⁢</mo><mrow><mo fence="true">|</mo><mi>A</mi><mo fence="true">|</mo></mrow><mo>⁢</mo><mrow><mo>(</mo><mrow><mrow><mo fence="true">|</mo><mi>A</mi><mo fence="true">|</mo></mrow><mo>-</mo><mn>1</mn></mrow><mo>)</mo></mrow></mrow></mfrac></mstyle></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\frac{\#\{(j,i_{1},i_{2},k_{1},k_{2})\colon G_{i_{1}jk_{1}}\neq G% _{i_{2}jk_{2}}\}}{2|S|\,|A|(|A|-1)}.</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(16)</span></td></tr> </table> <p class="ltx_p"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS3.p1.m3" class="ltx_Math" display="inline" alttext="H_{E}"><semantics><msub><mi>H</mi><mi>E</mi></msub><annotation encoding="application/x-tex">H_{E}</annotation></semantics></math>, like <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS3.p1.m4" class="ltx_Math" alttext="H_{O}" display="inline"><semantics><msub><mi>H</mi><mi>O</mi></msub><annotation encoding="application/x-tex">H_{O}</annotation></semantics></math>, is computable from the distribution of the number of generations available for mutation where the relevant number of generations here is defined to be <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS3.p1.m5" class="ltx_Math" alttext="\tau_{T}" display="inline"><semantics><msub><mi>τ</mi><mi>T</mi></msub><annotation encoding="application/x-tex">\tau_{T}</annotation></semantics></math>. 
Concretely, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS3.p1.m6" class="ltx_Math" display="inline" alttext="\tau_{T}"><semantics><msub><mi>τ</mi><mi>T</mi></msub><annotation encoding="application/x-tex">\tau_{T}</annotation></semantics></math> is the number of generations back to the common ancestor at a uniformly chosen locus between two uniformly chosen chromosomes in the population (possibly, but not necessarily, in the same individual). Again,</p> <table id="S1.EGx9" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E17" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E17.m1" class="ltx_Math" display="inline" alttext="\displaystyle H_{E}"><semantics><msub><mi>H</mi><mi>E</mi></msub><annotation encoding="application/x-tex">\displaystyle H_{E}</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E17.m2" class="ltx_Math" alttext="\displaystyle=\mathbb{E}\left[1-(1-\mu_{d})^{2\tau_{T}}\right]=\mathbb{E}\left[1-e^{-2\mu\tau_{T}}\right]." display="inline"><semantics><mrow><mo>=</mo><mi>𝔼</mi><mrow><mo>[</mo><mn>1</mn><mo>-</mo><msup><mrow><mo>(</mo><mn>1</mn><mo>-</mo><msub><mi>μ</mi><mi>d</mi></msub><mo>)</mo></mrow><mrow><mn>2</mn><mo>⁢</mo><msub><mi>τ</mi><mi>T</mi></msub></mrow></msup><mo>]</mo></mrow><mo>=</mo><mi>𝔼</mi><mrow><mo>[</mo><mn>1</mn><mo>-</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mi>T</mi></msub></mrow></mrow></msup><mo>]</mo></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}\left[1-(1-\mu_{d})^{2\tau_{T}}\right]=\mathbb{E}\left[1-e^{-2\mu\tau_{T}}\right].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(17)</span></td></tr> </table> </div> <div id="S1.SS3.p2" class="ltx_para"> <p class="ltx_p">Such measures of heterozygosity can measure not only within-group diversity but also between-group divergence, by computing e.g. the probability that two randomly chosen individuals in different subpopulations differ at a randomly chosen locus. Any such measurement can be thought of as the proportion of some subset of paths through the pedigree along which a mutation has occurred; (crucially) assuming that the mutation process is independent of inheritance, this probability of mutation only depends on the number of meioses along the path, and hence on the distribution of path lengths. 
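</p> <p class="ltx_p">As an illustration of such a pairwise measure, here is a minimal Python sketch of the statistic in equation (16); it is not code from the source. It reads the count in (16) as running over unordered pairs of chromosomes taken from distinct individuals, which is an assumption made here so that the number of pairs compared equals the denominator of (16). The function name and the toy genotypes are hypothetical.</p>
<pre class="ltx_verbatim ltx_font_typewriter">
```python
from itertools import combinations


def mean_pairwise_differences(G):
    """Mean number of pairwise differences, following equation (16).

    G[i][j] is a pair (maternal allele, paternal allele) for
    individual i at site j. Pairs of chromosomes are taken from
    distinct individuals only (an assumption of this sketch).
    """
    n_ind = len(G)
    n_sites = len(G[0])
    n_diff = 0
    for g1, g2 in combinations(G, 2):
        for pair1, pair2 in zip(g1, g2):
            # four chromosome pairs per site and pair of individuals
            n_diff += sum(1 for a in pair1 for b in pair2 if a != b)
    return n_diff / (2 * n_sites * n_ind * (n_ind - 1))


# Toy genotypes: 2 individuals, 4 sites (made-up values).
G = [
    [(0, 0), (0, 1), (1, 1), (1, 0)],
    [(0, 0), (0, 0), (1, 1), (0, 1)],
]
h_e = mean_pairwise_differences(G)  # 4 of the 16 comparisons differ
print(h_e)
```
</pre>
<p class="ltx_p">Restricting the first individual to one subpopulation and the second to another would turn the same count into a between-group divergence. 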
Above, these distributions of lengths across certain sets of paths through the pedigree appeared as <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS3.p2.m1" class="ltx_Math" display="inline" alttext="\tau_{H}"><semantics><msub><mi>τ</mi><mi>H</mi></msub><annotation encoding="application/x-tex">\tau_{H}</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS3.p2.m2" class="ltx_Math" display="inline" alttext="\tau_{T}"><semantics><msub><mi>τ</mi><mi>T</mi></msub><annotation encoding="application/x-tex">\tau_{T}</annotation></semantics></math>.</p> </div> </div> <div id="S1.SS4" class="ltx_subsection"> <h2 class="ltx_title ltx_title_subsection"><span class="ltx_tag ltx_tag_subsection">1.4 </span>The allele frequency spectra</h2> <div id="S1.SS4.p1" class="ltx_para"> <p class="ltx_p">Mutations at a locus induce a partition of a set of chromosomes into groups that are identical at that locus. Heterozygosities are pairwise statistics; when comparing two chromosomes there are only two possible results: identical or not. 
When looking at larger samples, any partition is possible; at loci with no more than two alleles, all dichotomous partitions are possible.</p> </div> <div id="S1.F1" class="ltx_figure"><object data="frequency-spectra-trees.svg" id="S1.F1.g1" class="ltx_graphics ltx_centering" width="316" height="306" alt=""/> <div class="ltx_caption"><span class="ltx_tag ltx_tag_figure">Figure 1: </span><span class="ltx_text ltx_font_bold">(A)</span> The lengths <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F1.m6" class="ltx_Math" display="inline" alttext="T_{3}"><semantics><msub><mi>T</mi><mn>3</mn></msub><annotation encoding="application/x-tex">T_{3}</annotation></semantics></math>, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F1.m7" class="ltx_Math" alttext="T_{2}" display="inline"><semantics><msub><mi>T</mi><mn>2</mn></msub><annotation encoding="application/x-tex">T_{2}</annotation></semantics></math>, and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F1.m8" class="ltx_Math" display="inline" alttext="T_{1}"><semantics><msub><mi>T</mi><mn>1</mn></msub><annotation encoding="application/x-tex">T_{1}</annotation></semantics></math> (see text). Note that mutations on <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F1.m9" class="ltx_Math" alttext="T_{3}" display="inline"><semantics><msub><mi>T</mi><mn>3</mn></msub><annotation encoding="application/x-tex">T_{3}</annotation></semantics></math> are indistinguishable from those on <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F1.m10" class="ltx_Math" display="inline" alttext="T_{1}"><semantics><msub><mi>T</mi><mn>1</mn></msub><annotation encoding="application/x-tex">T_{1}</annotation></semantics></math> if the alleles are not polarized. 
<span class="ltx_text ltx_font_bold">(B–C)</span> The frequency spectrum encodes information about distributions of tree shape: the lower set of trees has longer internal branches, and so will have a higher chance of 2:2 partitions than the upper set of trees. Mutations (circles on the tree) separate “red” from “black” types; assuming that mutation is independent of the pedigree implies that the location of mutation is uniform (proportional to length) on the tree. </div> </div> <div id="S1.SS4.p2" class="ltx_para"> <p class="ltx_p">Suppose we are looking at the empirical distribution of allele frequencies in a sample of size <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p2.m1" class="ltx_Math" alttext="|A|=n" display="inline"><semantics><mrow><mrow><mo fence="true">|</mo><mi>A</mi><mo fence="true">|</mo></mrow><mo>=</mo><mi>n</mi></mrow><annotation encoding="application/x-tex">|A|=n</annotation></semantics></math> at biallelic sites, including only the polymorphic sites, i.e. the sites where more than one allele is seen in the sample. This is called the “allele frequency spectrum”, or “site frequency spectrum”. 
Let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p2.m2" class="ltx_Math" display="inline" alttext="(N_{j,0},N_{j,1})"><semantics><mrow><mo>(</mo><mrow><msub><mi>N</mi><mrow><mi>j</mi><mo>,</mo><mn>0</mn></mrow></msub><mo>,</mo><msub><mi>N</mi><mrow><mi>j</mi><mo>,</mo><mn>1</mn></mrow></msub></mrow><mo>)</mo></mrow><annotation encoding="application/x-tex">(N_{j,0},N_{j,1})</annotation></semantics></math> denote the numbers of sampled chromosomes that have the ‘0’ and ‘1’ alleles at site <math xmlns="http://www.w3.org/1998/Math/MathML" class="ltx_Math" display="inline" alttext="j"><semantics><mi>j</mi><annotation encoding="application/x-tex">j</annotation></semantics></math>, respectively, and define <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p2.m3" class="ltx_Math" alttext="F_{k}" display="inline"><semantics><msub><mi>F</mi><mi>k</mi></msub><annotation encoding="application/x-tex">F_{k}</annotation></semantics></math> to be the proportion of sites at which allele ‘1’ is at frequency <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p2.m4" class="ltx_Math" display="inline" alttext="k"><semantics><mi>k</mi><annotation encoding="application/x-tex">k</annotation></semantics></math>, or</p> <table id="S1.EGx10" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E18" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E18.m1" class="ltx_Math" alttext="\displaystyle F_{k}" display="inline"><semantics><msub><mi>F</mi><mi>k</mi></msub><annotation encoding="application/x-tex">\displaystyle F_{k}</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E18.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\frac{\#\{j:N_{j,1}=k\}}{|S|}"><semantics><mrow><mi/><mo>=</mo><mstyle displaystyle="true"><mfrac><mrow><mi mathvariant="normal">#</mi><mo>⁢</mo><mrow><mo>{</mo><mrow><mi>j</mi><mo separator="true">:</mo><mrow><msub><mi>N</mi><mrow><mi>j</mi><mo>,</mo><mn>1</mn></mrow></msub><mo>=</mo><mi>k</mi></mrow></mrow><mo>}</mo></mrow></mrow><mrow><mo fence="true">|</mo><mi>S</mi><mo fence="true">|</mo></mrow></mfrac></mstyle></mrow><annotation encoding="application/x-tex">\displaystyle=\frac{\#\{j:N_{j,1}=k\}}{|S|}</annotation></semantics></math></td> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E18.m3" class="ltx_Math" display="inline" alttext="\displaystyle k"><semantics><mi>k</mi><annotation encoding="application/x-tex">\displaystyle k</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E18.m4" class="ltx_Math" alttext="\displaystyle\in\{0,1,\ldots,n\}" display="inline"><semantics><mrow><mi/><mo>∈</mo><mrow><mo>{</mo><mrow><mn>0</mn><mo>,</mo><mn>1</mn><mo>,</mo><mi mathvariant="normal">…</mi><mo>,</mo><mi>n</mi></mrow><mo>}</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\in\{0,1,\ldots,n\}</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(18)</span></td></tr> </table> <p class="ltx_p">and define the “unfolded” and “folded” allele frequency spectra by</p> <table id="S1.EGx11" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E20" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E20.m1" class="ltx_Math" alttext="\displaystyle a_{k}^{*}" display="inline"><semantics><msubsup><mi>a</mi><mi>k</mi><mo>*</mo></msubsup><annotation encoding="application/x-tex">\displaystyle a_{k}^{*}</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E20.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\frac{F_{k}}{F_{1}+\cdots+F_{n-1}}"><semantics><mrow><mi/><mo>=</mo><mstyle displaystyle="true"><mfrac><msub><mi>F</mi><mi>k</mi></msub><mrow><msub><mi>F</mi><mn>1</mn></msub><mo>+</mo><mi 
mathvariant="normal">⋯</mi><mo>+</mo><msub><mi>F</mi><mrow><mi>n</mi><mo>-</mo><mn>1</mn></mrow></msub></mrow></mfrac></mstyle></mrow><annotation encoding="application/x-tex">\displaystyle=\frac{F_{k}}{F_{1}+\cdots+F_{n-1}}</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(20)</span></td></tr> <tr id="S1.E21" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E21.m1" class="ltx_Math" display="inline" alttext="\displaystyle a_{k}"><semantics><msub><mi>a</mi><mi>k</mi></msub><annotation encoding="application/x-tex">\displaystyle a_{k}</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E21.m2" class="ltx_Math" alttext="\displaystyle=\begin{cases}\frac{F_{n/2}}{F_{1}+\cdots+F_{n-1}}\qquad\text{if % }k=n/2\\ \frac{F_{k}+F_{n-k}}{F_{1}+\cdots+F_{n-1}}\qquad\text{if }k\neq n/2\end{cases}" display="inline"><semantics><mrow><mi/><mo>=</mo><mrow><mo>{</mo><mtable columnspacing="0.4em" rowspacing="0.2ex"><mtr><mtd columnalign="left"><mrow><mrow><mstyle displaystyle="true"><mfrac><msub><mi>F</mi><mrow><mi>n</mi><mo>/</mo><mn>2</mn></mrow></msub><mrow><msub><mi>F</mi><mn>1</mn></msub><mo>+</mo><mi mathvariant="normal">⋯</mi><mo>+</mo><msub><mi>F</mi><mrow><mi>n</mi><mo>-</mo><mn>1</mn></mrow></msub></mrow></mfrac></mstyle><mo separator="true"> </mo><mrow><mtext>if </mtext><mo>⁢</mo><mi>k</mi></mrow></mrow><mo>=</mo><mrow><mi>n</mi><mo>/</mo><mn>2</mn></mrow></mrow></mtd><mtd/></mtr><mtr><mtd columnalign="left"><mrow><mrow><mstyle displaystyle="true"><mfrac><mrow><msub><mi>F</mi><mi>k</mi></msub><mo>+</mo><msub><mi>F</mi><mrow><mi>n</mi><mo>-</mo><mi>k</mi></mrow></msub></mrow><mrow><msub><mi>F</mi><mn>1</mn></msub><mo>+</mo><mi 
mathvariant="normal">⋯</mi><mo>+</mo><msub><mi>F</mi><mrow><mi>n</mi><mo>-</mo><mn>1</mn></mrow></msub></mrow></mfrac></mstyle><mo separator="true"> </mo><mrow><mtext>if </mtext><mo>⁢</mo><mi>k</mi></mrow></mrow><mo>≠</mo><mrow><mi>n</mi><mo>/</mo><mn>2</mn></mrow></mrow></mtd><mtd/></mtr></mtable></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\begin{cases}\frac{F_{n/2}}{F_{1}+\cdots+F_{n-1}}\qquad\text{if % }k=n/2\\ \frac{F_{k}+F_{n-k}}{F_{1}+\cdots+F_{n-1}}\qquad\text{if }k\neq n/2\end{cases}</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(21)</span></td></tr> </table> <p class="ltx_p">If we have some way of polarizing mutations, so that e.g. allele ‘0’ is more likely to be the ancestral allele, then the unfolded spectrum is more useful; otherwise, if the choice of allele labeling is arbitrary, we expect <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p2.m5" class="ltx_Math" alttext="a_{k}^{*}=a_{n-k}^{*}" display="inline"><semantics><mrow><msubsup><mi>a</mi><mi>k</mi><mo>*</mo></msubsup><mo>=</mo><msubsup><mi>a</mi><mrow><mi>n</mi><mo>-</mo><mi>k</mi></mrow><mo>*</mo></msubsup></mrow><annotation encoding="application/x-tex">a_{k}^{*}=a_{n-k}^{*}</annotation></semantics></math> and the folded spectrum is more natural.</p> </div> <div id="S1.SS4.p3" class="ltx_para"> <p class="ltx_p">Now, we’ll compute the mean and variance of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m1" class="ltx_Math" display="inline" alttext="F_{k}"><semantics><msub><mi>F</mi><mi>k</mi></msub><annotation encoding="application/x-tex">F_{k}</annotation></semantics></math> conditional on the ARG <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m2" class="ltx_Math" display="inline" alttext="\mathcal{M}"><semantics><mi>ℳ</mi><annotation encoding="application/x-tex">\mathcal{M}</annotation></semantics></math>, averaging 
across the mutation process. Let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m3" class="ltx_Math" alttext="\mathcal{T}_{j}" display="inline"><semantics><msub><mi>𝒯</mi><mi>j</mi></msub><annotation encoding="application/x-tex">\mathcal{T}_{j}</annotation></semantics></math> be the gene tree at site <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m4" class="ltx_Math" alttext="j" display="inline"><semantics><mi>j</mi><annotation encoding="application/x-tex">j</annotation></semantics></math>, and let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m5" class="ltx_Math" alttext="T_{j,k}" display="inline"><semantics><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub><annotation encoding="application/x-tex">T_{j,k}</annotation></semantics></math> be the total length of branches in <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m6" class="ltx_Math" display="inline" alttext="\mathcal{T}_{j}"><semantics><msub><mi>𝒯</mi><mi>j</mi></msub><annotation encoding="application/x-tex">\mathcal{T}_{j}</annotation></semantics></math> subtended by exactly <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m7" class="ltx_Math" display="inline" alttext="k"><semantics><mi>k</mi><annotation encoding="application/x-tex">k</annotation></semantics></math> tips, so that <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m8" class="ltx_Math" alttext="|\mathcal{T}_{j}|=\sum_{k=1}^{n-1}T_{j,k}" display="inline"><semantics><mrow><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow><mo>=</mo><mrow><msubsup><mo>∑</mo><mrow><mi>k</mi><mo>=</mo><mn>1</mn></mrow><mrow><mi>n</mi><mo>-</mo><mn>1</mn></mrow></msubsup><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub></mrow></mrow><annotation encoding="application/x-tex">|\mathcal{T}_{j}|=\sum_{k=1}^{n-1}T_{j,k}</annotation></semantics></math> (see figure <a href="#S1.F1" title="Figure 1 ‣ 1.4 The 
allele frequency spectra ‣ 1 Summary statistics" class="ltx_ref"><span class="ltx_text ltx_ref_tag">1</span></a>). Again assuming that the mutation is independent of inheritance, the probability that site <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m9" class="ltx_Math" alttext="j" display="inline"><semantics><mi>j</mi><annotation encoding="application/x-tex">j</annotation></semantics></math> has no segregating mutation is <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m10" class="ltx_Math" display="inline" alttext="\exp(-\mu|\mathcal{T}_{j}|)"><semantics><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">\exp(-\mu|\mathcal{T}_{j}|)</annotation></semantics></math>, and the probability that only a single segregating mutation has occurred is <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m11" class="ltx_Math" display="inline" alttext="\exp(-\mu|\mathcal{T}_{j}|)\mu|\mathcal{T}_{j}|"><semantics><mrow><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow></mrow><annotation encoding="application/x-tex">\exp(-\mu|\mathcal{T}_{j}|)\mu|\mathcal{T}_{j}|</annotation></semantics></math>. 
Given that only a single mutation has occurred, the location of that mutation is uniform on the tree, and so the probability that a mutation occurs at frequency <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m12" class="ltx_Math" display="inline" alttext="k"><semantics><mi>k</mi><annotation encoding="application/x-tex">k</annotation></semantics></math> at site <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m13" class="ltx_Math" display="inline" alttext="j"><semantics><mi>j</mi><annotation encoding="application/x-tex">j</annotation></semantics></math> is</p> <table id="S1.Ex1" class="ltx_equation"> <tr class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_align_center"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.Ex1.m1" class="ltx_Math" display="block" alttext="\mathbb{P}\{N_{j,1}=k|\mathcal{T}_{j}\}=\mu T_{j,k}\exp(-\mu|\mathcal{T}_{j}|)% =\mu T_{j,k}+O(\mu^{2}|\mathcal{T}_{j}|^{2})."><semantics><mrow><mrow><mrow><mi>ℙ</mi><mo>⁢</mo><mrow><mo>{</mo><mrow><mrow><msub><mi>N</mi><mrow><mi>j</mi><mo>,</mo><mn>1</mn></mrow></msub><mo>=</mo><mi>k</mi></mrow><mo separator="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub></mrow><mo>}</mo></mrow></mrow><mo>=</mo><mrow><mi>μ</mi><mo>⁢</mo><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub><mo>⁢</mo><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow><mo>=</mo><mrow><mrow><mi>μ</mi><mo>⁢</mo><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub></mrow><mo>+</mo><mrow><mi>O</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><msup><mi>μ</mi><mn>2</mn></msup><mo>⁢</mo><msup><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow><mn>2</mn></msup></mrow><mo>)</mo></mrow></mrow></mrow></mrow><mo>.</mo></mrow><annotation 
encoding="application/x-tex">\mathbb{P}\{N_{j,1}=k|\mathcal{T}_{j}\}=\mu T_{j,k}\exp(-\mu|\mathcal{T}_{j}|)% =\mu T_{j,k}+O(\mu^{2}|\mathcal{T}_{j}|^{2}).</annotation></semantics></math></td> <td class="ltx_eqn_pad"/></tr> </table> <p class="ltx_p">Therefore,</p> <table id="S1.EGx12" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E22" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E22.m1" class="ltx_Math" alttext="\displaystyle\mathbb{E}[F_{k}|\mathcal{M}]" display="inline"><semantics><mrow><mi>𝔼</mi><mrow><mo>[</mo><msub><mi>F</mi><mi>k</mi></msub><mo>|</mo><mi>ℳ</mi><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\mathbb{E}[F_{k}|\mathcal{M}]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E22.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\sum_{j}\mathbb{P}\{N_{j,1}=k|\mathcal{T}_{j}\}"><semantics><mrow><mi/><mo>=</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mi>j</mi></munder></mstyle><mrow><mi>ℙ</mi><mo>⁢</mo><mrow><mo>{</mo><mrow><mrow><msub><mi>N</mi><mrow><mi>j</mi><mo>,</mo><mn>1</mn></mrow></msub><mo>=</mo><mi>k</mi></mrow><mo separator="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub></mrow><mo>}</mo></mrow></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\sum_{j}\mathbb{P}\{N_{j,1}=k|\mathcal{T}_{j}\}</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(22)</span></td></tr> <tr id="S1.E23" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E23.m2" class="ltx_Math" display="inline" 
alttext="\displaystyle=\mu\sum_{j}T_{j,k}\exp(-\mu|\mathcal{T}_{j}|),"><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mi>j</mi></munder></mstyle><mrow><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub><mo>⁢</mo><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow></mrow></mrow></mrow><mo>,</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\mu\sum_{j}T_{j,k}\exp(-\mu|\mathcal{T}_{j}|),</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(23)</span></td></tr> </table> <p class="ltx_p">and since <em class="ltx_emph">given the marginal gene trees</em>, the mutation processes at each site are assumed to be independent,</p> <table id="S1.EGx13" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E24" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E24.m1" class="ltx_Math" alttext="\displaystyle\var[F_{k}|\mathcal{M}]" display="inline"><semantics><mrow><mi>var</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><msub><mi>F</mi><mi>k</mi></msub><mo separator="true">|</mo><mi>ℳ</mi></mrow><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\var[F_{k}|\mathcal{M}]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E24.m2" class="ltx_Math" alttext="\displaystyle=\mu\sum_{j}T_{j,k}\exp(-\mu|\mathcal{T}_{j}|)\left(1-\mu T_{j,k}% \exp(-\mu|\mathcal{T}_{j}|)\right)." 
display="inline"><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mi>j</mi></munder></mstyle><mrow><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub><mo>⁢</mo><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow><mo>⁢</mo><mrow><mo>(</mo><mrow><mn>1</mn><mo>-</mo><mrow><mi>μ</mi><mo>⁢</mo><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub><mo>⁢</mo><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mi>μ</mi><mo>⁢</mo><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\mu\sum_{j}T_{j,k}\exp(-\mu|\mathcal{T}_{j}|)\left(1-\mu T_{j,k}% \exp(-\mu|\mathcal{T}_{j}|)\right).</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(24)</span></td></tr> </table> <p class="ltx_p">Therefore, if the number of mutations in the region under consideration is large, then <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m14" class="ltx_Math" display="inline" alttext="F_{k}"><semantics><msub><mi>F</mi><mi>k</mi></msub><annotation encoding="application/x-tex">F_{k}</annotation></semantics></math> is well-approximated by <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p3.m15" class="ltx_Math" alttext="\mu" display="inline"><semantics><mi>μ</mi><annotation encoding="application/x-tex">\mu</annotation></semantics></math> times the sum of the appropriate edges in the trees:</p> <table id="S1.EGx14" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E25" 
class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E25.m1" class="ltx_Math" display="inline" alttext="\displaystyle F_{k}=\mu\sum_{j}T_{j,k}+O(\sqrt{\mu\sum_{j}T_{j,k}}),\qquad% \text{given $\mathcal{M}$}."><semantics><mrow><mrow><msub><mi>F</mi><mi>k</mi></msub><mo>=</mo><mrow><mrow><mrow><mi>μ</mi><mo>⁢</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mi>j</mi></munder></mstyle><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub></mrow></mrow><mo>+</mo><mrow><mi>O</mi><mo>⁢</mo><mrow><mo>(</mo><msqrt><mrow><mi>μ</mi><mo>⁢</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mi>j</mi></munder></mstyle><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub></mrow></mrow></msqrt><mo>)</mo></mrow></mrow></mrow><mo separator="true">, </mo><mrow><mtext>given </mtext><semantics><mi>ℳ</mi><annotation encoding="application/x-tex">\mathcal{M}</annotation></semantics></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle F_{k}=\mu\sum_{j}T_{j,k}+O(\sqrt{\mu\sum_{j}T_{j,k}}),\qquad% \text{given $\mathcal{M}$}.</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(25)</span></td></tr> </table> </div> <div id="S1.SS4.p4" class="ltx_para"> <p class="ltx_p">Therefore, </p> <table id="S1.EGx15" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E26" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E26.m1" class="ltx_Math" display="inline" alttext="\displaystyle\mathbb{E}[a^{*}_{k}|\mathcal{M}]"><semantics><mrow><mi>𝔼</mi><mrow><mo>[</mo><msubsup><mi>a</mi><mi>k</mi><mo>*</mo></msubsup><mo>|</mo><mi>ℳ</mi><mo>]</mo></mrow></mrow><annotation 
encoding="application/x-tex">\displaystyle\mathbb{E}[a^{*}_{k}|\mathcal{M}]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E26.m2" class="ltx_Math" display="inline" alttext="\displaystyle\approx\frac{\sum_{j}T_{j,k}}{\sum_{j}|\mathcal{T}_{j}|}."><semantics><mrow><mrow><mi/><mo>≈</mo><mstyle displaystyle="true"><mfrac><mrow><msub><mo>∑</mo><mi>j</mi></msub><msub><mi>T</mi><mrow><mi>j</mi><mo>,</mo><mi>k</mi></mrow></msub></mrow><mrow><msub><mo>∑</mo><mi>j</mi></msub><mrow><mo fence="true">|</mo><msub><mi>𝒯</mi><mi>j</mi></msub><mo fence="true">|</mo></mrow></mrow></mfrac></mstyle></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle\approx\frac{\sum_{j}T_{j,k}}{\sum_{j}|\mathcal{T}_{j}|}.</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(26)</span></td></tr> </table> <p class="ltx_p">The approximation holds if the number of sites at which two or more mutations have occurred is small, and if the total number of segregating sites is large. 
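</p> <p class="ltx_p">As a rough illustration of the approximation in (26), the following minimal Python sketch sums branch lengths by the number of tips they subtend and normalizes by total tree length. The encoding of a gene tree as a list of (length, tips_below) pairs and the function names <span class="ltx_text ltx_font_typewriter">branch_length_by_tips</span> and <span class="ltx_text ltx_font_typewriter">expected_spectrum</span> are assumptions made for this example only; they are not part of the text above, and the two toy trees are invented.</p>
<pre class="ltx_verbatim">
```python
from collections import defaultdict


def branch_length_by_tips(tree):
    """Return {k: T_k}: total length of branches subtended by exactly k tips.

    `tree` is a hypothetical encoding: a list of (length, tips_below) pairs,
    one pair per branch of the gene tree at a single site.
    """
    totals = defaultdict(float)
    for length, tips_below in tree:
        totals[tips_below] += length
    return dict(totals)


def expected_spectrum(trees, n):
    """Approximate E[a_k | M] for k = 1..n-1 as sum_j T_{j,k} / sum_j |T_j|."""
    summed = [0.0] * (n + 1)
    for tree in trees:
        for k, t_k in branch_length_by_tips(tree).items():
            summed[k] += t_k
    # |T_j| only counts branches above 1..n-1 tips, so index n is excluded.
    total = sum(summed[1:n])
    return [summed[k] / total for k in range(1, n)]


# Two sites, three sampled chromosomes (toy values, not real data).
# Site 1: two tips coalesce at time 1, then join the third tip at time 3.
site1 = [(1.0, 1), (1.0, 1), (3.0, 1), (2.0, 2)]
# Site 2: two tips coalesce at time 2, then join the third tip at time 3.
site2 = [(2.0, 1), (2.0, 1), (3.0, 1), (1.0, 2)]
spectrum = expected_spectrum([site1, site2], n=3)
```
</pre>
<p class="ltx_p">For this toy input the summed branch lengths are 12 for one tip and 3 for two tips, so the sketch returns approximately 0.8 and 0.2. Because it ignores the exponential factor in (23), it is only meaningful in the regime described here, where multiple mutations per site are rare.</p> <p class="ltx_p">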
In other words, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p4.m1" class="ltx_Math" alttext="a_{k}^{*}" display="inline"><semantics><msubsup><mi>a</mi><mi>k</mi><mo>*</mo></msubsup><annotation encoding="application/x-tex">a_{k}^{*}</annotation></semantics></math> gives, to a good approximation, the chance that a genetic ancestor, chosen uniformly among those genetic ancestors of some (but not all) of the sample, is a genetic ancestor to exactly <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS4.p4.m2" class="ltx_Math" display="inline" alttext="k"><semantics><mi>k</mi><annotation encoding="application/x-tex">k</annotation></semantics></math> of the sampled chromosomes.</p> </div> </div> <div id="S1.SS5" class="ltx_subsection"> <h2 class="ltx_title ltx_title_subsection"><span class="ltx_tag ltx_tag_subsection">1.5 </span>Linkage</h2> <div id="S1.SS5.p1" class="ltx_para"> <p class="ltx_p">The previous statistics were <em class="ltx_emph">single-site</em> statistics that took their information from the branching structure of the pedigree and the differentiating action of mutation along it. Considering the relationships between multiple loci brings recombination into the picture. Perhaps the simplest summary of these relationships is the measure of <em class="ltx_emph">linkage disequilibrium</em>. 
It is a two-site statistic, and is in some sense a single-individual statistic.</p> </div> <div id="S1.SS5.p2" class="ltx_para"> <p class="ltx_p">Take two sites <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m1" class="ltx_Math" display="inline" alttext="\ell_{1}"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><annotation encoding="application/x-tex">\ell_{1}</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m2" class="ltx_Math" alttext="\ell_{2}" display="inline"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub><annotation encoding="application/x-tex">\ell_{2}</annotation></semantics></math>, at recombination distance <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m3" class="ltx_Math" display="inline" alttext="r"><semantics><mi>r</mi><annotation encoding="application/x-tex">r</annotation></semantics></math>, so that the mean number of crossovers that fall between them in a generation is <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m4" class="ltx_Math" alttext="r" display="inline"><semantics><mi>r</mi><annotation encoding="application/x-tex">r</annotation></semantics></math>. 
One statistic measuring association between alleles <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m5" class="ltx_Math" display="inline" alttext="A_{1}"><semantics><msub><mi>A</mi><mn>1</mn></msub><annotation encoding="application/x-tex">A_{1}</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m6" class="ltx_Math" display="inline" alttext="A_{2}"><semantics><msub><mi>A</mi><mn>2</mn></msub><annotation encoding="application/x-tex">A_{2}</annotation></semantics></math> at <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m7" class="ltx_Math" alttext="\ell_{1}" display="inline"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><annotation encoding="application/x-tex">\ell_{1}</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m8" class="ltx_Math" display="inline" alttext="\ell_{2}"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub><annotation encoding="application/x-tex">\ell_{2}</annotation></semantics></math> is</p> <table id="S1.EGx16" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E27" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E27.m1" class="ltx_Math" alttext="\displaystyle D_{\ell_{1}\ell_{2}}(A_{1},A_{2})=P_{\ell_{1}\ell_{2}}(A_{1}A_{2% })-P_{\ell_{1}}(A_{1})P_{\ell_{2}}(A_{2})," display="inline"><semantics><mrow><mrow><mrow><msub><mi>D</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><msub><mi>A</mi><mn>1</mn></msub><mo>,</mo><msub><mi>A</mi><mn>2</mn></msub></mrow><mo>)</mo></mrow></mrow><mo>=</mo><mrow><mrow><msub><mi>P</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi 
mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><msub><mi>A</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi>A</mi><mn>2</mn></msub></mrow><mo>)</mo></mrow></mrow><mo>-</mo><mrow><msub><mi>P</mi><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub></msub><mo>⁢</mo><mrow><mo>(</mo><msub><mi>A</mi><mn>1</mn></msub><mo>)</mo></mrow><mo>⁢</mo><msub><mi>P</mi><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></msub><mo>⁢</mo><mrow><mo>(</mo><msub><mi>A</mi><mn>2</mn></msub><mo>)</mo></mrow></mrow></mrow></mrow><mo>,</mo></mrow><annotation encoding="application/x-tex">\displaystyle D_{\ell_{1}\ell_{2}}(A_{1},A_{2})=P_{\ell_{1}\ell_{2}}(A_{1}A_{2% })-P_{\ell_{1}}(A_{1})P_{\ell_{2}}(A_{2}),</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(27)</span></td></tr> </table> <p class="ltx_p">where <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m9" class="ltx_Math" alttext="P_{\ell_{1}\ell_{2}}(11)" display="inline"><semantics><mrow><msub><mi>P</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow></msub><mo>⁢</mo><mrow><mo>(</mo><mn>11</mn><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">P_{\ell_{1}\ell_{2}}(11)</annotation></semantics></math> is the empirical frequency of chromosomes that have the ‘1’ allele at both sites <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m10" class="ltx_Math" alttext="\ell_{1}" display="inline"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><annotation encoding="application/x-tex">\ell_{1}</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m11" class="ltx_Math" display="inline" alttext="\ell_{2}"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub><annotation 
encoding="application/x-tex">\ell_{2}</annotation></semantics></math>, and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p2.m12" class="ltx_Math" alttext="P_{\ell_{1}}(1)" display="inline"><semantics><mrow><msub><mi>P</mi><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub></msub><mo>⁢</mo><mrow><mo>(</mo><mn>1</mn><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">P_{\ell_{1}}(1)</annotation></semantics></math> is similar. To measure association between the loci we square and sum over alleles, defining</p> <table id="S1.EGx17" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E28" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E28.m1" class="ltx_Math" display="inline" alttext="\displaystyle D_{\ell_{1}\ell_{2}}^{2}"><semantics><msubsup><mi>D</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow><mn>2</mn></msubsup><annotation encoding="application/x-tex">\displaystyle D_{\ell_{1}\ell_{2}}^{2}</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E28.m2" class="ltx_Math" alttext="\displaystyle=\sum_{A_{1},A_{2}}D_{\ell_{1}\ell_{2}}(A_{1},A_{2})^{2}" display="inline"><semantics><mrow><mi/><mo>=</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><msub><mi>A</mi><mn>1</mn></msub><mo>,</mo><msub><mi>A</mi><mn>2</mn></msub></mrow></munder></mstyle><mrow><msub><mi>D</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow></msub><mo>⁢</mo><msup><mrow><mo>(</mo><mrow><msub><mi>A</mi><mn>1</mn></msub><mo>,</mo><msub><mi>A</mi><mn>2</mn></msub></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow></mrow></mrow><annotation 
encoding="application/x-tex">\displaystyle=\sum_{A_{1},A_{2}}D_{\ell_{1}\ell_{2}}(A_{1},A_{2})^{2}</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(28)</span></td></tr> <tr id="S1.E29" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E29.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\sum_{A_{1},A_{2}}\left(P_{\ell_{1}\ell_{2}}(A_{1}A_{2})-P_{\ell% _{1}}(A_{1})P_{\ell_{2}}(A_{2})\right)^{2}"><semantics><mrow><mi/><mo>=</mo><mrow><mstyle displaystyle="true"><munder><mo movablelimits="false">∑</mo><mrow><msub><mi>A</mi><mn>1</mn></msub><mo>,</mo><msub><mi>A</mi><mn>2</mn></msub></mrow></munder></mstyle><msup><mrow><mo>(</mo><mrow><mrow><msub><mi>P</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><msub><mi>A</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi>A</mi><mn>2</mn></msub></mrow><mo>)</mo></mrow></mrow><mo>-</mo><mrow><msub><mi>P</mi><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub></msub><mo>⁢</mo><mrow><mo>(</mo><msub><mi>A</mi><mn>1</mn></msub><mo>)</mo></mrow><mo>⁢</mo><msub><mi>P</mi><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></msub><mo>⁢</mo><mrow><mo>(</mo><msub><mi>A</mi><mn>2</mn></msub><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\sum_{A_{1},A_{2}}\left(P_{\ell_{1}\ell_{2}}(A_{1}A_{2})-P_{\ell% _{1}}(A_{1})P_{\ell_{2}}(A_{2})\right)^{2}</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(29)</span></td></tr> </table> </div> <div id="S1.SS5.p3" 
class="ltx_para"> <p class="ltx_p">Now assume that the loci are biallelic, coded as <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m1" class="ltx_Math" alttext="\{0,1\}" display="inline"><semantics><mrow><mo>{</mo><mrow><mn>0</mn><mo>,</mo><mn>1</mn></mrow><mo>}</mo></mrow><annotation encoding="application/x-tex">\{0,1\}</annotation></semantics></math> (in which case <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m2" class="ltx_Math" alttext="D_{\ell_{1}\ell_{2}}^{2}=4\left(P_{\ell_{1}\ell_{2}}(11)-P_{\ell_{1}}(1)P_{% \ell_{2}}(1)\right)^{2}" display="inline"><semantics><mrow><msubsup><mi>D</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow><mn>2</mn></msubsup><mo>=</mo><mrow><mn>4</mn><mo>⁢</mo><msup><mrow><mo>(</mo><mrow><mrow><msub><mi>P</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow></msub><mo>⁢</mo><mrow><mo>(</mo><mn>11</mn><mo>)</mo></mrow></mrow><mo>-</mo><mrow><msub><mi>P</mi><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub></msub><mo>⁢</mo><mrow><mo>(</mo><mn>1</mn><mo>)</mo></mrow><mo>⁢</mo><msub><mi>P</mi><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></msub><mo>⁢</mo><mrow><mo>(</mo><mn>1</mn><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow></mrow><annotation encoding="application/x-tex">D_{\ell_{1}\ell_{2}}^{2}=4\left(P_{\ell_{1}\ell_{2}}(11)-P_{\ell_{1}}(1)P_{% \ell_{2}}(1)\right)^{2}</annotation></semantics></math>) and let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m3" class="ltx_Math" display="inline" alttext="I"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math>, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m4" class="ltx_Math" display="inline" alttext="J"><semantics><mi>J</mi><annotation 
encoding="application/x-tex">J</annotation></semantics></math>, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m5" class="ltx_Math" display="inline" alttext="K"><semantics><mi>K</mi><annotation encoding="application/x-tex">K</annotation></semantics></math>, and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m6" class="ltx_Math" display="inline" alttext="L"><semantics><mi>L</mi><annotation encoding="application/x-tex">L</annotation></semantics></math> be the indices of individuals chosen uniformly at random with replacement. Now let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m7" class="ltx_Math" display="inline" alttext="X_{I}"><semantics><msub><mi>X</mi><mi>I</mi></msub><annotation encoding="application/x-tex">X_{I}</annotation></semantics></math> be a randomly chosen allele at locus <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m8" class="ltx_Math" alttext="\ell_{1}" display="inline"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><annotation encoding="application/x-tex">\ell_{1}</annotation></semantics></math> for <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m9" class="ltx_Math" display="inline" alttext="I"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math> (i.e. 
either <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m10" class="ltx_Math" display="inline" alttext="G_{I\ell_{1}m}"><semantics><msub><mi>G</mi><mrow><mi>I</mi><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><mi>m</mi></mrow></msub><annotation encoding="application/x-tex">G_{I\ell_{1}m}</annotation></semantics></math> or <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m11" class="ltx_Math" alttext="G_{I\ell_{1}p}" display="inline"><semantics><msub><mi>G</mi><mrow><mi>I</mi><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><mi>p</mi></mrow></msub><annotation encoding="application/x-tex">G_{I\ell_{1}p}</annotation></semantics></math>), <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m12" class="ltx_Math" display="inline" alttext="Y_{I}"><semantics><msub><mi>Y</mi><mi>I</mi></msub><annotation encoding="application/x-tex">Y_{I}</annotation></semantics></math> be the same for locus <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m13" class="ltx_Math" display="inline" alttext="\ell_{2}"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub><annotation encoding="application/x-tex">\ell_{2}</annotation></semantics></math>, on the same chromosome as <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m14" class="ltx_Math" alttext="X_{I}" display="inline"><semantics><msub><mi>X</mi><mi>I</mi></msub><annotation encoding="application/x-tex">X_{I}</annotation></semantics></math>, and similarly for <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m15" class="ltx_Math" alttext="J" display="inline"><semantics><mi>J</mi><annotation encoding="application/x-tex">J</annotation></semantics></math>, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m16" class="ltx_Math" display="inline" alttext="K"><semantics><mi>K</mi><annotation encoding="application/x-tex">K</annotation></semantics></math>, and <math 
xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m17" class="ltx_Math" alttext="L" display="inline"><semantics><mi>L</mi><annotation encoding="application/x-tex">L</annotation></semantics></math>. Then</p> <table id="S1.EGx18" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E30" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E30.m1" class="ltx_Math" alttext="\displaystyle D_{\ell_{1}\ell_{2}}^{2}=\mathbb{P}\{X_{I}=X_{J}\&Y_{I}=Y_{J}\}-% 2\mathbb{P}\{X_{I}=X_{J}\&Y_{I}=Y_{K}\}+\mathbb{P}\{X_{I}=X_{J}\&Y_{K}=Y_{L}\}." display="inline"><semantics><mrow><msubsup><mi>D</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow><mn>2</mn></msubsup><mo>=</mo><mi>ℙ</mi><mrow><mo>{</mo><msub><mi>X</mi><mi>I</mi></msub><mo>=</mo><msub><mi>X</mi><mi>J</mi></msub><mi mathvariant="normal">&</mi><msub><mi>Y</mi><mi>I</mi></msub><mo>=</mo><msub><mi>Y</mi><mi>J</mi></msub><mo>}</mo></mrow><mo>-</mo><mn>2</mn><mi>ℙ</mi><mrow><mo>{</mo><msub><mi>X</mi><mi>I</mi></msub><mo>=</mo><msub><mi>X</mi><mi>J</mi></msub><mi mathvariant="normal">&</mi><msub><mi>Y</mi><mi>I</mi></msub><mo>=</mo><msub><mi>Y</mi><mi>K</mi></msub><mo>}</mo></mrow><mo>+</mo><mi>ℙ</mi><mrow><mo>{</mo><msub><mi>X</mi><mi>I</mi></msub><mo>=</mo><msub><mi>X</mi><mi>J</mi></msub><mi mathvariant="normal">&</mi><msub><mi>Y</mi><mi>K</mi></msub><mo>=</mo><msub><mi>Y</mi><mi>L</mi></msub><mo>}</mo></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle D_{\ell_{1}\ell_{2}}^{2}=\mathbb{P}\{X_{I}=X_{J}\&Y_{I}=Y_{J}\}-% 2\mathbb{P}\{X_{I}=X_{J}\&Y_{I}=Y_{K}\}+\mathbb{P}\{X_{I}=X_{J}\&Y_{K}=Y_{L}\}.</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(30)</span></td></tr> </table> <p class="ltx_p">(Note: 
this is an example of the more general idea of a “distance covariance”, here between <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m18" class="ltx_Math" display="inline" alttext="X_{I}"><semantics><msub><mi>X</mi><mi>I</mi></msub><annotation encoding="application/x-tex">X_{I}</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p3.m19" class="ltx_Math" alttext="Y_{I}" display="inline"><semantics><msub><mi>Y</mi><mi>I</mi></msub><annotation encoding="application/x-tex">Y_{I}</annotation></semantics></math>.) </p> </div> <div id="S1.SS5.p4" class="ltx_para"> <p class="ltx_p">These quantities are things that we can compute in terms of paths through the pedigree if we can assume that the appearance of mutations can be taken as independent of the pedigree. Let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p4.m1" class="ltx_Math" alttext="\tau_{1}(I,J)" display="inline"><semantics><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">\tau_{1}(I,J)</annotation></semantics></math> be the number of generations back to the common ancestor of the chosen chromosomes of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p4.m2" class="ltx_Math" alttext="I" display="inline"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p4.m3" class="ltx_Math" display="inline" alttext="J"><semantics><mi>J</mi><annotation encoding="application/x-tex">J</annotation></semantics></math> at locus <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p4.m4" class="ltx_Math" alttext="\ell_{1}" display="inline"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><annotation encoding="application/x-tex">\ell_{1}</annotation></semantics></math>, and similarly for <math 
xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p4.m5" class="ltx_Math" alttext="\tau_{2}(I,J)" display="inline"><semantics><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">\tau_{2}(I,J)</annotation></semantics></math> at locus <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p4.m6" class="ltx_Math" display="inline" alttext="\ell_{2}"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub><annotation encoding="application/x-tex">\ell_{2}</annotation></semantics></math>. Then under the infinite alleles model, with mutation rate <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p4.m7" class="ltx_Math" display="inline" alttext="\mu"><semantics><mi>μ</mi><annotation encoding="application/x-tex">\mu</annotation></semantics></math>,</p> <table id="S1.EGx19" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E31" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E31.m1" class="ltx_Math" alttext="\displaystyle\mathbb{P}\{X_{I}=X_{J}\&Y_{K}=Y_{L}\}" display="inline"><semantics><mrow><mi>ℙ</mi><mrow><mo>{</mo><msub><mi>X</mi><mi>I</mi></msub><mo>=</mo><msub><mi>X</mi><mi>J</mi></msub><mi mathvariant="normal">&</mi><msub><mi>Y</mi><mi>K</mi></msub><mo>=</mo><msub><mi>Y</mi><mi>L</mi></msub><mo>}</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\mathbb{P}\{X_{I}=X_{J}\&Y_{K}=Y_{L}\}</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E31.m2" class="ltx_Math" display="inline" 
alttext="\displaystyle=\mathbb{E}[\exp(-2\mu(\tau_{1}(I,J)+\tau_{2}(K,L)))]."><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>+</mo><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}[\exp(-2\mu(\tau_{1}(I,J)+\tau_{2}(K,L)))].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(31)</span></td></tr> </table> <p class="ltx_p">Similar equations for the other terms lead to</p> <table id="S1.EGx20" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E32" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E32.m1" class="ltx_Math" alttext="\displaystyle D_{\ell_{1}\ell_{2}}^{2}" display="inline"><semantics><msubsup><mi>D</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow><mn>2</mn></msubsup><annotation encoding="application/x-tex">\displaystyle D_{\ell_{1}\ell_{2}}^{2}</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E32.m2" class="ltx_Math" alttext="\displaystyle=\mathbb{E}[\exp(-2\mu(\tau_{1}(I,J)+\tau_{2}(I,J)))]-2\mathbb{E}[% \exp(-2\mu(\tau_{1}(I,J)+\tau_{2}(I,K)))]+\mathbb{E}[\exp(-2\mu(\tau_{1}(I,J)+% \tau_{2}(K,L)))]" 
display="inline"><semantics><mrow><mi/><mo>=</mo><mrow><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>+</mo><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>+</mo><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow><mo>+</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>+</mo><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}[\exp(-2\mu(\tau_{1}(I,J)+\tau_{2}(I,J)))]-2\mathbb{E}[% \exp(-2\mu(\tau_{1}(I,J)+\tau_{2}(I,K)))]+\mathbb{E}[\exp(-2\mu(\tau_{1}(I,J)+% \tau_{2}(K,L)))]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle 
ltx_align_right"><span class="ltx_tag ltx_tag_equation">(32)</span></td></tr> <tr id="S1.E33" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E33.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\cov[e^{-2\mu\tau_{1}(I,J)},e^{-2\mu\tau_{2}(I,J)}]-2\cov[e^{-2% \mu\tau_{1}(I,J)},e^{-2\mu\tau_{2}(I,K)}]+\cov[e^{-2\mu\tau_{1}(I,J)},e^{-2\mu% \tau_{2}(K,L)}]"><semantics><mrow><mi/><mo>=</mo><mrow><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>,</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup></mrow><mo>]</mo></mrow></mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>,</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup></mrow><mo>]</mo></mrow></mrow></mrow><mo>+</mo><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>,</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>
μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup></mrow><mo>]</mo></mrow></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\cov[e^{-2\mu\tau_{1}(I,J)},e^{-2\mu\tau_{2}(I,J)}]-2\cov[e^{-2% \mu\tau_{1}(I,J)},e^{-2\mu\tau_{2}(I,K)}]+\cov[e^{-2\mu\tau_{1}(I,J)},e^{-2\mu% \tau_{2}(K,L)}]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(33)</span></td></tr> <tr id="S1.E34" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E34.m2" class="ltx_Math" alttext="\displaystyle\approx 4\mu^{2}\left(\cov[\tau_{1}(I,J),\tau_{2}(I,J)]-2\cov[% \tau_{1}(I,J),\tau_{2}(I,K)]+\cov[\tau_{1}(I,J),\tau_{2}(K,L)]\right)," display="inline"><semantics><mrow><mrow><mi/><mo>≈</mo><mrow><mn>4</mn><mo>⁢</mo><msup><mi>μ</mi><mn>2</mn></msup><mo>⁢</mo><mrow><mo>(</mo><mrow><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>,</mo><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow><mo>]</mo></mrow></mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>,</mo><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow><mo>]</mo></mrow></mrow></mrow><mo>+</mo><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>,</mo><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow><mo>]</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow><mo>,</mo></mrow><annotation encoding="application/x-tex">\displaystyle\approx 4\mu^{2}\left(\cov[\tau_{1}(I,J),\tau_{2}(I,J)]-2\cov[% \tau_{1}(I,J),\tau_{2}(I,K)]+\cov[\tau_{1}(I,J),\tau_{2}(K,L)]\right),</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(34)</span></td></tr> </table> <p class="ltx_p">where the final approximation follows from a first-order expansion of the exponentials and holds if the expected number of mutations per site (<math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p4.m8" class="ltx_Math" display="inline" alttext="\mu\tau"><semantics><mrow><mi>μ</mi><mo>⁢</mo><mi>τ</mi></mrow><annotation encoding="application/x-tex">\mu\tau</annotation></semantics></math>) is small.</p> </div> <div id="S1.SS5.p5" class="ltx_para"> <p class="ltx_p">What does <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p5.m1" class="ltx_Math" display="inline" alttext="D_{\ell_{1}\ell_{2}}^{2}"><semantics><msubsup><mi>D</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow><mn>2</mn></msubsup><annotation encoding="application/x-tex">D_{\ell_{1}\ell_{2}}^{2}</annotation></semantics></math> have to say about the structure of the ancestral recombination graph? Intuitively, since it is the squared correlation between alleles at two loci on the same chromosome, it should tell us how much those loci tend to stick together. 
This is reflected in the formula above, which interprets <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p5.m2" class="ltx_Math" display="inline" alttext="D_{\ell_{1}\ell_{2}}^{2}"><semantics><msubsup><mi>D</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow><mn>2</mn></msubsup><annotation encoding="application/x-tex">D_{\ell_{1}\ell_{2}}^{2}</annotation></semantics></math> in terms of covariances of times back to most recent common ancestors at the two sites.</p> </div> <div id="S1.SS5.p6" class="ltx_para"> <p class="ltx_p">We can do a little more to make these covariances interpretable, in terms of the recombination distance between <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m1" class="ltx_Math" display="inline" alttext="\ell_{1}"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><annotation encoding="application/x-tex">\ell_{1}</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m2" class="ltx_Math" alttext="\ell_{2}" display="inline"><semantics><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub><annotation encoding="application/x-tex">\ell_{2}</annotation></semantics></math>. 
For convenience, let <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m3" class="ltx_Math" display="inline" alttext="Z_{1}(I,J)=e^{-2\mu\tau_{1}(I,J)}-\mathbb{E}[e^{-2\mu\tau_{1}(I,J)}]"><semantics><mrow><mrow><msub><mi>Z</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>=</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>-</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>]</mo></mrow></mrow></mrow></mrow><annotation encoding="application/x-tex">Z_{1}(I,J)=e^{-2\mu\tau_{1}(I,J)}-\mathbb{E}[e^{-2\mu\tau_{1}(I,J)}]</annotation></semantics></math>, etcetera. 
Again assuming independence of mutation and the pedigree, given <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m4" class="ltx_Math" display="inline" alttext="\tau_{1}(I,J)"><semantics><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">\tau_{1}(I,J)</annotation></semantics></math>, the probability that there was no recombination between the loci along the path between <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m5" class="ltx_Math" alttext="I" display="inline"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m6" class="ltx_Math" display="inline" alttext="J"><semantics><mi>J</mi><annotation encoding="application/x-tex">J</annotation></semantics></math> is <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m7" class="ltx_Math" display="inline" alttext="\exp(-2r\tau_{1}(I,J))"><semantics><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">\exp(-2r\tau_{1}(I,J))</annotation></semantics></math>; in this case, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m8" class="ltx_Math" display="inline" alttext="\tau_{1}(I,J)=\tau_{2}(I,J)"><semantics><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>=</mo><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\tau_{1}(I,J)=\tau_{2}(I,J)</annotation></semantics></math>. 
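</p> <p class="ltx_p">As a rough numerical check on this exponential form, the following Python sketch compares it with a discrete-generation calculation. It assumes, which the text does not state, that the recombination distance is the probability of a recombination between the two loci in a single meiosis, and that the path between the two sampled chromosomes through their common ancestor passes through twice as many meioses as there are generations back to that ancestor. The function names and the parameter values are hypothetical and chosen only for illustration.</p>

```python
import math


def no_recombination_exact(r, tau):
    """Probability that none of the 2*tau meioses on the path recombines.

    Assumes each meiosis recombines between the loci independently
    with probability r (an assumption made for this sketch).
    """
    return (1.0 - r) ** (2 * tau)


def no_recombination_approx(r, tau):
    """Exponential form exp(-2*r*tau) used in the text."""
    return math.exp(-2.0 * r * tau)


if __name__ == "__main__":
    r = 1e-3  # hypothetical per-meiosis recombination probability
    for tau in (10, 100, 1000):
        exact = no_recombination_exact(r, tau)
        approx = no_recombination_approx(r, tau)
        print(tau, exact, approx, abs(exact - approx))
```

<p class="ltx_p">Under these assumptions the two values agree closely whenever the per-meiosis recombination probability is small.</p> <p class="ltx_p">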
Suppose that in the complementary case, when there was recombination, <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m9" class="ltx_Math" alttext="\tau_{2}" display="inline"><semantics><msub><mi>τ</mi><mn>2</mn></msub><annotation encoding="application/x-tex">\tau_{2}</annotation></semantics></math> is (conditionally) independent of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m10" class="ltx_Math" display="inline" alttext="\tau_{1}"><semantics><msub><mi>τ</mi><mn>1</mn></msub><annotation encoding="application/x-tex">\tau_{1}</annotation></semantics></math>; this is not true, but not too bad either. The first term in the formula for <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m11" class="ltx_Math" alttext="D_{\ell_{1}\ell_{2}}^{2}" display="inline"><semantics><msubsup><mi>D</mi><mrow><msub><mi mathvariant="normal">ℓ</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi mathvariant="normal">ℓ</mi><mn>2</mn></msub></mrow><mn>2</mn></msubsup><annotation encoding="application/x-tex">D_{\ell_{1}\ell_{2}}^{2}</annotation></semantics></math> then decays exponentially with <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p6.m12" class="ltx_Math" display="inline" alttext="r"><semantics><mi>r</mi><annotation encoding="application/x-tex">r</annotation></semantics></math>:</p> <table id="S1.EGx21" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E35" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E35.m1" class="ltx_Math" display="inline" 
alttext="\displaystyle\cov[e^{-2\mu\tau_{1}(I,J)},e^{-2\mu\tau_{2}(I,J)}]"><semantics><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>,</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup></mrow><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\cov[e^{-2\mu\tau_{1}(I,J)},e^{-2\mu\tau_{2}(I,J)}]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E35.m2" class="ltx_Math" alttext="\displaystyle=\mathbb{E}[Z_{1}(I,J)Z_{2}(I,J)]" display="inline"><semantics><mrow><mi/><mo>=</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><msub><mi>Z</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow><mo>⁢</mo><msub><mi>Z</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}[Z_{1}(I,J)Z_{2}(I,J)]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(35)</span></td></tr> <tr id="S1.E36" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E36.m2" class="ltx_Math" display="inline" 
alttext="\displaystyle\approx\mathbb{E}\left[e^{-2r\tau_{1}(I,J)}Z_{1}(I,J)^{2}\right]"><semantics><mrow><mi/><mo>≈</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>⁢</mo><msub><mi>Z</mi><mn>1</mn></msub><mo>⁢</mo><msup><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow><mo>]</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\approx\mathbb{E}\left[e^{-2r\tau_{1}(I,J)}Z_{1}(I,J)^{2}\right]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(36)</span></td></tr> <tr id="S1.E37" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E37.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\mathbb{E}\left[e^{-2r\tau_{1}(I,J)}\left(e^{-2\mu\tau_{1}(I,J)}% 
-\mathbb{E}[e^{-2\mu\tau_{1}(I,J)}]\right)^{2}\right]."><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>⁢</mo><msup><mrow><mo>(</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>-</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>]</mo></mrow></mrow></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow><mo>]</mo></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}\left[e^{-2r\tau_{1}(I,J)}\left(e^{-2\mu\tau_{1}(I,J)}% -\mathbb{E}[e^{-2\mu\tau_{1}(I,J)}]\right)^{2}\right].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(37)</span></td></tr> </table> </div> <div id="S1.SS5.p7" class="ltx_para"> <p class="ltx_p">Now take the second term. 
The most obvious way that the genealogy induces correlations between <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m1" class="ltx_Math" display="inline" alttext="\tau_{1}(I,J)"><semantics><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">\tau_{1}(I,J)</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m2" class="ltx_Math" alttext="\tau_{2}(I,K)" display="inline"><semantics><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">\tau_{2}(I,K)</annotation></semantics></math> occurs if the most recent common ancestor of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m3" class="ltx_Math" alttext="I" display="inline"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m4" class="ltx_Math" display="inline" alttext="J"><semantics><mi>J</mi><annotation encoding="application/x-tex">J</annotation></semantics></math> is the same as that of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m5" class="ltx_Math" display="inline" alttext="I"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m6" class="ltx_Math" display="inline" alttext="K"><semantics><mi>K</mi><annotation encoding="application/x-tex">K</annotation></semantics></math>, in which case <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m7" class="ltx_Math" display="inline" 
alttext="\tau_{1}(I,J)=\tau_{1}(I,K)"><semantics><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>=</mo><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\tau_{1}(I,J)=\tau_{1}(I,K)</annotation></semantics></math> (see figure <a href="#S1.F2" title="Figure 2 ‣ 1.5 Linkage ‣ 1 Summary statistics" class="ltx_ref"><span class="ltx_text ltx_ref_tag">2</span></a>A), and there is no recombination along the whole genealogy back to this MRCA. Define <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m8" class="ltx_Math" alttext="\tau_{1}(I,J,K)" display="inline"><semantics><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow><annotation encoding="application/x-tex">\tau_{1}(I,J,K)</annotation></semantics></math> to be the age of this MRCA. 
If we now assume that the case in which there was a recombination on the path from <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m9" class="ltx_Math" alttext="I" display="inline"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math> to <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m10" class="ltx_Math" display="inline" alttext="K"><semantics><mi>K</mi><annotation encoding="application/x-tex">K</annotation></semantics></math> contributes nothing to the covariance, then, since the probability that <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m11" class="ltx_Math" display="inline" alttext="I"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math> is in this position in the genealogy is <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p7.m12" class="ltx_Math" alttext="1/3" display="inline"><semantics><mrow><mn>1</mn><mo>/</mo><mn>3</mn></mrow><annotation encoding="application/x-tex">1/3</annotation></semantics></math>, </p> <table id="S1.EGx22" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E38" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E38.m1" class="ltx_Math" display="inline" alttext="\displaystyle\cov[e^{-2\mu\tau_{1}(I,J)},e^{-2\mu\tau_{2}(I,K)}]"><semantics><mrow><mi>cov</mi><mo>⁡</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>,</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup></mrow><mo>]</mo></mrow></mrow><annotation 
encoding="application/x-tex">\displaystyle\cov[e^{-2\mu\tau_{1}(I,J)},e^{-2\mu\tau_{2}(I,K)}]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E38.m2" class="ltx_Math" alttext="\displaystyle=\mathbb{E}[Z_{1}(I,J)Z_{2}(I,K)]" display="inline"><semantics><mrow><mi/><mo>=</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><msub><mi>Z</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow><mo>⁢</mo><msub><mi>Z</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}[Z_{1}(I,J)Z_{2}(I,K)]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(38)</span></td></tr> <tr id="S1.E39" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E39.m2" class="ltx_Math" alttext="\displaystyle\approx\mathbb{E}\left[e^{-2r\tau_{1}(I,J,K)}Z_{1}(I,J)^{2}|\tau_% {1}(I,J)=\tau_{1}(I,J,K)\right]" 
display="inline"><semantics><mrow><mo>≈</mo><mi>𝔼</mi><mrow><mo>[</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><msub><mi>Z</mi><mn>1</mn></msub><msup><mrow><mo>(</mo><mi>I</mi><mo>,</mo><mi>J</mi><mo>)</mo></mrow><mn>2</mn></msup><mo>|</mo><msub><mi>τ</mi><mn>1</mn></msub><mrow><mo>(</mo><mi>I</mi><mo>,</mo><mi>J</mi><mo>)</mo></mrow><mo>=</mo><msub><mi>τ</mi><mn>1</mn></msub><mrow><mo>(</mo><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi><mo>)</mo></mrow><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\approx\mathbb{E}\left[e^{-2r\tau_{1}(I,J,K)}Z_{1}(I,J)^{2}|\tau_% {1}(I,J)=\tau_{1}(I,J,K)\right]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(39)</span></td></tr> <tr id="S1.E40" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E40.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\frac{1}{3}\mathbb{E}\left[e^{-2r\tau_{1}(I,J,K)}\left(e^{-2\mu% \tau_{1}(I,J,K)}-\mathbb{E}[e^{-2\mu\tau_{1}(I,J)}]\right)^{2}\right]."><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mstyle 
displaystyle="true"><mfrac><mn>1</mn><mn>3</mn></mfrac></mstyle><mo>⁢</mo><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>⁢</mo><msup><mrow><mo>(</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>-</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>]</mo></mrow></mrow></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow><mo>]</mo></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\frac{1}{3}\mathbb{E}\left[e^{-2r\tau_{1}(I,J,K)}\left(e^{-2\mu% \tau_{1}(I,J,K)}-\mathbb{E}[e^{-2\mu\tau_{1}(I,J)}]\right)^{2}\right].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(40)</span></td></tr> </table> </div> <div id="S1.F2" class="ltx_figure"><object data="coal-time-correlation.svg" id="S1.F2.g1" class="ltx_graphics ltx_centering" width="223" height="98" alt=""/> <div class="ltx_caption"><span class="ltx_tag ltx_tag_figure">Figure 2: </span><span class="ltx_text ltx_font_bold">(A)</span> The tree topology in which the most recent common ancestor of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F2.m9" class="ltx_Math" alttext="I" display="inline"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math> and <math 
xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F2.m10" class="ltx_Math" alttext="J" display="inline"><semantics><mi>J</mi><annotation encoding="application/x-tex">J</annotation></semantics></math> is the same as the most recent common ancestor of <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F2.m11" class="ltx_Math" display="inline" alttext="I"><semantics><mi>I</mi><annotation encoding="application/x-tex">I</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F2.m12" class="ltx_Math" display="inline" alttext="K"><semantics><mi>K</mi><annotation encoding="application/x-tex">K</annotation></semantics></math>, so that <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F2.m13" class="ltx_Math" alttext="\tau(I,J)=\tau(I,K)" display="inline"><semantics><mrow><mrow><mi>τ</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>=</mo><mrow><mi>τ</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\tau(I,J)=\tau(I,K)</annotation></semantics></math>. 
<span class="ltx_text ltx_font_bold">(B)</span> The analogous topology for four samples, in which <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F2.m14" class="ltx_Math" display="inline" alttext="\tau(I,J)=\tau(K,L)"><semantics><mrow><mrow><mi>τ</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>=</mo><mrow><mi>τ</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\tau(I,J)=\tau(K,L)</annotation></semantics></math>; exchanging <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F2.m15" class="ltx_Math" alttext="K" display="inline"><semantics><mi>K</mi><annotation encoding="application/x-tex">K</annotation></semantics></math> and <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.F2.m16" class="ltx_Math" display="inline" alttext="L"><semantics><mi>L</mi><annotation encoding="application/x-tex">L</annotation></semantics></math> would work as well. </div> </div> <div id="S1.SS5.p8" class="ltx_para"> <p class="ltx_p">The third term is handled in the same way. 
If the situation in figure <a href="#S1.F2" title="Figure 2 ‣ 1.5 Linkage ‣ 1 Summary statistics" class="ltx_ref"><span class="ltx_text ltx_ref_tag">2</span></a>B occurs (which it does with probability <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p8.m1" class="ltx_Math" display="inline" alttext="1/6"><semantics><mrow><mn>1</mn><mo>/</mo><mn>6</mn></mrow><annotation encoding="application/x-tex">1/6</annotation></semantics></math>) then <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p8.m2" class="ltx_Math" display="inline" alttext="\tau_{1}(I,J)=\tau_{1}(K,L)"><semantics><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>=</mo><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\tau_{1}(I,J)=\tau_{1}(K,L)</annotation></semantics></math>. As before,</p> <table id="S1.EGx23" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E41" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E41.m1" class="ltx_Math" alttext="\displaystyle\mathbb{E}[\exp(-2\mu(\tau_{1}(I,J)+\tau_{2}(K,L)))]" display="inline"><semantics><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mi>exp</mi><mo>⁡</mo><mrow><mo>(</mo><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><mrow><mo>(</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>+</mo><mrow><msub><mi>τ</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow><annotation 
encoding="application/x-tex">\displaystyle\mathbb{E}[\exp(-2\mu(\tau_{1}(I,J)+\tau_{2}(K,L)))]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E41.m2" class="ltx_Math" alttext="\displaystyle=\mathbb{E}[Z_{1}(I,J)Z_{2}(K,L)]" display="inline"><semantics><mrow><mi/><mo>=</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><msub><mi>Z</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow><mo>⁢</mo><msub><mi>Z</mi><mn>2</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">\displaystyle=\mathbb{E}[Z_{1}(I,J)Z_{2}(K,L)]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(41)</span></td></tr> <tr id="S1.E42" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E42.m2" class="ltx_Math" alttext="\displaystyle\approx\mathbb{E}\left[e^{-2r\tau_{1}(I,J,K,L)}Z_{1}(I,J)^{2}|% \tau_{1}(I,J)=\tau_{1}(I,J,K,L)\right]" 
display="inline"><semantics><mrow><mo>≈</mo><mi>𝔼</mi><mrow><mo>[</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><msub><mi>Z</mi><mn>1</mn></msub><msup><mrow><mo>(</mo><mi>I</mi><mo>,</mo><mi>J</mi><mo>)</mo></mrow><mn>2</mn></msup><mo>|</mo><msub><mi>τ</mi><mn>1</mn></msub><mrow><mo>(</mo><mi>I</mi><mo>,</mo><mi>J</mi><mo>)</mo></mrow><mo>=</mo><msub><mi>τ</mi><mn>1</mn></msub><mrow><mo>(</mo><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi><mo>,</mo><mi>L</mi><mo>)</mo></mrow><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\approx\mathbb{E}\left[e^{-2r\tau_{1}(I,J,K,L)}Z_{1}(I,J)^{2}|% \tau_{1}(I,J)=\tau_{1}(I,J,K,L)\right]</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(42)</span></td></tr> <tr id="S1.E43" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"/> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E43.m2" class="ltx_Math" display="inline" alttext="\displaystyle=\frac{1}{6}\mathbb{E}\left[e^{-2r\tau_{1}(I,J,K,L)}\left(e^{-2% \mu\tau_{1}(I,J,K,L)}-\mathbb{E}[e^{-2\mu\tau_{1}(I,J)}]\right)^{2}\right]."><semantics><mrow><mrow><mi/><mo>=</mo><mrow><mstyle 
displaystyle="true"><mfrac><mn>1</mn><mn>6</mn></mfrac></mstyle><mo>⁢</mo><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>⁢</mo><msup><mrow><mo>(</mo><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>-</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>μ</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>]</mo></mrow></mrow></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow><mo>]</mo></mrow></mrow></mrow><mo>.</mo></mrow><annotation encoding="application/x-tex">\displaystyle=\frac{1}{6}\mathbb{E}\left[e^{-2r\tau_{1}(I,J,K,L)}\left(e^{-2% \mu\tau_{1}(I,J,K,L)}-\mathbb{E}[e^{-2\mu\tau_{1}(I,J)}]\right)^{2}\right].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(43)</span></td></tr> </table> </div> <div id="S1.SS5.p9" class="ltx_para"> <p class="ltx_p">Combining these gets us an approximate expression for <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p9.m1" class="ltx_Math" display="inline" alttext="\mathbb{E}[D_{j_{1}j_{2}}^{2}]"><semantics><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msubsup><mi>D</mi><mrow><msub><mi>j</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi>j</mi><mn>2</mn></msub></mrow><mn>2</mn></msubsup><mo>]</mo></mrow></mrow><annotation 
encoding="application/x-tex">\mathbb{E}[D_{j_{1}j_{2}}^{2}]</annotation></semantics></math> that is somewhat unwieldy but is expressed in terms of the ages of the most recent common ancestors of two, three, and four samples. Keeping only the leading-order terms in <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p9.m2" class="ltx_Math" display="inline" alttext="\mu"><semantics><mi>μ</mi><annotation encoding="application/x-tex">\mu</annotation></semantics></math> (that is, expanding each mutation exponential to first order), and letting <math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.SS5.p9.m3" class="ltx_Math" alttext="t=\mathbb{E}[\tau_{1}(I,J)]" display="inline"><semantics><mrow><mi>t</mi><mo>=</mo><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>]</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">t=\mathbb{E}[\tau_{1}(I,J)]</annotation></semantics></math>,</p> <table id="S1.EGx24" class="ltx_equationgroup ltx_eqn_align"> <tr id="S1.E44" class="ltx_equation ltx_align_baseline"> <td class="ltx_eqn_pad"/> <td class="ltx_td ltx_align_right"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E44.m1" class="ltx_Math" display="inline" alttext="\displaystyle\mathbb{E}[D_{j_{1}j_{2}}^{2}]"><semantics><mrow><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><msubsup><mi>D</mi><mrow><msub><mi>j</mi><mn>1</mn></msub><mo>⁢</mo><msub><mi>j</mi><mn>2</mn></msub></mrow><mn>2</mn></msubsup><mo>]</mo></mrow></mrow><annotation encoding="application/x-tex">\displaystyle\mathbb{E}[D_{j_{1}j_{2}}^{2}]</annotation></semantics></math></td> <td class="ltx_td ltx_align_left"><math xmlns="http://www.w3.org/1998/Math/MathML" id="S1.E44.m2" class="ltx_Math" display="inline" alttext="\displaystyle\approx 4\mu^{2}\mathbb{E}\left[e^{-2r\tau_{1}(I,J)}(\tau_{1}(I,J% )-t)^{2}-\frac{2}{3}e^{-2r\tau_{1}(I,J,K)}(\tau_{1}(I,J,K)-t)^{2}+\frac{1}{6}e% 
^{-2r\tau_{1}(I,J,K,L)}(\tau_{1}(I,J,K,L)-t)^{2}\right]."><semantics><mrow><mrow><mi/><mo>≈</mo><mrow><mn>4</mn><mo>⁢</mo><msup><mi>μ</mi><mn>2</mn></msup><mo>⁢</mo><mi>𝔼</mi><mo>⁢</mo><mrow><mo>[</mo><mrow><mrow><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>⁢</mo><msup><mrow><mo>(</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi></mrow><mo>)</mo></mrow></mrow><mo>-</mo><mi>t</mi></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow><mo>-</mo><mrow><mstyle displaystyle="true"><mfrac><mn>2</mn><mn>3</mn></mfrac></mstyle><mo>⁢</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>⁢</mo><msup><mrow><mo>(</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi></mrow><mo>)</mo></mrow></mrow><mo>-</mo><mi>t</mi></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow><mo>+</mo><mrow><mstyle displaystyle="true"><mfrac><mn>1</mn><mn>6</mn></mfrac></mstyle><mo>⁢</mo><msup><mi>e</mi><mrow><mo>-</mo><mrow><mn>2</mn><mo>⁢</mo><mi>r</mi><mo>⁢</mo><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow></mrow></msup><mo>⁢</mo><msup><mrow><mo>(</mo><mrow><mrow><msub><mi>τ</mi><mn>1</mn></msub><mo>⁢</mo><mrow><mo>(</mo><mrow><mi>I</mi><mo>,</mo><mi>J</mi><mo>,</mo><mi>K</mi><mo>,</mo><mi>L</mi></mrow><mo>)</mo></mrow></mrow><mo>-</mo><mi>t</mi></mrow><mo>)</mo></mrow><mn>2</mn></msup></mrow></mrow><mo>]</mo></mrow></mrow></mrow><mo>.</mo></mrow><annotation 
encoding="application/x-tex">\displaystyle\approx 4\mu^{2}\mathbb{E}\left[e^{-2r\tau_{1}(I,J)}(\tau_{1}(I,J% )-t)^{2}-\frac{2}{3}e^{-2r\tau_{1}(I,J,K)}(\tau_{1}(I,J,K)-t)^{2}+\frac{1}{6}e% ^{-2r\tau_{1}(I,J,K,L)}(\tau_{1}(I,J,K,L)-t)^{2}\right].</annotation></semantics></math></td> <td class="ltx_eqn_pad"/> <td rowspan="1" class="ltx_align_middle ltx_align_right"><span class="ltx_tag ltx_tag_equation">(44)</span></td></tr> </table> </div> </div> </div> <div id="bib" class="ltx_bibliography"> <h1 class="ltx_title ltx_title_bibliography">References</h1> <ul id="L1" class="ltx_biblist"> <li id="bib.bib11" class="ltx_bibitem ltx_bib_article"><span class="ltx_bibtag ltx_bib_author-year ltx_role_refnum">R.R. Hudson (1990)</span> <span class="ltx_bibblock"><span class="ltx_text ltx_bib_title">Gene genealogies and the coalescent process</span>, </span> <span class="ltx_bibblock"><span class="ltx_text ltx_bib_journal">Oxford surveys in evolutionary biology</span> <span class="ltx_text ltx_bib_volume">7</span> (<span class="ltx_text ltx_bib_number">1</span>), <span class="ltx_text ltx_bib_pages"> pp. 44</span>. </span> <span class="ltx_bibblock">External Links: <span class="ltx_text ltx_bib_links"><a href="http://web.eve.ucdavis.edu/pbg298/pdfs/Hudson_OxfordSurveysEvolBiol_1991.pdf" title="" class="ltx_ref ltx_bib_external">Link</a></span>. </span> <span class="ltx_bibblock ltx_bib_cited">Cited by: <a href="#S1.SS1.p1" title="1.1 Number of segregating sites ‣ 1 Summary statistics" class="ltx_ref"><span class="ltx_text ltx_ref_tag">1.1</span></a>, <a href="#S1.SS1.p4" title="1.1 Number of segregating sites ‣ 1 Summary statistics" class="ltx_ref"><span class="ltx_text ltx_ref_tag">1.1</span></a>. </span></li> <li id="bib.bib7" class="ltx_bibitem ltx_bib_article"><span class="ltx_bibtag ltx_bib_author-year ltx_role_refnum">J. Wakeley, L. King, B. S. Low and S. 
Ramachandran (2012)</span> <span class="ltx_bibblock"><span class="ltx_text ltx_bib_title">Gene genealogies within a fixed pedigree, and the robustness of Kingman’s coalescent</span>, </span> <span class="ltx_bibblock"><span class="ltx_text ltx_bib_journal">Genetics</span> <span class="ltx_text ltx_bib_volume">190</span> (<span class="ltx_text ltx_bib_number">4</span>), <span class="ltx_text ltx_bib_pages"> pp. 1433–1445</span>. </span> <span class="ltx_bibblock">External Links: <span class="ltx_text ltx_bib_links"><a href="http://dx.doi.org/10.1534/genetics.111.135574" title="" class="ltx_ref doi ltx_bib_external">Document</a>, <a href="http://www.ncbi.nlm.nih.gov/pubmed/22234858" title="" class="ltx_ref ltx_bib_external">Link</a></span>. </span> <span class="ltx_bibblock ltx_bib_cited">Cited by: <a href="#S1.p4" title="1 Summary statistics" class="ltx_ref"><span class="ltx_text ltx_ref_tag">1</span></a>. </span></li> </ul> </div> </div> </div> <div class="ltx_page_footer"> <div class="ltx_page_logo">Generated on Fri Feb 7 12:37:20 2014 by <a href="http://dlmf.nist.gov/LaTeXML/">LaTeXML</a></div></div> </div> </body> </html>