research

Geometric medians in reconciliation spaces

Abstract

In evolutionary biology, it is common to study how various entities evolve together, for example, how parasites coevolve with their host, or genes with their species. Coevolution is commonly modelled by considering certain maps or reconciliations from one evolutionary tree PP to another HH, all of which induce the same map ϕ\phi between the leaf-sets of PP and HH (corresponding to present-day associations). Recently, there has been much interest in studying spaces of reconciliations, which arise by defining some metric dd on the set Rec(P,H,ϕ)Rec(P,H,\phi) of all possible reconciliations between PP and HH. In this paper, we study the following question: How do we compute a geometric median for a given subset Ψ\Psi of Rec(P,H,ϕ)Rec(P,H,\phi) relative to dd, i.e. an element ψmedRec(P,H,ϕ)\psi_{med} \in Rec(P,H,\phi) such that ψΨd(ψmed,ψ)ψΨd(ψ,ψ) \sum_{\psi' \in \Psi} d(\psi_{med},\psi') \le \sum_{\psi' \in \Psi} d(\psi,\psi') holds for all ψRec(P,H,ϕ)\psi \in Rec(P,H,\phi)? For a model where so-called host-switches or transfers are not allowed, and for a commonly used metric dd called the edit-distance, we show that although the cardinality of Rec(P,H,ϕ)Rec(P,H,\phi) can be super-exponential, it is still possible to compute a geometric median for a set Ψ\Psi in Rec(P,H,ϕ)Rec(P,H,\phi) in polynomial time. We expect that this result could be useful for computing a summary or consensus for a set of reconciliations (e.g. for a set of suboptimal reconciliations).Comment: 12 pages, 1 figur

    Similar works