Abstract
Phylogenetic trees can reveal the origins of endosymbiotic lineages of bacteria and detect patterns of co-evolution with their hosts. Although taxon sampling can greatly affect phylogenetic and co-evolutionary inference, most hypotheses of endosymbiont relationships are based on few available bacterial sequences. Here we examined how different sampling strategies of Gammaproteobacteria sequences affect estimates of the number of endosymbiont lineages in parasitic sucking lice (Insecta: Phthirapatera: Anoplura). We estimated the number of louse endosymbiont lineages using both newly obtained and previously sequenced 16S rDNA bacterial sequences and more than 42,000 16S rDNA sequences from other Gammaproteobacteria. We also performed parametric and nonparametric bootstrapping experiments to examine the effects of phylogenetic error and uncertainty on these estimates. Sampling of 16S rDNA sequences affects the estimates of endosymbiont diversity in sucking lice until we reach a threshold of genetic diversity, the size of which depends on the sampling strategy. Sampling by maximizing the diversity of 16S rDNA sequences is more efficient than randomly sampling available 16S rDNA sequences. Although simulation results validate estimates of multiple endosymbiont lineages in sucking lice, the bootstrap results suggest that the precise number of endosymbiont origins is still uncertain.
Original language | English (US) |
---|---|
Article number | e2187 |
Pages (from-to) | e2187 |
Journal | PeerJ |
Volume | 4 |
DOIs | |
State | Published - 2016 |
Keywords
- INHS
- 16S rDNA
- Endosymbiosis
- Anoplura
- Sucking lice
- Gammaproteobacteria
- Phylogenetics
- Endosymbiont
ASJC Scopus subject areas
- General Agricultural and Biological Sciences
- General Biochemistry, Genetics and Molecular Biology
- General Neuroscience