Data from: "Inferring Species Trees from Gene-Family with Duplication and Loss using Multi-Copy Gene-Family Tree Decomposition"

  • James Willson (Creator)
  • Mrinmoy Saha Roddur (Creator)
  • Liu Baqiao (Creator)
  • Paul Zaharias (Creator)
  • Tandy Warnow (Creator)

Dataset

Description

Data sets from "Inferring Species Trees from Gene-Family with Duplication and Loss using Multi-Copy Gene-Family Tree Decomposition." The collection contains trees and sequences simulated with gene duplication and loss under a variety of model conditions.

Note:

- trees.tar.gz contains the simulated gene-family trees used in our experiments (both the true trees from SimPhy and the trees estimated from alignments).

- sequences.tar.gz contains the simulated sequence data used for estimating the gene-family trees and for the concatenation analysis.

- biological.tar.gz contains the gene trees used as inputs for the experiments we ran on empirical data sets, as well as the species trees produced by the methods we tested on those data sets.

- stats.txt lists statistics (such as AD, MGTE, and average size) for our simulated model conditions.
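
The internal layout of the archives listed above is not described in this record, so a reasonable first step is to look at the member names before extracting anything. The following is a minimal sketch using Python's standard `tarfile` module. The helper name `list_archive_members` and its `limit` parameter are illustrative assumptions, not part of the data set, and the script assumes the archives have been downloaded to the current directory.

```python
import tarfile
from pathlib import Path


def list_archive_members(archive_path, limit=10):
    """Return up to `limit` member names from a gzip-compressed tar archive.

    Hypothetical helper for illustration; pass limit=None to list everything.
    """
    names = []
    with tarfile.open(archive_path, mode="r:gz") as archive:
        # Iterating the archive reads headers one at a time, so we can stop
        # early without scanning a large archive to the end.
        for member in archive:
            if limit is not None and len(names) >= limit:
                break
            names.append(member.name)
    return names


if __name__ == "__main__":
    # File names come from the list above; their presence in the current
    # directory is an assumption.
    for name in ("trees.tar.gz", "sequences.tar.gz", "biological.tar.gz"):
        path = Path(name)
        if path.exists():
            print(name, list_archive_members(path))
        else:
            print(name, "not found in the current directory")
```

Listing names first avoids unpacking a large archive just to learn how its directories are organized.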

Date made available: May 21, 2021
Publisher: University of Illinois Urbana-Champaign

Keywords

  • gene duplication and loss
  • simulated data
  • species-tree inference
