Further genome sequencing would allow a similar analysis to provide the ‘definitive’ phylogeny of the Vibrio, but at much greater effort per strain than for MLSA [33]. MLSA schemes currently devised provide a 4SC-202 clinical trial mean field estimate of the phylogeny of Chromosome I; thus, as they are expanded to include increasing numbers of genes, those phylogenies are expected to agree with the phylogenies derived from studying the origins of replication. This suggests several genes that might be used in an MLSA of the Vibrionaceae, including Alpha, DnaN, and YidC from Chromosome
1 and ParA2 and GluP from Chromosome 2. These genes have potential primer sequences that are hypothetically capable of creating phylogenetic trees with the highest resolution and consistent signal so that they are comparable to the trees found in this study. It is a pleasing
conclusion that separate MLSA schemes will not have to be executed for each chromosome check details independently. Methods Chromosome Phylogenies Mean field approximation refers to the generalized phylogeny of the entire chromosome, regardless of differing histories. This was accomplished conceptually by means of concatenated gene trees for single copy homologous genes whose relatives are most easily determined and whose chromosomal affiliation is most certain. The restriction that the genes had to be single copy is meant to limit the analysis to orthologs while excluding paralogs.
To select the genes for this analysis, a database of Quisinostat mouse genomes was created. All the available Vibrionaceae (Vibrio and Photobacterium) genomes as well as an assortment of other see more gamma proteobacterial genomes (Additional file 6) were selected for analysis. All 62 genomes were broken down into lists of ORFs, which were entered into a MySQL database with their DNA and protein sequences as well as other identifying data. The entire suite of protein sequences were BLASTed against each other and the resulting hits were processed with orthoMCL v 1.4 to identify protein families [36]. A significant parameter used in orthoMCL was an inflation value of 1.5. Genes representing single copy gene families on the different chromosomes were aligned [37], stripped of their gaps, concatenated, and 100 kb, chosen as individual random sites, was chosen as the input for PhyML [38]. Phylogenies for Vibrio and Photobacterium chromosome I and II were based on the complete and incomplete published genomes with P. atlantica and Shewanella sp. ANA3 serving as the outgroup. Initially, Pseudoalteromonas haloplanktis was proposed as an outgroup for the chromosome II phylogeny. P. haloplanktis, unlike other sequenced pseudoalteromonads, has a second chromosome. However, that chromosome appears to have a distinct, plasmid-like origin of replication and a GC-skew that indicates unidirectional replication [39].