In order to generate a more inclusive dataset of Pseudomonas genes mapped to putative in-paralogs and putative orthologs in other Pseudomonas species/strains, we developed a Pseudomonas Orthologous Groups classification system.
To generate ortholog groups, pair-wise DIAMOND searches were run on all genomes in the database to find reciprocal best hits (RBHs) for each gene. These analyses often resulted in multiple candidate genes for RBH status, which were narrowed down by examining the similarity between the query's flanking genes and the hit's flanking genes. If two candidate genes were directly adjacent, they where both accepted as RBHs that involve putative in-parology.
Pairwise intra-genome DIAMOND searches were also performed to acquire in-paralog information (i.e. gene duplications occurring after species divergence). If two genes in one genome were reciprocally more similar to each other than to any gene in the other genomes, the two genes were designated putative in-paralogs. Ortholog groups are built by starting with a seed gene and then adding all genes to which there is a RBH or in-paralog relationship.
Every new gene added to an ortholog group was then treated as a seed gene and the addition process was repeated until all qualifying genes had been added. The result was the development of orthologous groups, specifically generated for Pseudomonas species genomes, which can be used to sort search results.
Strain | Locus Tag | Description | Same-Strain Members | Fragment ? | |
---|---|---|---|---|---|
Pseudomonas aeruginosa PAO579 | A161_15365 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa PAO581 - Assembly GCF_000468555.2 | M801_3119 |
competence ComEA helix-hairpin-helix repeat region domain protein
|
1 member |
![]() |
|
Pseudomonas aeruginosa PS42 | Q094_RS05820 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas aeruginosa PS50 | Q095_01947 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas aeruginosa PS75 | V562_01161 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa PSE305 | TY30_RS16675 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas aeruginosa RP73 | M062_16800 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa S04 90 | YH69_RS09745 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa S35004 | Q012_05447 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa S54485 - Assembly GCF_000481825.1 | Q007_05964 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa SCV20265 | SCV20265_1908 |
DNA uptake protein
|
1 member |
![]() |
|
Pseudomonas aeruginosa Stone 130 | HMPREF1223_08326 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa U2504 - Assembly GCF_000481785.1 | Q009_01617 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa UDL | Q006_02615 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa UM-01 | AAY82_RS22085 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas aeruginosa VRFPA04 - Assembly GCF_000473745.2 | P797_RS21520 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas aeruginosa WS136 | PAWS136_RS09090 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa X13273 | Q013_00921 |
competence protein ComEA
|
1 member |
![]() |
|
Pseudomonas aeruginosa X24509 - Assembly GCF_000481865.1 | Q005_01162 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas aeruginosa YL84 | AI22_RS24040 |
hypothetical protein
|
1 member |
![]() |