In order to generate a more inclusive dataset of Pseudomonas genes mapped to putative in-paralogs and putative orthologs in other Pseudomonas species/strains, we developed a Pseudomonas Orthologous Groups classification system.
To generate ortholog groups, pair-wise DIAMOND searches were run on all genomes in the database to find reciprocal best hits (RBHs) for each gene. These analyses often resulted in multiple candidate genes for RBH status, which were narrowed down by examining the similarity between the query's flanking genes and the hit's flanking genes. If two candidate genes were directly adjacent, they where both accepted as RBHs that involve putative in-parology.
Pairwise intra-genome DIAMOND searches were also performed to acquire in-paralog information (i.e. gene duplications occurring after species divergence). If two genes in one genome were reciprocally more similar to each other than to any gene in the other genomes, the two genes were designated putative in-paralogs. Ortholog groups are built by starting with a seed gene and then adding all genes to which there is a RBH or in-paralog relationship.
Every new gene added to an ortholog group was then treated as a seed gene and the addition process was repeated until all qualifying genes had been added. The result was the development of orthologous groups, specifically generated for Pseudomonas species genomes, which can be used to sort search results.
Strain | Locus Tag | Description | Same-Strain Members | Fragment ? | |
---|---|---|---|---|---|
Pseudomonas aeruginosa PA21_ST175 | H123_00815 |
hypothetical protein
|
2 same-strain members: H123_00815 H123_27308 |
![]() |
|
Pseudomonas aeruginosa PA21_ST175 | H123_27308 |
O-antigen chain length regulator
|
2 same-strain members: H123_00815 H123_27308 |
![]() |
|
Pseudomonas aeruginosa PA96 - Assembly GCF_000626655.2 | PA96_RS08965 |
chain-length determining protein
|
2 same-strain members: PA96_RS08965 PA96_RS21300 |
![]() |
|
Pseudomonas aeruginosa PA96 - Assembly GCF_000626655.2 | PA96_RS21300 |
hypothetical protein
|
2 same-strain members: PA96_RS08965 PA96_RS21300 |
![]() |
|
Pseudomonas aeruginosa PACS2 | A0K_RS07595 |
hypothetical protein
|
2 same-strain members: A0K_RS07595 A0K_RS19885 |
![]() |
|
Pseudomonas aeruginosa PACS2 | A0K_RS19885 |
chain-length determining protein
|
2 same-strain members: A0K_RS07595 A0K_RS19885 |
![]() |
|
Pseudomonas aeruginosa PADK2_CF510 | CF510_05860 |
chain-length determining protein
|
2 same-strain members: CF510_05860 CF510_12922 |
![]() |
|
Pseudomonas aeruginosa PADK2_CF510 | CF510_12922 |
hypothetical protein
|
2 same-strain members: CF510_05860 CF510_12922 |
![]() |
|
Pseudomonas aeruginosa Pae_CF67.10q | ACO71_RS13050 |
hypothetical protein
|
2 same-strain members: ACO71_RS13050 ACO71_RS04005 |
![]() |
|
Pseudomonas aeruginosa Pae_CF67.10q | ACO71_RS04005 |
chain-length determining protein
|
2 same-strain members: ACO71_RS13050 ACO71_RS04005 |
![]() |
|
Pseudomonas aeruginosa Pae_CF67.11p | ACO98_RS13615 |
hypothetical protein
|
2 same-strain members: ACO98_RS13615 ACO98_RS22990 |
![]() |
|
Pseudomonas aeruginosa Pae_CF67.11p | ACO98_RS22990 |
chain-length determining protein
|
2 same-strain members: ACO98_RS13615 ACO98_RS22990 |
![]() |
|
Pseudomonas aeruginosa PAG | DR97_1001 |
hypothetical protein
|
2 same-strain members: DR97_1001 DR97_4776 |
![]() |
|
Pseudomonas aeruginosa PAG | DR97_4776 |
chain-length determining protein
|
2 same-strain members: DR97_1001 DR97_4776 |
![]() |
|
Pseudomonas aeruginosa PAK - Assembly GCF_000408865.1 | PAK_02037 |
chain-length determining protein
|
2 same-strain members: PAK_02037 PAK_04444 |
![]() |
|
Pseudomonas aeruginosa PAK - Assembly GCF_000408865.1 | PAK_04444 |
hypothetical protein
|
2 same-strain members: PAK_02037 PAK_04444 |
![]() |
|
Pseudomonas aeruginosa PAO1-GFP | V563_01029 |
hypothetical protein
|
2 same-strain members: V563_01029 V563_03529 |
![]() |
|
Pseudomonas aeruginosa PAO1-GFP | V563_03529 |
chain-length determining protein
|
2 same-strain members: V563_01029 V563_03529 |
![]() |
|
Pseudomonas aeruginosa PAO1-VE13 - Assembly GCF_000484545.2 | N297_3271 |
chain length determinant family protein
|
2 same-strain members: N297_970 N297_3271 |
![]() |
|
Pseudomonas aeruginosa PAO1-VE13 - Assembly GCF_000484545.2 | N297_970 |
chain length determinant family protein
|
2 same-strain members: N297_970 N297_3271 |
![]() |