In order to generate a more inclusive dataset of Pseudomonas genes mapped to putative in-paralogs and putative orthologs in other Pseudomonas species/strains, we developed a Pseudomonas Orthologous Groups classification system.
To generate ortholog groups, pair-wise DIAMOND searches were run on all genomes in the database to find reciprocal best hits (RBHs) for each gene. These analyses often resulted in multiple candidate genes for RBH status, which were narrowed down by examining the similarity between the query's flanking genes and the hit's flanking genes. If two candidate genes were directly adjacent, they where both accepted as RBHs that involve putative in-parology.
Pairwise intra-genome DIAMOND searches were also performed to acquire in-paralog information (i.e. gene duplications occurring after species divergence). If two genes in one genome were reciprocally more similar to each other than to any gene in the other genomes, the two genes were designated putative in-paralogs. Ortholog groups are built by starting with a seed gene and then adding all genes to which there is a RBH or in-paralog relationship.
Every new gene added to an ortholog group was then treated as a seed gene and the addition process was repeated until all qualifying genes had been added. The result was the development of orthologous groups, specifically generated for Pseudomonas species genomes, which can be used to sort search results.
Strain | Locus Tag | Description | Same-Strain Members | Fragment ? | |
---|---|---|---|---|---|
Pseudomonas aeruginosa MSH-10 | L346_01134 |
chain-length determining protein
|
2 same-strain members: L346_01134 L346_03585 |
![]() |
|
Pseudomonas aeruginosa MSH-10 | L346_03585 |
hypothetical protein
|
2 same-strain members: L346_01134 L346_03585 |
![]() |
|
Pseudomonas aeruginosa MSH10 - Assembly GCF_000481965.1 | Q000_01136 |
chain-length determining protein
|
2 same-strain members: Q000_01136 Q000_03583 |
![]() |
|
Pseudomonas aeruginosa MSH10 - Assembly GCF_000481965.1 | Q000_03583 |
hypothetical protein
|
2 same-strain members: Q000_01136 Q000_03583 |
![]() |
|
Pseudomonas aeruginosa MSH3 - Assembly GCF_000481985.1 | P999_00819 |
hypothetical protein
|
2 same-strain members: P999_00819 P999_03269 |
![]() |
|
Pseudomonas aeruginosa MSH3 - Assembly GCF_000481985.1 | P999_03269 |
chain-length determining protein
|
2 same-strain members: P999_00819 P999_03269 |
![]() |
|
Pseudomonas aeruginosa MTB-1 | U769_09120 |
chain-length determining protein
|
2 same-strain members: U769_09120 U769_21100 |
![]() |
|
Pseudomonas aeruginosa MTB-1 | U769_21100 |
hypothetical protein
|
2 same-strain members: U769_09120 U769_21100 |
![]() |
|
Pseudomonas aeruginosa NCAIM B.001380 | K260_RS0111840 |
chain-length determining protein
|
2 same-strain members: K260_RS0120775 K260_RS0111840 |
![]() |
|
Pseudomonas aeruginosa NCAIM B.001380 | K260_RS0120775 |
hypothetical protein
|
2 same-strain members: K260_RS0120775 K260_RS0111840 |
![]() |
|
Pseudomonas aeruginosa NCGM1900 | NCGM1900_RS08830 |
hypothetical protein
|
2 same-strain members: NCGM1900_RS08830 NCGM1900_RS15620 |
![]() |
|
Pseudomonas aeruginosa NCGM1900 | NCGM1900_RS15620 |
O-antigen chain length regulator
|
2 same-strain members: NCGM1900_RS08830 NCGM1900_RS15620 |
![]() |
|
Pseudomonas aeruginosa NCGM1984 | NCGM1984_RS09365 |
O-antigen chain length regulator
|
2 same-strain members: NCGM1984_RS09365 NCGM1984_RS22680 |
![]() |
|
Pseudomonas aeruginosa NCGM1984 | NCGM1984_RS22680 |
hypothetical protein
|
2 same-strain members: NCGM1984_RS09365 NCGM1984_RS22680 |
![]() |
|
Pseudomonas aeruginosa NCGM2.S1 | NCGM2_RS08555 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas aeruginosa P7-L633/96 | D407_RS05995 |
O-antigen chain length regulator
|
2 same-strain members: D407_RS05995 D407_RS18795 |
![]() |
|
Pseudomonas aeruginosa P7-L633/96 | D407_RS18795 |
hypothetical protein
|
2 same-strain members: D407_RS05995 D407_RS18795 |
![]() |
|
Pseudomonas aeruginosa PA14 - Assembly GCF_000404265.1 | CIA_00837 |
hypothetical protein
|
2 same-strain members: CIA_00837 CIA_03195 |
![]() |
|
Pseudomonas aeruginosa PA14 - Assembly GCF_000404265.1 | CIA_03195 |
chain-length determining protein
|
2 same-strain members: CIA_00837 CIA_03195 |
![]() |
|
Pseudomonas aeruginosa PA1R | PA1R_gp4464 |
regulator of length of O-antigen component of lipopolysaccharide chains
|
1 member |
![]() |