In order to generate a more inclusive dataset of Pseudomonas genes mapped to putative in-paralogs and putative orthologs in other Pseudomonas species/strains, we developed a Pseudomonas Orthologous Groups classification system.
To generate ortholog groups, pair-wise DIAMOND searches were run on all genomes in the database to find reciprocal best hits (RBHs) for each gene. These analyses often resulted in multiple candidate genes for RBH status, which were narrowed down by examining the similarity between the query's flanking genes and the hit's flanking genes. If two candidate genes were directly adjacent, they where both accepted as RBHs that involve putative in-parology.
Pairwise intra-genome DIAMOND searches were also performed to acquire in-paralog information (i.e. gene duplications occurring after species divergence). If two genes in one genome were reciprocally more similar to each other than to any gene in the other genomes, the two genes were designated putative in-paralogs. Ortholog groups are built by starting with a seed gene and then adding all genes to which there is a RBH or in-paralog relationship.
Every new gene added to an ortholog group was then treated as a seed gene and the addition process was repeated until all qualifying genes had been added. The result was the development of orthologous groups, specifically generated for Pseudomonas species genomes, which can be used to sort search results.
Strain | Locus Tag | Description | Same-Strain Members | Fragment ? | |
---|---|---|---|---|---|
Pseudomonas aeruginosa Carb01 63 | YQ19_RS11055 |
chain-length determining protein
|
2 same-strain members: YQ19_RS11055 YQ19_RS25030 |
![]() |
|
Pseudomonas aeruginosa Carb01 63 | YQ19_RS25030 |
hypothetical protein
|
2 same-strain members: YQ19_RS11055 YQ19_RS25030 |
![]() |
|
Pseudomonas aeruginosa CF127 - Assembly GCF_000481945.1 | Q001_01092 |
chain-length determining protein
|
2 same-strain members: Q001_01092 Q001_04853 |
![]() |
|
Pseudomonas aeruginosa CF127 - Assembly GCF_000481945.1 | Q001_04853 |
hypothetical protein
|
2 same-strain members: Q001_01092 Q001_04853 |
![]() |
|
Pseudomonas aeruginosa CF18 | Q002_01062 |
chain-length determining protein
|
2 same-strain members: Q002_01062 Q002_03608 |
![]() |
|
Pseudomonas aeruginosa CF18 | Q002_03608 |
hypothetical protein
|
2 same-strain members: Q002_01062 Q002_03608 |
![]() |
|
Pseudomonas aeruginosa CF27 - Assembly GCF_000481905.1 | Q003_04285 |
hypothetical protein
|
2 same-strain members: Q003_01055 Q003_04285 |
![]() |
|
Pseudomonas aeruginosa CF27 - Assembly GCF_000481905.1 | Q003_01055 |
O-antigen chain length regulator
|
2 same-strain members: Q003_01055 Q003_04285 |
![]() |
|
Pseudomonas aeruginosa CF5 - Assembly GCF_000481885.1 | Q004_01137 |
chain-length determining protein
|
2 same-strain members: Q004_01137 Q004_03463 |
![]() |
|
Pseudomonas aeruginosa CF5 - Assembly GCF_000481885.1 | Q004_03463 |
hypothetical protein
|
2 same-strain members: Q004_01137 Q004_03463 |
![]() |
|
Pseudomonas aeruginosa CF614 | Q093_01000 |
chain-length determining protein
|
2 same-strain members: Q093_01000 Q093_04305 |
![]() |
|
Pseudomonas aeruginosa CF614 | Q093_04305 |
hypothetical protein
|
2 same-strain members: Q093_01000 Q093_04305 |
![]() |
|
Pseudomonas aeruginosa CF77 | Q092_00813 |
chain-length determining protein
|
2 same-strain members: Q092_00813 Q092_05159 |
![]() |
|
Pseudomonas aeruginosa CF77 | Q092_05159 |
hypothetical protein
|
2 same-strain members: Q092_00813 Q092_05159 |
![]() |
|
Pseudomonas aeruginosa CF_PA39 - Assembly GCF_000568235.2 | AX20_RS0103330 |
chain-length determining protein
|
2 same-strain members: AX20_RS0103330 AX20_RS0120765 |
![]() |
|
Pseudomonas aeruginosa CF_PA39 - Assembly GCF_000568235.2 | AX20_RS0120765 |
hypothetical protein
|
2 same-strain members: AX20_RS0103330 AX20_RS0120765 |
![]() |
|
Pseudomonas aeruginosa DK2 | PADK2_08540 |
O-antigen chain length regulator
|
2 same-strain members: PADK2_08540 PADK2_21010 |
![]() |
|
Pseudomonas aeruginosa DK2 | PADK2_21010 |
hypothetical protein
|
2 same-strain members: PADK2_08540 PADK2_21010 |
![]() |
|
Pseudomonas aeruginosa DSM 50071 - Assembly GCF_001042925.1 | TU83_RS01325 |
hypothetical protein
|
2 same-strain members: TU83_RS01325 TU83_RS10735 |
![]() |
|
Pseudomonas aeruginosa DSM 50071 - Assembly GCF_001042925.1 | TU83_RS10735 |
chain-length determining protein
|
2 same-strain members: TU83_RS01325 TU83_RS10735 |
![]() |