In order to generate a more inclusive dataset of Pseudomonas genes mapped to putative in-paralogs and putative orthologs in other Pseudomonas species/strains, we developed a Pseudomonas Orthologous Groups classification system.
To generate ortholog groups, pair-wise DIAMOND searches were run on all genomes in the database to find reciprocal best hits (RBHs) for each gene. These analyses often resulted in multiple candidate genes for RBH status, which were narrowed down by examining the similarity between the query's flanking genes and the hit's flanking genes. If two candidate genes were directly adjacent, they where both accepted as RBHs that involve putative in-parology.
Pairwise intra-genome DIAMOND searches were also performed to acquire in-paralog information (i.e. gene duplications occurring after species divergence). If two genes in one genome were reciprocally more similar to each other than to any gene in the other genomes, the two genes were designated putative in-paralogs. Ortholog groups are built by starting with a seed gene and then adding all genes to which there is a RBH or in-paralog relationship.
Every new gene added to an ortholog group was then treated as a seed gene and the addition process was repeated until all qualifying genes had been added. The result was the development of orthologous groups, specifically generated for Pseudomonas species genomes, which can be used to sort search results.
Strain | Locus Tag | Description | Same-Strain Members | Fragment ? | |
---|---|---|---|---|---|
Pseudomonas aeruginosa CF_PA39 - Assembly GCF_000568235.2 | AX20_RS0110140 |
signal peptide protein
|
2 same-strain members: AX20_RS0110140 AX20_RS0115765 |
![]() |
|
Pseudomonas aeruginosa CF_PA39 - Assembly GCF_000568235.2 | AX20_RS0115765 |
signal peptide protein
|
2 same-strain members: AX20_RS0110140 AX20_RS0115765 |
![]() |
|
Pseudomonas aeruginosa DK2 | PADK2_00425 |
hypothetical protein
|
2 same-strain members: PADK2_00425 PADK2_17380 |
![]() |
|
Pseudomonas aeruginosa DK2 | PADK2_17380 |
hypothetical protein
|
2 same-strain members: PADK2_00425 PADK2_17380 |
![]() |
|
Pseudomonas aeruginosa DSM 50071 - Assembly GCF_001042925.1 | TU83_RS05080 |
type VI secretion system FHA domain-containing protein
|
2 same-strain members: TU83_RS05080 TU83_RS09170 |
![]() |
|
Pseudomonas aeruginosa DSM 50071 - Assembly GCF_001042925.1 | TU83_RS09170 |
type VI secretion system FHA domain-containing protein
|
2 same-strain members: TU83_RS05080 TU83_RS09170 |
![]() |
|
Pseudomonas aeruginosa DSM 50071 - Assembly GCF_001045685.1 | PA50071_RS00425 |
type VI secretion system FHA domain-containing protein
|
2 same-strain members: PA50071_RS00425 PA50071_RS16890 |
![]() |
|
Pseudomonas aeruginosa DSM 50071 - Assembly GCF_001045685.1 | PA50071_RS16890 |
type VI secretion system FHA domain-containing protein
|
2 same-strain members: PA50071_RS00425 PA50071_RS16890 |
![]() |
|
Pseudomonas aeruginosa E2 - Assembly GCF_000482005.1 | P998_02744 |
signal peptide protein
|
2 same-strain members: P998_02744 P998_05040 |
![]() |
|
Pseudomonas aeruginosa E2 - Assembly GCF_000482005.1 | P998_05040 |
signal peptide protein
|
2 same-strain members: P998_02744 P998_05040 |
![]() |
|
Pseudomonas aeruginosa F22031 | F22031_RS12085 |
hypothetical protein
|
2 same-strain members: F22031_RS12085 F22031_RS29310 |
![]() |
|
Pseudomonas aeruginosa F22031 | F22031_RS29310 |
hypothetical protein
|
2 same-strain members: F22031_RS12085 F22031_RS29310 |
![]() |
|
Pseudomonas aeruginosa F9676 | ADJ52_RS09860 |
hypothetical protein
|
2 same-strain members: ADJ52_RS09860 ADJ52_RS26695 |
![]() |
|
Pseudomonas aeruginosa F9676 | ADJ52_RS26695 |
hypothetical protein
|
2 same-strain members: ADJ52_RS09860 ADJ52_RS26695 |
![]() |
|
Pseudomonas aeruginosa FRD1 - Assembly GCF_000829885.1 | EG09_RS01110 |
hypothetical protein
|
2 same-strain members: EG09_RS01110 EG09_RS17515 |
![]() |
|
Pseudomonas aeruginosa FRD1 - Assembly GCF_000829885.1 | EG09_RS17515 |
hypothetical protein
|
2 same-strain members: EG09_RS01110 EG09_RS17515 |
![]() |
|
Pseudomonas aeruginosa FRD1 - Assembly GCF_000950725.1 | UC33_RS15770 |
hypothetical protein
|
2 same-strain members: UC33_RS15770 UC33_RS20740 |
![]() |
|
Pseudomonas aeruginosa FRD1 - Assembly GCF_000950725.1 | UC33_RS20740 |
hypothetical protein
|
2 same-strain members: UC33_RS15770 UC33_RS20740 |
![]() |
|
Pseudomonas aeruginosa H1l | U864_RS0116735 |
type VI secretion system FHA domain-containing protein
|
2 same-strain members: U864_RS0116735 U864_RS0123505 |
![]() |
|
Pseudomonas aeruginosa H1l | U864_RS0123505 |
type VI secretion system FHA domain-containing protein
|
2 same-strain members: U864_RS0116735 U864_RS0123505 |
![]() |