To generate a more inclusive dataset of Pseudomonas genes mapped to putative in-paralogs and putative orthologs in other Pseudomonas species and strains, we developed a Pseudomonas Orthologous Groups classification system.
To generate ortholog groups, pairwise DIAMOND searches were run on all genomes in the database to find reciprocal best hits (RBHs) for each gene. These searches often yielded multiple candidate genes for RBH status; the candidates were narrowed down by comparing the query's flanking genes with the hit's flanking genes. If two candidate genes were directly adjacent, they were both accepted as RBHs that involve putative in-paralogy.
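The reciprocal-best-hit step can be illustrated with a minimal sketch. It assumes the search results for one pair of genomes have already been reduced to `(query, subject, bitscore)` tuples; the function names and the toy hits are hypothetical and are not taken from the database's own pipeline. Ties on the top score are kept as multiple candidates, and the flanking-gene comparison described above is not implemented here.

```python
from collections import defaultdict


def best_hits(hits):
    """Map each query gene to the set of subjects sharing its top bitscore."""
    top_score = {}
    best = defaultdict(set)
    for query, subject, bitscore in hits:
        if query not in top_score or bitscore > top_score[query]:
            # New best score for this query: discard earlier candidates.
            top_score[query] = bitscore
            best[query] = {subject}
        elif bitscore == top_score[query]:
            # Tie: keep every subject as a candidate.
            best[query].add(subject)
    return best


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return sorted (gene_a, gene_b) pairs that are each other's best hit."""
    best_ab = best_hits(a_vs_b)
    best_ba = best_hits(b_vs_a)
    return sorted(
        (a, b)
        for a, subjects in best_ab.items()
        for b in subjects
        if a in best_ba.get(b, set())
    )


# Made-up hits for two hypothetical genomes A and B.
a_vs_b = [("a1", "b1", 250.0), ("a1", "b2", 90.0), ("a2", "b2", 180.0)]
b_vs_a = [("b1", "a1", 248.0), ("b2", "a2", 181.0), ("b2", "a1", 88.0)]

pairs = reciprocal_best_hits(a_vs_b, b_vs_a)
print(pairs)
```

When a query has several equally scored candidates, this sketch returns all of them; a fuller implementation would apply the flanking-gene and adjacency checks to decide which to keep.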
Pairwise intra-genome DIAMOND searches were also performed to acquire in-paralog information (i.e., gene duplications occurring after species divergence). If two genes in one genome were reciprocally more similar to each other than to any gene in the other genomes, the two genes were designated putative in-paralogs. Ortholog groups were built by starting with a seed gene and then adding all genes to which it had an RBH or in-paralog relationship.
Every new gene added to an ortholog group was then treated as a seed gene, and the addition process was repeated until all qualifying genes had been added. The result was a set of orthologous groups, generated specifically for Pseudomonas species genomes, which can be used to sort search results.
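The seed-and-expand procedure described above amounts to collecting everything reachable from a seed through RBH or in-paralog links. The following is a minimal sketch under that reading, using a breadth-first traversal; the function name and gene identifiers are hypothetical, and the input is assumed to be a plain list of gene pairs.

```python
from collections import defaultdict, deque


def build_ortholog_groups(pairs):
    """Group genes linked, directly or transitively, by the given pairs."""
    neighbours = defaultdict(set)
    for gene_a, gene_b in pairs:
        neighbours[gene_a].add(gene_b)
        neighbours[gene_b].add(gene_a)

    groups = []
    assigned = set()
    for seed in sorted(neighbours):
        if seed in assigned:
            continue
        # Start a new group from this seed and expand it.
        group = {seed}
        queue = deque([seed])
        while queue:
            gene = queue.popleft()
            # Each newly added gene acts as a seed in turn.
            for other in neighbours[gene]:
                if other not in group:
                    group.add(other)
                    queue.append(other)
        assigned |= group
        groups.append(sorted(group))
    return groups


# Made-up relationships between genes of hypothetical genomes A, B and C.
rbh_pairs = [("A_g1", "B_g7"), ("B_g7", "C_g3"), ("A_g2", "C_g9")]
in_paralog_pairs = [("C_g3", "C_g4")]

groups = build_ortholog_groups(rbh_pairs + in_paralog_pairs)
print(groups)
```

Genes that have no RBH or in-paralog relationship never appear in the input pairs, so this sketch does not emit single-gene groups for them.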
Strain | Locus Tag | Description | Same-Strain Members | Fragment ? |
---|---|---|---|---|
Pseudomonas oryzihabitans NBRC 102199 | POR01S_RS07820 | hypothetical protein | 1 member | |
Pseudomonas oryzihabitans RIT370 | UM91_RS05005 | hypothetical protein | 1 member | |
Pseudomonas otitidis LNU-E-001 | CR65_RS0111540 | hypothetical protein | 1 member | |
Pseudomonas parafulva CRS01-1 | NJ69_RS18815 | hypothetical protein | 1 member | |
Pseudomonas parafulva NBRC 16636 = DSM 17004 - Assembly GCF_000425765.1 | H619_RS0109485 | hypothetical protein | 1 member | |
Pseudomonas parafulva NBRC 16636 = DSM 17004 - Assembly GCF_000730645.1 | PPA02S_RS01960 | hypothetical protein | 1 member | |
Pseudomonas parafulva YAB-1 | XB13_RS00420 | hypothetical protein | 1 member | |
Pseudomonas plecoglossicida NB2011 | L321_17382 | tetratricopeptide domain-containing protein | 1 member | |
Pseudomonas plecoglossicida NBRC 103162 = DSM 15088 assembly GCF_000688275.1 | Q378_RS0113605 | tetratricopeptide domain-containing protein | 1 member | |
Pseudomonas plecoglossicida NBRC 103162 = DSM 15088 assembly GCF_000730665.1 | PPL01S_RS04595 | tetratricopeptide domain-containing protein | 1 member | |
Pseudomonas plecoglossicida NyZ12 | RK21_RS24715 | hypothetical protein | 1 member | |
Pseudomonas poae RE*1-1-14 | H045_21185 | hypothetical protein | 1 member | |
Pseudomonas protegens CHA0 - Assembly GCF_000397205.1 | PFLCHA0_c00190 | TPR repeat-containing protein | 1 member | |
Pseudomonas psychrotolerans L19 | PPL19_08906 | hypothetical protein | 1 member | |
Pseudomonas putida BIRD-1 | PPUBIRD1_0093 | Tetratricopeptide domain-containing protein | 1 member | |
Pseudomonas putida DLL-E4 | DW66_RS00185 | hypothetical protein | 1 member | |
Pseudomonas putida GB-1 | PputGB1_0080 | hypothetical protein | 1 member | |
Pseudomonas putida H | AC138_RS26465 | hypothetical protein | 1 member | |
Pseudomonas putida H8234 | L483_32380 | hypothetical protein | 1 member | |
Pseudomonas putida HB3267 | B479_00650 | hypothetical protein | 1 member | |