In order to generate a more inclusive dataset of Pseudomonas genes mapped to putative in-paralogs and putative orthologs in other Pseudomonas species/strains, we developed a Pseudomonas Orthologous Groups classification system.
To generate ortholog groups, pair-wise DIAMOND searches were run on all genomes in the database to find reciprocal best hits (RBHs) for each gene. These analyses often resulted in multiple candidate genes for RBH status, which were narrowed down by examining the similarity between the query's flanking genes and the hit's flanking genes. If two candidate genes were directly adjacent, they where both accepted as RBHs that involve putative in-parology.
Pairwise intra-genome DIAMOND searches were also performed to acquire in-paralog information (i.e. gene duplications occurring after species divergence). If two genes in one genome were reciprocally more similar to each other than to any gene in the other genomes, the two genes were designated putative in-paralogs. Ortholog groups are built by starting with a seed gene and then adding all genes to which there is a RBH or in-paralog relationship.
Every new gene added to an ortholog group was then treated as a seed gene and the addition process was repeated until all qualifying genes had been added. The result was the development of orthologous groups, specifically generated for Pseudomonas species genomes, which can be used to sort search results.
Strain | Locus Tag | Description | Same-Strain Members | Fragment ? | |
---|---|---|---|---|---|
Pseudomonas fluorescens C1 | VC33_RS09480 |
lipoprotein
|
1 member |
![]() |
|
Pseudomonas fluorescens C2 | NL64_RS11045 |
lipoprotein
|
1 member |
![]() |
|
Pseudomonas fluorescens C3 | VC34_RS23605 |
lipoprotein
|
1 member |
![]() |
|
Pseudomonas fluorescens EGD-AQ6 | O204_RS133500 |
rare lipoprotein A
|
1 member |
![]() |
|
Pseudomonas fluorescens F113 | PSF113_5155 |
Rare lipoprotein A
|
1 member |
![]() |
|
Pseudomonas fluorescens MEP34 | RU10_RS24655 |
RlpA family lipoprotein
|
1 member |
![]() |
|
Pseudomonas fluorescens PA4C2 | P909_RS17295 |
lipoprotein, RlpA family
|
1 member |
![]() |
|
Pseudomonas fluorescens PCL1751 | PF1751_RS24420 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas fluorescens Pf0-1 | Pfl01_4967 |
rare lipoprotein A
|
1 member |
![]() |
|
Pseudomonas fluorescens PICF7 | PFLUOLIPICF7_RS11700 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas fluorescens Q2-87 | PflQ2_4828 |
lipoprotein, RlpA family
|
1 member |
![]() |
|
Pseudomonas fluorescens R124 | I1A_004576 |
rare lipoprotein A
|
1 member |
![]() |
|
Pseudomonas fluorescens SF39a | NX10_RS17015 |
lipoprotein
|
1 member |
![]() |
|
Pseudomonas fluorescens SF4c | QS95_RS23820 |
hypothetical protein
|
1 member |
![]() |
|
Pseudomonas fluorescens SS101 | PflSS101_4767 |
lipoprotein, RlpA family
|
1 member |
![]() |
|
Pseudomonas fluorescens UK4 | HZ99_RS13245 |
lipoprotein
|
1 member |
![]() |
|
Pseudomonas frederiksbergensis SI8 - Assembly GCF_000802155.2 | JZ00_RS27030 |
lipoprotein
|
1 member |
![]() |
|
Pseudomonas fulva 12-X | Psefu_3623 |
rare lipoprotein A
|
1 member |
![]() |
|
Pseudomonas fulva NBRC 16636 = DSM 17004 - Assembly GCF_000621265.1 | Q382_RS0109310 |
lipoprotein
|
1 member |
![]() |
|
Pseudomonas fulva NBRC 16636 = DSM 17004 - Assembly GCF_000730565.1 | PFU01S_RS08595 |
lipoprotein
|
1 member |
![]() |