7F1
JEP131 521 bp JEP132 532 bp Composite sequence: AGGGTTNNTG ATTCAAGACN TGNAGTAGCA AGGTGGTACC GNGATNCGTC NGCACGGTGG NNACGCATTC CGNGCGAAGA NCAACGGGNG NCTCNTATGA AGCGTTCANN GNNNANTTTC CCNCGGCCGG AAAGCGATCG CCAGGCGCTG ATCGCGCTGC ATACGGGTGG AGAAAGATTA TCTNCCGGAG ATGACGCTGG AGCAGAAGGT NGAGTGGCTC GATNCCCACA AGCTACACCC AGTTCNTGCG TGAAAAAGTG GGCNTGAGCG AGCTGGCGAT CCGGTATTTC CAGCAGACCA CCAACGACTT CCAGNCGGTG GGCATCGACG GCACCTCGTG CAGCGATGCG CGCATCTGCG ATCTGCCGGG CCTCGGCGGC ATGAATCTGC CGCCGCTGGA CGAAGAGTCG CAGGCGGATC TCGACGATCC TTACGTGTTC CACTTCCCGG ACGGCAACGC CACGCTGGCG CGCCTGATGG TGCGCAAGCA TGATCCCGCG GAGCGCCGGC CGGCAAAGAC A GGCCAAATTC GACTACAGCC AGCTCGATCG GCCGGAGTCG CCGGTGAAGC TGCGTTTAAA CAGCACCGGG CTGCACGCGG CCAACGTCGG CGACAAGGTG GAAGTCACCT ATATGACCGG CGAGAAGATG ACCAAAGTGC GCGCCGGGCA GGTGGTGATG GCCGGCTACA ACATGATGAT CCCGTACCTG GTGCCGGAGA TGTCGCACGA GCAGCAGGAG GCGCTGAAGC AGAACGTCAA GGCGCCGCTG GTGTACAGCA AGGTGGTGAT CCGCAACTGG CAGCCGTTCA TTAAGCTGGG CGTGCACGAA ATTTATTCGC CGGCCGCGCC GTACAGCCGC GTGAAGCTGG ACTACCCAGT GAGCATGGGC GGGTACCAGC ACCCGCGCGA TCCGGATCGG CAATCGGCCT GCATATGGTG TCGTGCCGAC GCTGCGGCAG CGGCTCAGCC CGCGCGAGCA GTCGCGCAAA GGGCCGCGCT GTTGTTGGCA CCCGTTCANG TGACAACAGA TGTCCGGACA ATTGAGGCAT NT
BLASTX with this sequence gives only two hits (full results here):

>gi|11349969|pir||A83182 hypothetical protein PA3713 [imported] - Pseudomonas aeruginosa (strain PAO1)
Length = 620

Score = 157 bits (397), Expect(2) = 4e-57
Identities = 73/131 (55%), Positives = 95/131 (72%)
Frame = +1

Query: 523 AKFDYSQLDRPESPVKLRLNSTGLHAANVGDKVEVTYMTGEKMTKVRAGQVVMAGYNMMI 702
           A+FDYS+LD PV+LRLNST + N V+V Y ++ +VR VMA YNMM+
Sbjct: 359 ARFDYSKLDLAGHPVRLRLNSTAVSVRNRAGGVDVGYSRAGRLHRVRGKHCVMACYNMMV 418

Query: 703 PYLVPEMSHEQQEALKQNVKAPLVYSKVVIRNWQPFIKLGVHEIYSPAAPYSRVKLDYPV 882
           PYL+ ++S EQ AL QNVK PLVY+KV++RNWQ + LG+HEIY+P PYSR+KLD+PV
Sbjct: 419 PYLLRDLSEEQAHALSQNVKFPLVYTKVLLRNWQAWKTLGIHEIYAPTLPYSRIKLDFPV 478

Query: 883 SMGGYQHPRDP 915
           +G Y+HPRDP
Sbjct: 479 DLGSYRHPRDP 489

Score = 85.9 bits (211), Expect(2) = 4e-57
Identities = 49/141 (34%), Positives = 67/141 (47%)
Frame = +3

Query: 78  RXTGXSYEAFXXXFPXAGKRSPGADRAAYGXXXXXXXXXXXXXXXXXXXPTSYTQFXREK 257
           R S+ AF FP + + A A Y TSY + +
Sbjct: 207 RLNARSWRAFIGDFPLS-REDREALIALYESPRDYLAGKSVEEKETYLAKTSYRDYLLKN 265

Query: 258 VGXSELAIRYFQQTTNDFQXVGIDGTSCSDARICDXXXXXXXXXXXXDEESQADLDDPYV 437
           VG SE +++YFQ +NDF +G D +DA EE+QA++D+PY+
Sbjct: 266 VGLSETSVKYFQGRSNDFSALGADALPAADAYAAGFPGFDALGLPQPSEEAQAEMDEPYI 325

Query: 438 FHFPDGNATLARLMVRKHDPA 500
           +HFPDGNA+LARLMVR PA
Sbjct: 326 YHFPDGNASLARLMVRDLIPA 346

>gi|22977131|gb|ZP_00022954.1| hypothetical protein [Ralstonia metallidurans]
Length = 712

Score = 97.4 bits (241), Expect(2) = 4e-24
Identities = 53/134 (39%), Positives = 80/134 (59%), Gaps = 4/134 (2%)
Frame = +1

Query: 484 ASMIPRSAGRQRQ----AKFDYSQLDRPESPVKLRLNSTGLHAANVGDKVEVTYMTGEKM 651
           + ++P A Q Q A+FDYS+LDR +PV++RLNST + VTY +
Sbjct: 439 SKLVPGVAAAQGQEIVGARFDYSKLDREGNPVRIRLNSTAIAIEPDVRITRVTYGWQGNL 498

Query: 652 TKVRAGQVVMAGYNMMIPYLVPEMSHEQQEALKQNVKAPLVYSKVVIRNWQPFIKLGVHE 831
           +V A VV+AGY+MM+P+++P + + AL+ VKAPLVY+KV + NW+ F L
Sbjct: 499 QRVEAAHVVVAGYDMMVPFILPSLPVPAKAALRACVKAPLVYTKVALDNWRAFDALKTWR 558

Query: 832 IYSPAAPYSRVKLD 873
           I++P YS + L+
Sbjct: 559 IHAPTMAYSDLWLE 572

Score = 35.4 bits (80), Expect(2) = 4e-24
Identities = 25/90 (27%), Positives = 34/90 (37%)
Frame = +3

Query: 228 TSYTQFXREKVGXSELAIRYFQQTTNDFQXVGIDGTSCSDARICDXXXXXXXXXXXXDEE 407
           T Y F REK G +R+ + + D + DG + + A D
Sbjct: 356 TRYAVFLREKAGLGLPGLRFLRSRSQDAYALDADGITVAQAISIGLPAGAGMAAPAIDP- 414

Query: 408 SQADLDDPYVFHFPDGNATLARLMVRKHDP 497
           + FPDGNATLAR + K P
Sbjct: 415 -RLGKSPASRLWFPDGNATLARALASKLVP 443
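The Frame and Query coordinates in the BLASTX report can be checked by hand. The following is a minimal sketch, not the BLASTX implementation: it assumes the standard genetic code and, as a simplification, reports any codon containing a base other than A, C, G or T (such as the N calls in the composite sequence) as X. The function names translate and frame_of are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here;
# the page does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, frame=1):
    """Translate forward frame 1, 2 or 3 of a DNA string.

    Spaces are ignored, so 10-base blocks can be pasted directly.
    Codons containing anything other than A, C, G, T become 'X'.
    """
    if frame not in (1, 2, 3):
        raise ValueError("frame must be 1, 2 or 3")
    seq = dna.upper().replace(" ", "")
    codons = (seq[i:i + 3] for i in range(frame - 1, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def frame_of(position):
    """Forward reading frame (1-3) of a 1-based query start position."""
    return (position - 1) % 3 + 1


# Bases 523-552 of the composite sequence, copied from the page.
excerpt = "GCCAAATTCGACTACAGCCAGCTCGATCGG"
print(translate(excerpt))   # AKFDYSQLDR
print(frame_of(523))        # 1
print(frame_of(78))         # 3
```

The translated excerpt matches the start of the first Query line (AKFDYSQLDR), and the start positions 523 and 78 fall in frames +1 and +3, as the report states.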
The transposon is thus inserted within a gene that encodes a homologue of hypothetical protein PA3713.
Direct homologues of this protein appear to be absent from other sequenced bacteria (apart from R. metallidurans; BLASTP results here). The most similar proteins are protoporphyrinogen oxidases, but the similarity is restricted almost entirely to an NAD-binding site (residues 76-106; see the result from the ProfileScan Server), which is also found, for example, in Thi4 domains (see here).
The protein in fact contains an amino oxidase domain (residues 83-620; see the entry for PA3713 at Pfam).
Protoporphyrinogen oxidases are protoheme biosynthetic enzymes. See, for example, Hansson & Hederstedt (1992).
If this protein is indeed a protoporphyrinogen oxidase, STRING analysis suggests that it would be a hemY orthologue (see colour codes).
However, the current (09/02) Pseudomonas aeruginosa Community annotation lists this gene as being of unknown function. Its genomic context is clearly distinct from that of hemY: the downstream gene, PA3714 (see also SWISSPROT/TrEMBL entry Q9HXS7), is a probable two-component response regulator.