7E7
/jep 132
AGAAGGCGTT AGGGTGATAC TGCAGCTGCG GCAGGGGCAT CCTATGCCAT
CACCACCGTG CTCACTTACC TGATTACCGC CGTGGGCGCC GTGGTTGCGC
TGGGATCGCT CGGCGTCTCC TGGGACAAAC TGCAATGGCT GGCCGCCGGC
CTGACGGTCG GGCTGGGCTT CGGCCTGCAG GCCTCTGCCG AAGCATTGGC
TGAAGAGGTG AACGGCGAAG TCGTGCTGCA GCACCCGGAT AGTGTGCTCG
TCGACGGTAT AGGCGTGGAA CAGATCGAAC TGCATCTGCC CGACGATCTT
GCCCCACTGC GGCATATAAG CCCACAGCAC NCTGTGACGG TGCATCGGCA
CCAGCGCGCG CGACACCGCG CCGGGGTGGC GCAGGATGGC ATAAACAGAT
CGCGCGCTTC CGGGATGGTG ACAGCGGCTG TTTCANGTGA CGCGCGCGTG
CCACTGCCAC GTGGTGAATA AATGCTTTGA TC
The sequence contains several Pst I sites, giving just
AGAAGGCGTT AGGGTGATAC TGCA
That this is the case is confirmed by BLASTX using the composite sequence:
CANCTGTGTA CACCTCCGAG GTGTCAACAG CGCGCTGCGA CCCTGCAGCC GGTCGNTAGC GTCGGAATNT GNTGAGTGGA GCAGATGATG CAATGCGCTG ATGCCCTTGC TGGATCAGAT GACGGTCGCC AGCCAGGCGC CGTAGCGATC GTCGAGCGCC ACGCAGCGTG TTTCGAAGCC CGGCAGCGTG CGGTTGATCA GCACCATGCC CGGGATCTGT TGCATCCAGC CGCAAAGTTC CTGATCGCTC AGCCTTTTGG CGTGCACCAC CAGCGCTTCG CAGCGGTGCC GCACCAGCTG TTCAATCGCC TGGCGCTCTT TTTCGGCGTC GTGGTAACCG TTGCCAATCA GCAGAAAATT ATGGGTGGCG TAGGCCACCT GCTCGACCGC TTTGACCATT GCGCCGAAGA AGGGATCGGA AACGTCGGAG ACGATCAGGC CCAGCGTCTC GGTGGATTGC NNCGCCATCG CCCGTGCGTG GGGAANGNGA NNNNNANNGG AGAAGGCGTT AGGGTGATAC TGCAGCTGCG GCAGGGGCAT CCTATGCCAT CACCACCGTG CTCACTTACC TGATTACCGC CGTGGGCGCC GTGGTTGCGC TGGGATCGCT CGGCGTCTCC TGGGACAAAC TGCAATGGCT GGCCGCCGGC CTGACGGTCG GGCTGGGCTT CGGCCTGCAG GCCTCTGCCG AAGCATTGGC TGAAGAGGTG AACGGCGAAG TCGTGCTGCA GCACCCGGAT AGTGTGCTCG TCGACGGTAT AGGCGTGGAA CAGATCGAAC TGCATCTGCC CGACGATCTT GCCCCACTGC GGCATATAAG CCCACAGCAC NCTGTGACGG TGCATCGGCA CCAGCGCGCG CGACACCGCG CCGGGGTGGC GCAGGATGGC ATAAACAGAT CGCGCGCTTC CGGGATGGTG ACAGCGGCTG TTTCANGTGA CGCGCGCGTG CCACTGCCAC GTGGTGAATA AATGCTTTGA TC that gives (full results here): >gi|22127059|ref|NP_670482.1| (NC_004088) repressor of galETK operon [Yersinia pestis KIM] Length = 363 Score = 221 bits (564), Expect = 2e-57 Identities = 118/166 (71%), Positives = 129/166 (77%), Gaps = 1/166 (0%) Frame = -2 Query: 534 LPQLQYHPNAFSXXXXXPHARAMAXQSTETLGLIVSDVSDPFFGAMVKAVEQVAYATHNF 355 + QLQYHPNA +ARA+A QSTET+G+IVSDVSDPFFGAMVKAVEQVAYAT NF Sbjct: 63 MEQLQYHPNA--------NARALAQQSTETVGMIVSDVSDPFFGAMVKAVEQVAYATGNF 114 Query: 354 LLIGNGYHDAEKERQAIEQLVRHRCEALVVHAKRLSDQELCGWMQQIPGMVLINRTLPGF 175 LLIGNGYHDAEKERQAIEQL+RHRC ALVVHAK+L D EL M+QIPGMVLINRTLPGF Sbjct: 115 LLIGNGYHDAEKERQAIEQLIRHRCAALVVHAKKLPDDELTSLMEQIPGMVLINRTLPGF 174 Query: 174 ETRCVALDDRYGAWLATVI*SSKGIXXXXXX-XXXXXSDAXDRLQG 40 E RC+ALDDRYGAWLAT +G SDA DR+QG Sbjct: 175 EPRCIALDDRYGAWLATRHLIQQGHKRVAFICSNHQISDALDRMQG 220 i.e. the similarity stops abruptly at a position (528) almost exactly corresponding to the predicted Pst I site (524).