This program reads an ASCII gene sequence file, finds the number of nucleotide bases (A, T, C, G), prints the complementary DNA sequence, translates exons into protein, and searches for restriction sites. Enter name of file containing gene sequence data: Base Count ... Total: 6170 A: 1565 C: 1518 G: 1511 T: 1576 The complementary sequence from 5' to 3': 1 ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat 61 aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg 121 tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat 181 gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat 241 tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt 301 aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag 361 cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa 421 agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg 481 ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct 541 tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac 601 tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca 661 caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat 721 accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact 781 attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc 841 ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga 901 taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg 961 taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg 1021 aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca 1081 agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta 1141 ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca 1201 ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg 1261 cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga 1321 tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa 1381 tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc 1441 tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg 1501 tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac 1561 ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct 1621 acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc 1681 ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg 1741 gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg 1801 ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct 1861 ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga 1921 taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg 1981 cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca 2041 tctgtgcggt atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc 2101 atagttaagc cagtatacac tccgctatcg ctacgtgact gggtcatggc tgcgccccga 2161 cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac 2221 agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 2281 aaacgcgcga ggcagctgcg gtaaagctca tcagcgtggt cgtgaagcga ttcacagatg 2341 tctgcctgtt catccgcgtc cagctcgttg agtttctcca gaagcgttaa tgtctggctt 2401 ctgataaagc gggccatgtt aagggcggtt ttttcctgtt tggtcactga tgcctccgtg 2461 taagggggat ttctgttcat gggggtaatg ataccgatga aacgagagag gatgctcacg 2521 atacgggtta ctgatgatga acatgcccgg ttactggaac gttgtgaggg taaacaactg 2581 gcggtatgga tgcggcggga ccagagaaaa atcactcagg gtcaatgcca gcgcttcgtt 2641 aatacagatg taggtgttcc acagggtagc cagcagcatc ctgcgatgca gatccggaac 2701 ataatggtgc agggcgctga cttccgcgtt tccagacttt acgaaacacg gaaaccgaag 2761 accattcatg ttgttgctca ggtcgcagac gttttgcagc agcagtcgct tcacgttcgc 2821 tcgcgtatcg gtgattcatt ctgctaacca gtaaggcaac cccgccagcc tagccgggtc 2881 ctcaacgaca ggagcacgat catgcgcacc cgtggccagg acccaacgct gcccgagatg 2941 cgccgcgtgc ggctgctgga gatggcggac gcgatggata tgttctgcca agggttggtt 3001 tgcgcattca cagttctccg caagaattga ttggctccaa ttcttggagt ggtgaatccg 3061 ttagcgaggt gccgccggct tccattcagg tcgaggtggc ccggctccat gcaccgcgac 3121 gcaacgcggg gaggcagaca aggtataggg cggcgcctac aatccatgcc aacccgttcc 3181 atgtgctcgc cgaggcggca taaatcgccg tgacgatcag cggtccagtg atcgaagtta 3241 ggctggtaag agccgcgagc gatccttgaa gctgtccctg atggtcgtca tctacctgcc 3301 tggacagcat ggcctgcaac gcgggcatcc cgatgccgcc ggaagcgaga agaatcataa 3361 tggggaaggc catccagcct cgcgtcgcga acgccagcaa gacgtagccc agcgcgtcgg 3421 ccgccatgcc ggcgataatg gcctgcttct cgccgaaacg tttggtggcg ggaccagtga 3481 cgaaggcttg agcgagggcg tgcaagattc cgaataccgc aagcgacagg ccgatcatcg 3541 tcgcgctcca gcgaaagcgg tcctcgccga aaatgaccca gagcgctgcc ggcacctgtc 3601 ctacgagttg catgataaag aagacagtca taagtgcggc gacgatagtc atgccccgcg 3661 cccaccggaa ggagctgact gggttgaagg ctctcaaggg catcggtcga cgctctccct 3721 tatgcgactc ctgcattagg aagcagccca gtagtaggtt gaggccgttg agcaccgccg 3781 ccgcaaggaa tggtgcatgc aaggagatgg cgcccaacag tcccccggcc acggggcctg 3841 ccaccatacc cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc 3901 catcggtgat gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg 3961 ccacgatgcg tccggcgtag aggatccaca ggacgggtgt ggtcgccatg atcgcgtagt 4021 cgataaatta ttgagataaa tgtcaaactc ccactacttt tgagatataa cactaagtgt 4081 gtaatgactt aattttaaat tgtcttctta tgagtagttt agttacatga atagcatttt 4141 gactgattta catcttggct tgtggtttct taagaatttc tctaattact ttagcatcaa 4201 ttttaccagt tagaccttta ggtacttcat ccacaaaacg aacgccacca cgcagacgtt 4261 tgtggttcac tacttgacta ttaacataat ccacaatttc cttttcagtc atagtttttc 4321 ctttttccat tacaactaca gcccctggaa gttcgccagc atcgggatcg gggacaccag 4381 ccacaccagc atcaaatata tttggatgtt gcaaaagaac ggattccaat tcagcaggtg 4441 gtacctggta ccccttgtat ttgattaatg atttcaaacg atcgacaatg aagaaatgtt 4501 cgtcttcgtc gtaatatcca atatctcctg tgtgcaacca accctcttca tcaatagttt 4561 ctcttgttgc ttccggattg ttcgagtagc ctaacataag actgggtcct tttacacaga 4621 tctctcctcg tcggttgaca cccaaagttt ttttagtgtc aagatcaata acttttactt 4681 tgaataaggg tactactttt ccagatgctc caggtttatc atcaccttct ggggtaataa 4741 taaatgcaga tgtcgtttct gttaatccgt aaccctgacg gacaccgggt agattaaatc 4801 ttctagcgac tgcttcgcca acttcttttg ccaaaggagc tccaccagaa gcaatttcag 4861 ttagattaga taaatcgaac ttatcgatca attcactctt gttgagaata gcaaataacg 4921 ttggtaccag aataacactg gtacacttat aatcttgcaa agttctcaaa aatagttctt 4981 catcaaattt tgttaacatt acaacacggt atccacaagc aaagtatcct aaagtggtaa 5041 acattccaaa tccatgatgg aacggaacga cagttaaaat agcagtacca ggtgaaactt 5101 ggtttccgta aattggatcc ttagcgtgtg agaatcttgt aactgcacct tcgtgggtaa 5161 ttcgtacacc tttaggtaaa ccagtagagc cagaagaatt cataagcaaa gcaacgtgtt 5221 gtttacggtt ctttacatca atgggtacaa agctacttgg ttgaaaacct aattctacat 5281 gtttcttaat aaaagtttcc atacaatcgt ggcccccaaa gtttacttta ctatctaaaa 5341 taacaatttt tttgatgcat gtaactgttt tttgcacttc taaaacttta ggtaagcctt 5401 ttctggagct gaatacaata gttggttgtg cgatgcccaa actatgatta agttcacgca 5461 atgtgtaaat ttcattagta ggtgcaacag ctaccccaat gtaaagacca gcaagtacag 5521 ggatgaaaaa ttcttcacaa ttttcactgc acaaagcaat atgttcttcc ggtttcatac 5581 caaagttttt catggcctca gctaaacgac atgtaatatc aaagtattct tggtaagaaa 5641 tgtcaactcc agtaagggcg ttactaaaag caattgctcc aagtttggca tattgatgca 5701 tgtacttatg caactgaatt ccagctgatc cttcttcaat ggggtagaat ggcagagggc 5761 catatacaac attctcctcc ttttccattt ccatttttca aaaacttttc ctctaccgtt 5821 tagggacttc gatagtggct ccaagtagcg aagcgagcag gactgggcgg cggccaaagc 5881 ggtcggacag tgctccgaga acgggtgcgc atagaaattg catcaacgca tatagcgcta 5941 gcagcacgcc atagtgactg gcgatgctgt cggaatggac gatatcccgc aagaggcccg 6001 gcagtaccgg cataaccaag cctatgccta cagcatccag ggtgacggtg ccgaggatga 6061 cgatgagcgc attgttagat ttcatacacg gtgcctgact gcgttagcaa tttaactgtg 6121 ataaactacc gcattaaagc ttatcgatga taagctgtca aacatgagaa ======================================================= Want to extract exon and translate? (Y/n) Enter Exon segment (start bp followed by end bp) ... (Enter start=0 to indicate no more.) (Enter end=99999 to indicate end-of-file.) Start: End: Start: Base Count in Mature Messenger RNA ... Total: 6170 A: 1565 C: 1518 G: 1511 T: 1576 The mature mRNA sequence from 5' to 3': 1 uucucauguu ugacagcuua ucaucgauaa gcuuuaaugc gguaguuuau cacaguuaaa 61 uugcuaacgc agucaggcac cguguaugaa aucuaacaau gcgcucaucg ucauccucgg 121 caccgucacc cuggaugcug uaggcauagg cuugguuaug ccgguacugc cgggccucuu 181 gcgggauauc guccauuccg acagcaucgc cagucacuau ggcgugcugc uagcgcuaua 241 ugcguugaug caauuucuau gcgcacccgu ucucggagca cuguccgacc gcuuuggccg 301 ccgcccaguc cugcucgcuu cgcuacuugg agccacuauc gaagucccua aacgguagag 361 gaaaaguuuu ugaaaaaugg aaauggaaaa ggaggagaau guuguauaug gcccucugcc 421 auucuacccc auugaagaag gaucagcugg aauucaguug cauaaguaca ugcaucaaua 481 ugccaaacuu ggagcaauug cuuuuaguaa cgcccuuacu ggaguugaca uuucuuacca 541 agaauacuuu gauauuacau gucguuuagc ugaggccaug aaaaacuuug guaugaaacc 601 ggaagaacau auugcuuugu gcagugaaaa uugugaagaa uuuuucaucc cuguacuugc 661 uggucuuuac auugggguag cuguugcacc uacuaaugaa auuuacacau ugcgugaacu 721 uaaucauagu uugggcaucg cacaaccaac uauuguauuc agcuccagaa aaggcuuacc 781 uaaaguuuua gaagugcaaa aaacaguuac augcaucaaa aaaauuguua uuuuagauag 841 uaaaguaaac uuugggggcc acgauuguau ggaaacuuuu auuaagaaac auguagaauu 901 agguuuucaa ccaaguagcu uuguacccau ugauguaaag aaccguaaac aacacguugc 961 uuugcuuaug aauucuucug gcucuacugg uuuaccuaaa gguguacgaa uuacccacga 1021 aggugcaguu acaagauucu cacacgcuaa ggauccaauu uacggaaacc aaguuucacc 1081 ugguacugcu auuuuaacug ucguuccguu ccaucaugga uuuggaaugu uuaccacuuu 1141 aggauacuuu gcuuguggau accguguugu aauguuaaca aaauuugaug aagaacuauu 1201 uuugagaacu uugcaagauu auaaguguac caguguuauu cugguaccaa cguuauuugc 1261 uauucucaac aagagugaau ugaucgauaa guucgauuua ucuaaucuaa cugaaauugc 1321 uucuggugga gcuccuuugg caaaagaagu uggcgaagca gucgcuagaa gauuuaaucu 1381 acccgguguc cgucaggguu acggauuaac agaaacgaca ucugcauuua uuauuacccc 1441 agaaggugau gauaaaccug gagcaucugg aaaaguagua cccuuauuca aaguaaaagu 1501 uauugaucuu gacacuaaaa aaacuuuggg ugucaaccga cgaggagaga ucuguguaaa 1561 aggacccagu cuuauguuag gcuacucgaa caauccggaa gcaacaagag aaacuauuga 1621 ugaagagggu ugguugcaca caggagauau uggauauuac gacgaagacg aacauuucuu 1681 cauugucgau cguuugaaau cauuaaucaa auacaagggg uaccagguac caccugcuga 1741 auuggaaucc guucuuuugc aacauccaaa uauauuugau gcuggugugg cugguguccc 1801 cgaucccgau gcuggcgaac uuccaggggc uguaguugua auggaaaaag gaaaaacuau 1861 gacugaaaag gaaauugugg auuauguuaa uagucaagua gugaaccaca aacgucugcg 1921 ugguggcguu cguuuugugg augaaguacc uaaaggucua acugguaaaa uugaugcuaa 1981 aguaauuaga gaaauucuua agaaaccaca agccaagaug uaaaucaguc aaaaugcuau 2041 ucauguaacu aaacuacuca uaagaagaca auuuaaaauu aagucauuac acacuuagug 2101 uuauaucuca aaaguagugg gaguuugaca uuuaucucaa uaauuuaucg acuacgcgau 2161 cauggcgacc acacccgucc uguggauccu cuacgccgga cgcaucgugg ccggcaucac 2221 cggcgccaca ggugcgguug cuggcgccua uaucgccgac aucaccgaug gggaagaucg 2281 ggcucgccac uucgggcuca ugagcgcuug uuucggcgug gguauggugg caggccccgu 2341 ggccggggga cuguugggcg ccaucuccuu gcaugcacca uuccuugcgg cggcggugcu 2401 caacggccuc aaccuacuac ugggcugcuu ccuaaugcag gagucgcaua agggagagcg 2461 ucgaccgaug cccuugagag ccuucaaccc agucagcucc uuccgguggg cgcggggcau 2521 gacuaucguc gccgcacuua ugacugucuu cuuuaucaug caacucguag gacaggugcc 2581 ggcagcgcuc ugggucauuu ucggcgagga ccgcuuucgc uggagcgcga cgaugaucgg 2641 ccugucgcuu gcgguauucg gaaucuugca cgcccucgcu caagccuucg ucacuggucc 2701 cgccaccaaa cguuucggcg agaagcaggc cauuaucgcc ggcauggcgg ccgacgcgcu 2761 gggcuacguc uugcuggcgu ucgcgacgcg aggcuggaug gccuucccca uuaugauucu 2821 ucucgcuucc ggcggcaucg ggaugcccgc guugcaggcc augcugucca ggcagguaga 2881 ugacgaccau cagggacagc uucaaggauc gcucgcggcu cuuaccagcc uaacuucgau 2941 cacuggaccg cugaucguca cggcgauuua ugccgccucg gcgagcacau ggaacggguu 3001 ggcauggauu guaggcgccg cccuauaccu ugucugccuc cccgcguugc gucgcggugc 3061 auggagccgg gccaccucga ccugaaugga agccggcggc accucgcuaa cggauucacc 3121 acuccaagaa uuggagccaa ucaauucuug cggagaacug ugaaugcgca aaccaacccu 3181 uggcagaaca uauccaucgc guccgccauc uccagcagcc gcacgcggcg caucucgggc 3241 agcguugggu ccuggccacg ggugcgcaug aucgugcucc ugucguugag gacccggcua 3301 ggcuggcggg guugccuuac ugguuagcag aaugaaucac cgauacgcga gcgaacguga 3361 agcgacugcu gcugcaaaac gucugcgacc ugagcaacaa caugaauggu cuucgguuuc 3421 cguguuucgu aaagucugga aacgcggaag ucagcgcccu gcaccauuau guuccggauc 3481 ugcaucgcag gaugcugcug gcuacccugu ggaacaccua caucuguauu aacgaagcgc 3541 uggcauugac ccugagugau uuuucucugg ucccgccgca uccauaccgc caguuguuua 3601 cccucacaac guuccaguaa ccgggcaugu ucaucaucag uaacccguau cgugagcauc 3661 cucucucguu ucaucgguau cauuaccccc augaacagaa aucccccuua cacggaggca 3721 ucagugacca aacaggaaaa aaccgcccuu aacauggccc gcuuuaucag aagccagaca 3781 uuaacgcuuc uggagaaacu caacgagcug gacgcggaug aacaggcaga caucugugaa 3841 ucgcuucacg accacgcuga ugagcuuuac cgcagcugcc ucgcgcguuu cggugaugac 3901 ggugaaaacc ucugacacau gcagcucccg gagacgguca cagcuugucu guaagcggau 3961 gccgggagca gacaagcccg ucagggcgcg ucagcgggug uuggcgggug ucggggcgca 4021 gccaugaccc agucacguag cgauagcgga guguauacug gcuuaacuau gcggcaucag 4081 agcagauugu acugagagug caccauaugc ggugugaaau accgcacaga ugcguaagga 4141 gaaaauaccg caucaggcgc ucuuccgcuu ccucgcucac ugacucgcug cgcucggucg 4201 uucggcugcg gcgagcggua ucagcucacu caaaggcggu aauacgguua uccacagaau 4261 caggggauaa cgcaggaaag aacaugugag caaaaggcca gcaaaaggcc aggaaccgua 4321 aaaaggccgc guugcuggcg uuuuuccaua ggcuccgccc cccugacgag caucacaaaa 4381 aucgacgcuc aagucagagg uggcgaaacc cgacaggacu auaaagauac caggcguuuc 4441 ccccuggaag cucccucgug cgcucuccug uuccgacccu gccgcuuacc ggauaccugu 4501 ccgccuuucu cccuucggga agcguggcgc uuucucauag cucacgcugu agguaucuca 4561 guucggugua ggucguucgc uccaagcugg gcugugugca cgaacccccc guucagcccg 4621 accgcugcgc cuuauccggu aacuaucguc uugaguccaa cccgguaaga cacgacuuau 4681 cgccacuggc agcagccacu gguaacagga uuagcagagc gagguaugua ggcggugcua 4741 cagaguucuu gaaguggugg ccuaacuacg gcuacacuag aaggacagua uuugguaucu 4801 gcgcucugcu gaagccaguu accuucggaa aaagaguugg uagcucuuga uccggcaaac 4861 aaaccaccgc ugguagcggu gguuuuuuug uuugcaagca gcagauuacg cgcagaaaaa 4921 aaggaucuca agaagauccu uugaucuuuu cuacgggguc ugacgcucag uggaacgaaa 4981 acucacguua agggauuuug gucaugagau uaucaaaaag gaucuucacc uagauccuuu 5041 uaaauuaaaa augaaguuuu aaaucaaucu aaaguauaua ugaguaaacu uggucugaca 5101 guuaccaaug cuuaaucagu gaggcaccua ucucagcgau cugucuauuu cguucaucca 5161 uaguugccug acuccccguc guguagauaa cuacgauacg ggagggcuua ccaucuggcc 5221 ccagugcugc aaugauaccg cgagacccac gcucaccggc uccagauuua ucagcaauaa 5281 accagccagc cggaagggcc gagcgcagaa gugguccugc aacuuuaucc gccuccaucc 5341 agucuauuaa uuguugccgg gaagcuagag uaaguaguuc gccaguuaau aguuugcgca 5401 acguuguugc cauugcugca ggcaucgugg ugucacgcuc gucguuuggu auggcuucau 5461 ucagcuccgg uucccaacga ucaaggcgag uuacaugauc ccccauguug ugcaaaaaag 5521 cgguuagcuc cuucgguccu ccgaucguug ucagaaguaa guuggccgca guguuaucac 5581 ucaugguuau ggcagcacug cauaauucuc uuacugucau gccauccgua agaugcuuuu 5641 cugugacugg ugaguacuca accaagucau ucugagaaua guguaugcgg cgaccgaguu 5701 gcucuugccc ggcgucaaca cgggauaaua ccgcgccaca uagcagaacu uuaaaagugc 5761 ucaucauugg aaaacguucu ucggggcgaa aacucucaag gaucuuaccg cuguugagau 5821 ccaguucgau guaacccacu cgugcaccca acugaucuuc agcaucuuuu acuuucacca 5881 gcguuucugg gugagcaaaa acaggaaggc aaaaugccgc aaaaaaggga auaagggcga 5941 cacggaaaug uugaauacuc auacucuucc uuuuucaaua uuauugaagc auuuaucagg 6001 guuauugucu caugagcgga uacauauuug aauguauuua gaaaaauaaa caaauagggg 6061 uuccgcgcac auuuccccga aaagugccac cugacgucua agaaaccauu auuaucauga 6121 cauuaaccua uaaaaauagg cguaucacga ggcccuuucg ucuucaagaa Start codon found at (in exon gene sequence): 6 Start codon found at (in original gene sequence): 6 Stop codon found at (in exon gene sequence): 57 Stop codon found at (in original gene sequence): 57 Number of peptides = 17 1-letter Peptide sequence: MFDSLSSISFNAVVYHS 3-letter Peptide sequence: Met-Phe-Asp-Ser-Leu-Ser-Ser-Ile-Ser-Phe-Asn-Ala-Val-Val-Tyr- His-Ser- Start codon found at (in exon gene sequence): 86 Start codon found at (in original gene sequence): 86 Stop codon found at (in exon gene sequence): 356 Stop codon found at (in original gene sequence): 356 Number of peptides = 90 1-letter Peptide sequence: MKSNNALIVILGTVTLDAVGIGLVMPVLPGLLRDIVHSDSIASHYGVLLALYALMQFLCA PVLGALSDRFGRRPVLLASLLGATIEVPKR 3-letter Peptide sequence: Met-Lys-Ser-Asn-Asn-Ala-Leu-Ile-Val-Ile-Leu-Gly-Thr-Val-Thr- Leu-Asp-Ala-Val-Gly-Ile-Gly-Leu-Val-Met-Pro-Val-Leu-Pro-Gly- Leu-Leu-Arg-Asp-Ile-Val-His-Ser-Asp-Ser-Ile-Ala-Ser-His-Tyr- Gly-Val-Leu-Leu-Ala-Leu-Tyr-Ala-Leu-Met-Gln-Phe-Leu-Cys-Ala- Pro-Val-Leu-Gly-Ala-Leu-Ser-Asp-Arg-Phe-Gly-Arg-Arg-Pro-Val- Leu-Leu-Ala-Ser-Leu-Leu-Gly-Ala-Thr-Ile-Glu-Val-Pro-Lys-Arg- Start codon found at (in exon gene sequence): 377 Start codon found at (in original gene sequence): 377 Stop codon found at (in exon gene sequence): 2021 Stop codon found at (in original gene sequence): 2021 Number of peptides = 548 1-letter Peptide sequence: MEMEKEENVVYGPLPFYPIEEGSAGIQLHKYMHQYAKLGAIAFSNALTGVDISYQEYFDI TCRLAEAMKNFGMKPEEHIALCSENCEEFFIPVLAGLYIGVAVAPTNEIYTLRELNHSLG IAQPTIVFSSRKGLPKVLEVQKTVTCIKKIVILDSKVNFGGHDCMETFIKKHVELGFQPS SFVPIDVKNRKQHVALLMNSSGSTGLPKGVRITHEGAVTRFSHAKDPIYGNQVSPGTAIL TVVPFHHGFGMFTTLGYFACGYRVVMLTKFDEELFLRTLQDYKCTSVILVPTLFAILNKS ELIDKFDLSNLTEIASGGAPLAKEVGEAVARRFNLPGVRQGYGLTETTSAFIITPEGDDK PGASGKVVPLFKVKVIDLDTKKTLGVNRRGEICVKGPSLMLGYSNNPEATRETIDEEGWL HTGDIGYYDEDEHFFIVDRLKSLIKYKGYQVPPAELESVLLQHPNIFDAGVAGVPDPDAG ELPGAVVVMEKGKTMTEKEIVDYVNSQVVNHKRLRGGVRFVDEVPKGLTGKIDAKVIREI LKKPQAKM 3-letter Peptide sequence: Met-Glu-Met-Glu-Lys-Glu-Glu-Asn-Val-Val-Tyr-Gly-Pro-Leu-Pro- Phe-Tyr-Pro-Ile-Glu-Glu-Gly-Ser-Ala-Gly-Ile-Gln-Leu-His-Lys- Tyr-Met-His-Gln-Tyr-Ala-Lys-Leu-Gly-Ala-Ile-Ala-Phe-Ser-Asn- Ala-Leu-Thr-Gly-Val-Asp-Ile-Ser-Tyr-Gln-Glu-Tyr-Phe-Asp-Ile- Thr-Cys-Arg-Leu-Ala-Glu-Ala-Met-Lys-Asn-Phe-Gly-Met-Lys-Pro- Glu-Glu-His-Ile-Ala-Leu-Cys-Ser-Glu-Asn-Cys-Glu-Glu-Phe-Phe- Ile-Pro-Val-Leu-Ala-Gly-Leu-Tyr-Ile-Gly-Val-Ala-Val-Ala-Pro- Thr-Asn-Glu-Ile-Tyr-Thr-Leu-Arg-Glu-Leu-Asn-His-Ser-Leu-Gly- Ile-Ala-Gln-Pro-Thr-Ile-Val-Phe-Ser-Ser-Arg-Lys-Gly-Leu-Pro- Lys-Val-Leu-Glu-Val-Gln-Lys-Thr-Val-Thr-Cys-Ile-Lys-Lys-Ile- Val-Ile-Leu-Asp-Ser-Lys-Val-Asn-Phe-Gly-Gly-His-Asp-Cys-Met- Glu-Thr-Phe-Ile-Lys-Lys-His-Val-Glu-Leu-Gly-Phe-Gln-Pro-Ser- Ser-Phe-Val-Pro-Ile-Asp-Val-Lys-Asn-Arg-Lys-Gln-His-Val-Ala- Leu-Leu-Met-Asn-Ser-Ser-Gly-Ser-Thr-Gly-Leu-Pro-Lys-Gly-Val- Arg-Ile-Thr-His-Glu-Gly-Ala-Val-Thr-Arg-Phe-Ser-His-Ala-Lys- Asp-Pro-Ile-Tyr-Gly-Asn-Gln-Val-Ser-Pro-Gly-Thr-Ala-Ile-Leu- Thr-Val-Val-Pro-Phe-His-His-Gly-Phe-Gly-Met-Phe-Thr-Thr-Leu- Gly-Tyr-Phe-Ala-Cys-Gly-Tyr-Arg-Val-Val-Met-Leu-Thr-Lys-Phe- Asp-Glu-Glu-Leu-Phe-Leu-Arg-Thr-Leu-Gln-Asp-Tyr-Lys-Cys-Thr- Ser-Val-Ile-Leu-Val-Pro-Thr-Leu-Phe-Ala-Ile-Leu-Asn-Lys-Ser- Glu-Leu-Ile-Asp-Lys-Phe-Asp-Leu-Ser-Asn-Leu-Thr-Glu-Ile-Ala- Ser-Gly-Gly-Ala-Pro-Leu-Ala-Lys-Glu-Val-Gly-Glu-Ala-Val-Ala- Arg-Arg-Phe-Asn-Leu-Pro-Gly-Val-Arg-Gln-Gly-Tyr-Gly-Leu-Thr- Glu-Thr-Thr-Ser-Ala-Phe-Ile-Ile-Thr-Pro-Glu-Gly-Asp-Asp-Lys- Pro-Gly-Ala-Ser-Gly-Lys-Val-Val-Pro-Leu-Phe-Lys-Val-Lys-Val- Ile-Asp-Leu-Asp-Thr-Lys-Lys-Thr-Leu-Gly-Val-Asn-Arg-Arg-Gly- Glu-Ile-Cys-Val-Lys-Gly-Pro-Ser-Leu-Met-Leu-Gly-Tyr-Ser-Asn- Asn-Pro-Glu-Ala-Thr-Arg-Glu-Thr-Ile-Asp-Glu-Glu-Gly-Trp-Leu- His-Thr-Gly-Asp-Ile-Gly-Tyr-Tyr-Asp-Glu-Asp-Glu-His-Phe-Phe- Ile-Val-Asp-Arg-Leu-Lys-Ser-Leu-Ile-Lys-Tyr-Lys-Gly-Tyr-Gln- Val-Pro-Pro-Ala-Glu-Leu-Glu-Ser-Val-Leu-Leu-Gln-His-Pro-Asn- Ile-Phe-Asp-Ala-Gly-Val-Ala-Gly-Val-Pro-Asp-Pro-Asp-Ala-Gly- Glu-Leu-Pro-Gly-Ala-Val-Val-Val-Met-Glu-Lys-Gly-Lys-Thr-Met- Thr-Glu-Lys-Glu-Ile-Val-Asp-Tyr-Val-Asn-Ser-Gln-Val-Val-Asn- His-Lys-Arg-Leu-Arg-Gly-Gly-Val-Arg-Phe-Val-Asp-Glu-Val-Pro- Lys-Gly-Leu-Thr-Gly-Lys-Ile-Asp-Ala-Lys-Val-Ile-Arg-Glu-Ile- Leu-Lys-Lys-Pro-Gln-Ala-Lys-Met- Start codon found at (in exon gene sequence): 2034 Start codon found at (in original gene sequence): 2034 Stop codon found at (in exon gene sequence): 2046 Stop codon found at (in original gene sequence): 2046 Number of peptides = 4 1-letter Peptide sequence: MLFM 3-letter Peptide sequence: Met-Leu-Phe-Met- Start codon found at (in exon gene sequence): 2162 Start codon found at (in original gene sequence): 2162 Stop codon found at (in exon gene sequence): 3083 Stop codon found at (in original gene sequence): 3083 Number of peptides = 307 1-letter Peptide sequence: MATTPVLWILYAGRIVAGITGATGAVAGAYIADITDGEDRARHFGLMSACFGVGMVAGPV AGGLLGAISLHAPFLAAAVLNGLNLLLGCFLMQESHKGERRPMPLRAFNPVSSFRWARGM TIVAALMTVFFIMQLVGQVPAALWVIFGEDRFRWSATMIGLSLAVFGILHALAQAFVTGP ATKRFGEKQAIIAGMAADALGYVLLAFATRGWMAFPIMILLASGGIGMPALQAMLSRQVD DDHQGQLQGSLAALTSLTSITGPLIVTAIYAASASTWNGLAWIVGAALYLVCLPALRRGA WSRATST 3-letter Peptide sequence: Met-Ala-Thr-Thr-Pro-Val-Leu-Trp-Ile-Leu-Tyr-Ala-Gly-Arg-Ile- Val-Ala-Gly-Ile-Thr-Gly-Ala-Thr-Gly-Ala-Val-Ala-Gly-Ala-Tyr- Ile-Ala-Asp-Ile-Thr-Asp-Gly-Glu-Asp-Arg-Ala-Arg-His-Phe-Gly- Leu-Met-Ser-Ala-Cys-Phe-Gly-Val-Gly-Met-Val-Ala-Gly-Pro-Val- Ala-Gly-Gly-Leu-Leu-Gly-Ala-Ile-Ser-Leu-His-Ala-Pro-Phe-Leu- Ala-Ala-Ala-Val-Leu-Asn-Gly-Leu-Asn-Leu-Leu-Leu-Gly-Cys-Phe- Leu-Met-Gln-Glu-Ser-His-Lys-Gly-Glu-Arg-Arg-Pro-Met-Pro-Leu- Arg-Ala-Phe-Asn-Pro-Val-Ser-Ser-Phe-Arg-Trp-Ala-Arg-Gly-Met- Thr-Ile-Val-Ala-Ala-Leu-Met-Thr-Val-Phe-Phe-Ile-Met-Gln-Leu- Val-Gly-Gln-Val-Pro-Ala-Ala-Leu-Trp-Val-Ile-Phe-Gly-Glu-Asp- Arg-Phe-Arg-Trp-Ser-Ala-Thr-Met-Ile-Gly-Leu-Ser-Leu-Ala-Val- Phe-Gly-Ile-Leu-His-Ala-Leu-Ala-Gln-Ala-Phe-Val-Thr-Gly-Pro- Ala-Thr-Lys-Arg-Phe-Gly-Glu-Lys-Gln-Ala-Ile-Ile-Ala-Gly-Met- Ala-Ala-Asp-Ala-Leu-Gly-Tyr-Val-Leu-Leu-Ala-Phe-Ala-Thr-Arg- Gly-Trp-Met-Ala-Phe-Pro-Ile-Met-Ile-Leu-Leu-Ala-Ser-Gly-Gly- Ile-Gly-Met-Pro-Ala-Leu-Gln-Ala-Met-Leu-Ser-Arg-Gln-Val-Asp- Asp-Asp-His-Gln-Gly-Gln-Leu-Gln-Gly-Ser-Leu-Ala-Ala-Leu-Thr- Ser-Leu-Thr-Ser-Ile-Thr-Gly-Pro-Leu-Ile-Val-Thr-Ala-Ile-Tyr- Ala-Ala-Ser-Ala-Ser-Thr-Trp-Asn-Gly-Leu-Ala-Trp-Ile-Val-Gly- Ala-Ala-Leu-Tyr-Leu-Val-Cys-Leu-Pro-Ala-Leu-Arg-Arg-Gly-Ala- Trp-Ser-Arg-Ala-Thr-Ser-Thr- Start codon found at (in exon gene sequence): 3086 Start codon found at (in original gene sequence): 3086 Stop codon found at (in exon gene sequence): 3161 Stop codon found at (in original gene sequence): 3161 Number of peptides = 25 1-letter Peptide sequence: MEAGGTSLTDSPLQELEPINSCGEL 3-letter Peptide sequence: Met-Glu-Ala-Gly-Gly-Thr-Ser-Leu-Thr-Asp-Ser-Pro-Leu-Gln-Glu- Leu-Glu-Pro-Ile-Asn-Ser-Cys-Gly-Glu-Leu- Start codon found at (in exon gene sequence): 3164 Start codon found at (in original gene sequence): 3164 Stop codon found at (in exon gene sequence): 3269 Stop codon found at (in original gene sequence): 3269 Number of peptides = 35 1-letter Peptide sequence: MRKPTLGRTYPSRPPSPAAARGASRAALGPGHGCA 3-letter Peptide sequence: Met-Arg-Lys-Pro-Thr-Leu-Gly-Arg-Thr-Tyr-Pro-Ser-Arg-Pro-Pro- Ser-Pro-Ala-Ala-Ala-Arg-Gly-Ala-Ser-Arg-Ala-Ala-Leu-Gly-Pro- Gly-His-Gly-Cys-Ala- Start codon found at (in exon gene sequence): 3332 Start codon found at (in original gene sequence): 3332 Stop codon found at (in exon gene sequence): 3530 Stop codon found at (in original gene sequence): 3530 Number of peptides = 66 1-letter Peptide sequence: MNHRYASEREATAAAKRLRPEQQHEWSSVSVFRKVWKRGSQRPAPLCSGSASQDAAGYPV EHLHLY 3-letter Peptide sequence: Met-Asn-His-Arg-Tyr-Ala-Ser-Glu-Arg-Glu-Ala-Thr-Ala-Ala-Ala- Lys-Arg-Leu-Arg-Pro-Glu-Gln-Gln-His-Glu-Trp-Ser-Ser-Val-Ser- Val-Phe-Arg-Lys-Val-Trp-Lys-Arg-Gly-Ser-Gln-Arg-Pro-Ala-Pro- Leu-Cys-Ser-Gly-Ser-Ala-Ser-Gln-Asp-Ala-Ala-Gly-Tyr-Pro-Val- Glu-His-Leu-His-Leu-Tyr- Start codon found at (in exon gene sequence): 3627 Start codon found at (in original gene sequence): 3627 Stop codon found at (in exon gene sequence): 3750 Stop codon found at (in original gene sequence): 3750 Number of peptides = 41 1-letter Peptide sequence: MFIISNPYREHPLSFHRYHYPHEQKSPLHGGISDQTGKNRP 3-letter Peptide sequence: Met-Phe-Ile-Ile-Ser-Asn-Pro-Tyr-Arg-Glu-His-Pro-Leu-Ser-Phe- His-Arg-Tyr-His-Tyr-Pro-His-Glu-Gln-Lys-Ser-Pro-Leu-His-Gly- Gly-Ile-Ser-Asp-Gln-Thr-Gly-Lys-Asn-Arg-Pro- Start codon found at (in exon gene sequence): 3754 Start codon found at (in original gene sequence): 3754 Stop codon found at (in exon gene sequence): 3913 Stop codon found at (in original gene sequence): 3913 Number of peptides = 53 1-letter Peptide sequence: MARFIRSQTLTLLEKLNELDADEQADICESLHDHADELYRSCLARFGDDGENL 3-letter Peptide sequence: Met-Ala-Arg-Phe-Ile-Arg-Ser-Gln-Thr-Leu-Thr-Leu-Leu-Glu-Lys- Leu-Asn-Glu-Leu-Asp-Ala-Asp-Glu-Gln-Ala-Asp-Ile-Cys-Glu-Ser- Leu-His-Asp-His-Ala-Asp-Glu-Leu-Tyr-Arg-Ser-Cys-Leu-Ala-Arg- Phe-Gly-Asp-Asp-Gly-Glu-Asn-Leu- Start codon found at (in exon gene sequence): 3919 Start codon found at (in original gene sequence): 3919 Stop codon found at (in exon gene sequence): 3952 Stop codon found at (in original gene sequence): 3952 Number of peptides = 11 1-letter Peptide sequence: MQLPETVTACL 3-letter Peptide sequence: Met-Gln-Leu-Pro-Glu-Thr-Val-Thr-Ala-Cys-Leu- Start codon found at (in exon gene sequence): 3959 Start codon found at (in original gene sequence): 3959 Stop codon found at (in exon gene sequence): 4025 Stop codon found at (in original gene sequence): 4025 Number of peptides = 22 1-letter Peptide sequence: MPGADKPVRARQRVLAGVGAQP 3-letter Peptide sequence: Met-Pro-Gly-Ala-Asp-Lys-Pro-Val-Arg-Ala-Arg-Gln-Arg-Val-Leu- Ala-Gly-Val-Gly-Ala-Gln-Pro- Start codon found at (in exon gene sequence): 4069 Start codon found at (in original gene sequence): 4069 Stop codon found at (in exon gene sequence): 4093 Stop codon found at (in original gene sequence): 4093 Number of peptides = 8 1-letter Peptide sequence: MRHQSRLY 3-letter Peptide sequence: Met-Arg-His-Gln-Ser-Arg-Leu-Tyr- Start codon found at (in exon gene sequence): 4107 Start codon found at (in original gene sequence): 4107 Stop codon found at (in exon gene sequence): 4287 Stop codon found at (in original gene sequence): 4287 Number of peptides = 60 1-letter Peptide sequence: MRCEIPHRCVRRKYRIRRSSASSLTDSLRSVVRLRRAVSAHSKAVIRLSTESGDNAGKNM 3-letter Peptide sequence: Met-Arg-Cys-Glu-Ile-Pro-His-Arg-Cys-Val-Arg-Arg-Lys-Tyr-Arg- Ile-Arg-Arg-Ser-Ser-Ala-Ser-Ser-Leu-Thr-Asp-Ser-Leu-Arg-Ser- Val-Val-Arg-Leu-Arg-Arg-Ala-Val-Ser-Ala-His-Ser-Lys-Ala-Val- Ile-Arg-Leu-Ser-Thr-Glu-Ser-Gly-Asp-Asn-Ala-Gly-Lys-Asn-Met- Start codon found at (in exon gene sequence): 4726 Start codon found at (in original gene sequence): 4726 Stop codon found at (in exon gene sequence): 4729 Stop codon found at (in original gene sequence): 4729 Number of peptides = 1 1-letter Peptide sequence: M 3-letter Peptide sequence: Met- Start codon found at (in exon gene sequence): 5004 Start codon found at (in original gene sequence): 5004 Stop codon found at (in exon gene sequence): 5031 Stop codon found at (in original gene sequence): 5031 Number of peptides = 9 1-letter Peptide sequence: MRLSKRIFT 3-letter Peptide sequence: Met-Arg-Leu-Ser-Lys-Arg-Ile-Phe-Thr- Start codon found at (in exon gene sequence): 5051 Start codon found at (in original gene sequence): 5051 Stop codon found at (in exon gene sequence): 5060 Stop codon found at (in original gene sequence): 5060 Number of peptides = 3 1-letter Peptide sequence: MKF 3-letter Peptide sequence: Met-Lys-Phe- Start codon found at (in exon gene sequence): 5080 Start codon found at (in original gene sequence): 5080 Stop codon found at (in exon gene sequence): 5113 Stop codon found at (in original gene sequence): 5113 Number of peptides = 11 1-letter Peptide sequence: MSKLGLTVTNA 3-letter Peptide sequence: Met-Ser-Lys-Leu-Gly-Leu-Thr-Val-Thr-Asn-Ala- Start codon found at (in exon gene sequence): 5232 Start codon found at (in original gene sequence): 5232 Stop codon found at (in exon gene sequence): 5496 Stop codon found at (in original gene sequence): 5496 Number of peptides = 88 1-letter Peptide sequence: MIPRDPRSPAPDLSAINQPAGRAERRSGPATLSASIQSINCCREARVSSSPVNSLRNVVA IAAGIVVSRSSFGMASFSSGSQRSRRVT 3-letter Peptide sequence: Met-Ile-Pro-Arg-Asp-Pro-Arg-Ser-Pro-Ala-Pro-Asp-Leu-Ser-Ala- Ile-Asn-Gln-Pro-Ala-Gly-Arg-Ala-Glu-Arg-Arg-Ser-Gly-Pro-Ala- Thr-Leu-Ser-Ala-Ser-Ile-Gln-Ser-Ile-Asn-Cys-Cys-Arg-Glu-Ala- Arg-Val-Ser-Ser-Ser-Pro-Val-Asn-Ser-Leu-Arg-Asn-Val-Val-Ala- Ile-Ala-Ala-Gly-Ile-Val-Val-Ser-Arg-Ser-Ser-Phe-Gly-Met-Ala- Ser-Phe-Ser-Ser-Gly-Ser-Gln-Arg-Ser-Arg-Arg-Val-Thr- Start codon found at (in exon gene sequence): 5505 Start codon found at (in original gene sequence): 5505 Stop codon found at (in exon gene sequence): 5673 Stop codon found at (in original gene sequence): 5673 Number of peptides = 56 1-letter Peptide sequence: MLCKKAVSSFGPPIVVRSKLAAVLSLMVMAALHNSLTVMPSVRCFSVTGEYSTKSF 3-letter Peptide sequence: Met-Leu-Cys-Lys-Lys-Ala-Val-Ser-Ser-Phe-Gly-Pro-Pro-Ile-Val- Val-Arg-Ser-Lys-Leu-Ala-Ala-Val-Leu-Ser-Leu-Met-Val-Met-Ala- Ala-Leu-His-Asn-Ser-Leu-Thr-Val-Met-Pro-Ser-Val-Arg-Cys-Phe- Ser-Val-Thr-Gly-Glu-Tyr-Ser-Thr-Lys-Ser-Phe- Start codon found at (in exon gene sequence): 5685 Start codon found at (in original gene sequence): 5685 Stop codon found at (in exon gene sequence): 5832 Stop codon found at (in original gene sequence): 5832 Number of peptides = 49 1-letter Peptide sequence: MRRPSCSCPASTRDNTAPHSRTLKVLIIGKRSSGRKLSRILPLLRSSSM 3-letter Peptide sequence: Met-Arg-Arg-Pro-Ser-Cys-Ser-Cys-Pro-Ala-Ser-Thr-Arg-Asp-Asn- Thr-Ala-Pro-His-Ser-Arg-Thr-Leu-Lys-Val-Leu-Ile-Ile-Gly-Lys- Arg-Ser-Ser-Gly-Arg-Lys-Leu-Ser-Arg-Ile-Leu-Pro-Leu-Leu-Arg- Ser-Ser-Ser-Met- Start codon found at (in exon gene sequence): 5914 Start codon found at (in original gene sequence): 5914 Stop codon found at (in exon gene sequence): 5932 Stop codon found at (in original gene sequence): 5932 Number of peptides = 6 1-letter Peptide sequence: MPQKRE 3-letter Peptide sequence: Met-Pro-Gln-Lys-Arg-Glu- Start codon found at (in exon gene sequence): 5948 Start codon found at (in original gene sequence): 5948 Stop codon found at (in exon gene sequence): 6029 Stop codon found at (in original gene sequence): 6029 Number of peptides = 27 1-letter Peptide sequence: MLNTHTLPFSILLKHLSGLLSHERIHI 3-letter Peptide sequence: Met-Leu-Asn-Thr-His-Thr-Leu-Pro-Phe-Ser-Ile-Leu-Leu-Lys-His- Leu-Ser-Gly-Leu-Leu-Ser-His-Glu-Arg-Ile-His-Ile- Start codon found at (in exon gene sequence): 6032 Start codon found at (in original gene sequence): 6032 Stop codon found at (in exon gene sequence): 6047 Stop codon found at (in original gene sequence): 6047 Number of peptides = 5 1-letter Peptide sequence: MYLEK 3-letter Peptide sequence: Met-Tyr-Leu-Glu-Lys- Start codon found at (in exon gene sequence): 6117 Start codon found at (in original gene sequence): 6117 Searched to the end of exon but could not find a matching terminator codon. ======================================================= Want to search for a given sequence? (Y/n) Stop - Program terminated.