NRPDB21203
(Arabidopsis lyrata)


1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
4141
4201
4261
4321
4381
4441
>NRPDB21203 | Arabidopsis lyrata | genomic sequence | RNA Polymerase IV Small Subunit
caaagcacac taacacgaga acaacataag gacatacccg tcatttcaac aaacgacact 
caatttcttc actccctcgt caactgcttc gcttcgttta accatcgaaa aagtgagcca 
agggttttga ctctcttcgt tttctacggc gaaatctctc cgattttccg gcgacgttta 
ctctgccttc ctccaacacc gccgttttac tccatcgtgc cagcttaagc aatcaaggta 
cccattttag gtattacgct ttgattctgc ttttaagcat tggaaattcc ggagactata 
tgctttagag aatgattcgg ttctagggga aagtttttga ttgcgtgttt gtattcgtat 
gatgcatttt cgtggttcat gattttcacg gcttcttaat ctttgtttgg ggtttttttt 
ttgtttcagt gtgttttgag gtataccaga aaagatggac tatattgttg aacggaatta 
attttctgtt accagaaaag ATGGACATTG ATGAGATGGA TATTGAAGAG ATCGAGGCTA 
CTGCGGAGAT CAATCTATCT GAGCTAGGAG AAAGTTTTCT CCAGAGTTTC TGCAAGAAAG 
CTGCAACTTC CTTCTTTGAT AAGTATGGAC TTATAAGTCA TCAGCTCAAT TCCTACAACT 
TCTTCATTCA ACACGGGCTT CAGGATGTGT TTGAATCCTT TGGTGATATG CTTGTGGAAC 
CGTCGTTTGA TGTGATAAAG AAGAAGGATA ACGATTGGAG ATACGCTACG GTGAAATTCG 
GAAAAGTCAC TGTGGAGAAG CCCACTTTCT TTTCCGATGA CAAGGAGCTT GAGTTTCTCC 
CATGGCATGC CAGGCTTCAG AACATGACAT ATTCAGCAAG GATCAAAGTC AATGTCCAAG 
TTGAGgtaac aaaatctttg tcgaaaaatt aagtaagctt gtctggattt gataaatgat 
ttcccttgct tgaaaactca gaaagaccag ttaactatca ctttttagtt caacattatg 
caatatgtct atgtagtcga gagtaagctc attttttgat gtttctacta gactcttgct 
gacacatata tgaagatgtt gacatacact gaggttcctg tcatagattt ctcaaactta 
tcaaaacctt taacttgcca taaaataata tattaagggt tatggcacat atatgtctgg 
aaactggttt cactcttttt ggctttacaa gttttctatt cttggatttg gttccttatt 
tgcattcgct ggatttctta cgtgagcaaa atatctagta aaagagattt attacattta 
cattttcgtg tgaagtagag gtatgtttca ggcttcgttg tttttaagat tgatgatttt 
gtctgctccc aatctttaga tgtttcttgc tttttttccg ggccaaaatt tgaattgtga 
ttactttttc ttgtagtagt gggtgctcaa acgaaataag ctttagtttg tttcatttta 
aagattggat gcaataaaag aaaaacatct tcagcttttt atttatttag ttcttcccca 
ttccctcact gtgctttaat ttgagtgttt catgcttgtg tgcaatgact cttgtactat 
caaacttttg atgctgtttc tgttttgctg tccatgtatc ttattcttat aaatgtagtt 
tattgtctaa ctgcctcttc actttataaa ttcactagGT GTTCATAAAA ACTGTTGTTA 
AAAGCGACAA ATTCAAGACG GGACAAGACG AATATGTCGA GAAGAAGATA CTTGAGGTCA 
AAAAGCAGGA CATTCTAATT GGTAGCATTC CTGTCATGGT GAAATCTGTC CTTTGCAAAA 
CAAGCGAGAA AGGAAAAGAA AACTGCAGAA AGGGGGATTG TGCCTTTGAT CAGGGTGGCT 
ATTTTGTGAT AAAGGGGGCT GAGAAGgtta gttaaactaa tacatacata tatgcatatt 
gccattcaat acttaaaata aactttattt tctaagccaa aacggatttt gtttgtcagc 
aatttataca acacaaacaa gagtataatt acaattttct atcatcagat aatagtagtt 
atcagcaaaa aagatgttac aaattagaca ataacctatt tggttcattt tattttccta 
atggagatgg ttaagaaagt aagaaactta acttatttat gactttatat gcttaaacat 
acatacaaca aaacctttat caacacaaac acatactgaa ggaaaggaaa aacaattcaa 
gtacttcaaa cttcattaaa tactaaacaa attaatcgtg tttgggatct tttttcatag 
ccactgtcat gggatttcta ttttgaacta ttttagtgga aattagtttt ctcgccgttt 
tcctttgctc agcaggctcc gtctttgctt gtcctttatg catactttat tagcatcaaa 
gatatagact ttttctttcc tttctggctt gaccatgagg ccatgactat tcaaatctta 
caggaagcgt tctttgcagt cataggctct gggacagatg acttgactct gatatatact 
gcaaaaaata ttttcaagtt gttatacaac ttcctaacgt gattatattg tgttttgcag 
GTGTTTATAG CTCAAGAACA GATGTGCACA AAGAGACTGT GGATTTCTAA CTCACCATGG 
ACAGTCTCTT TCAGGTCCGA AAATAAAAGA AATAGGTTCA TTGTGCGCCT CTCGGAGAAT 
GAGAAATCAG AAGACTATAA GAAAAGGGAG AAAGTACTGA CAGTGTACTT CTTGTCGACT 
GAGATTCCAG TCTGGCTCCT GTTCTTTGCG CTGGGTGTTT CGTCAGACAA AGAAGCCATG 
AATCTGATTG CTTTTGATGG TGATGATGCA AGCATTACCA ACAGTCTCAT AGCTTCTATC 
CATGAAGCTG ATGCAGTTTG TGAAGCTTTT CGCTGTGGGA ACAATGCTTT AAGTTATGTT 
GAACAGCAGA TCAAACCTTG GAGgcctgga tgacaggcaa gtatctctga caagcaagta 
tctctgacag gcaaaataga agtgaaagcc ctggtacaga gatacttgcc tgtcatatat 
ctctgtaaga ctaaaaaact aagaagtttc caggcctcca agtacagaga tatatgacag 
gcaagtatct ctgacaggca aaatagaagt gagcatgagt tatatgacag gcaagtatct 
ctgtaagact aaaaaactaa gaagttcaat gttctctggt tgattaatac ttctattgtt 
cctgaaaaac gtctagagaa tacacaaaaa taggctcaaa agcaatgtac cagtatataa 
attagttaga ggattgatgc tgtgagcctt gtgatttatg tctgattcat ttaacctttc 
ttagattatt gttgattctt gagtcctgat tccattacca atggtaaata tttgtggtta 
gGGCCGGCTT TTCAAGATGG GGAAACGAAA GGGTCTACAA TGGTAGATCG GGTGAGATGA 
TGCGTTCTCT GATATTCATG GGCCCAACTT TCTACCAGCG ACTTGTCCAC ATGTCAGAGG 
ACAAAGTCAA GTTCAGGAAC ACCGGACCAG TCCACCCGCT CACACGCCAG CAAGTCGCAG 
ACAGGAAGAG GTTTGGCGGG ATAAGGTTTG GAGAAATGGA GCGAGACTGC CTAATAGCTC 
ACGGTGCATC TGCTAATCTG CACGAGCGTC TCTTCACTCT AAGTGACTCT TCTCAGATGC 
ACATCTGCAG AAAATGTAAG ACCTATGCGA ATGTGATCGA GAGGACTCCA AGCAGTGGAA 
GAAAGATCAG AGGGCCATAT TGTAGAGTCT GCGTATCCTC AGACCATGTG GTTAGAGTCT 
ATGTTCCGTA TGGAGCTAAA CTTCTGTGTC AGGAGCTGTT CAGCATGGGC ATCACTCTCA 
ACTTCGACAC CAAGCTCTGC TGAttacccc tctttattat gtaaaggtct taatgcctta 
agaccatgtt atgtgtagtt tgcttccatc ccggttctgg ttagtagcat tggttttggt 
ttggttgatt cggtaaggtt atccgaaccg aagaaatcgt taaaccaagc cacggaacta 
acccgtaaat gttgcttttg tgagatttga ctctttaacc aactgttaag ctgttttttt 
tttttgtcaa ctggtgtatt attattatta ataaaaccaa agaaactata caatgcttgg 
ttatgagtaa accaagacaa acataggcca ttatttttta ctggacctga aaatgaaaac 
caagcccaac aaagtaaacc cttaagcaaa gataattaaa acgcacgatc acatctctca 
acttggaagc ttgccgcaca gatgcttatt acatcgaact tcatcgcttt gtgaacacgc 
gtccagctcg tcatcatcgt ctagtctacc gccggtttgg aa
Transcript Splice Model

501-905
1719-1946
2641-3023
3482-3983



1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
>NRPDB21203 | Arabidopsis lyrata | transcript sequence | RNA Polymerase IV Small Subunit
ATGGACATTG ATGAGATGGA TATTGAAGAG ATCGAGGCTA CTGCGGAGAT CAATCTATCT 
GAGCTAGGAG AAAGTTTTCT CCAGAGTTTC TGCAAGAAAG CTGCAACTTC CTTCTTTGAT 
AAGTATGGAC TTATAAGTCA TCAGCTCAAT TCCTACAACT TCTTCATTCA ACACGGGCTT 
CAGGATGTGT TTGAATCCTT TGGTGATATG CTTGTGGAAC CGTCGTTTGA TGTGATAAAG 
AAGAAGGATA ACGATTGGAG ATACGCTACG GTGAAATTCG GAAAAGTCAC TGTGGAGAAG 
CCCACTTTCT TTTCCGATGA CAAGGAGCTT GAGTTTCTCC CATGGCATGC CAGGCTTCAG 
AACATGACAT ATTCAGCAAG GATCAAAGTC AATGTCCAAG TTGAGGTGTT CATAAAAACT 
GTTGTTAAAA GCGACAAATT CAAGACGGGA CAAGACGAAT ATGTCGAGAA GAAGATACTT 
GAGGTCAAAA AGCAGGACAT TCTAATTGGT AGCATTCCTG TCATGGTGAA ATCTGTCCTT 
TGCAAAACAA GCGAGAAAGG AAAAGAAAAC TGCAGAAAGG GGGATTGTGC CTTTGATCAG 
GGTGGCTATT TTGTGATAAA GGGGGCTGAG AAGGTGTTTA TAGCTCAAGA ACAGATGTGC 
ACAAAGAGAC TGTGGATTTC TAACTCACCA TGGACAGTCT CTTTCAGGTC CGAAAATAAA 
AGAAATAGGT TCATTGTGCG CCTCTCGGAG AATGAGAAAT CAGAAGACTA TAAGAAAAGG 
GAGAAAGTAC TGACAGTGTA CTTCTTGTCG ACTGAGATTC CAGTCTGGCT CCTGTTCTTT 
GCGCTGGGTG TTTCGTCAGA CAAAGAAGCC ATGAATCTGA TTGCTTTTGA TGGTGATGAT 
GCAAGCATTA CCAACAGTCT CATAGCTTCT ATCCATGAAG CTGATGCAGT TTGTGAAGCT 
TTTCGCTGTG GGAACAATGC TTTAAGTTAT GTTGAACAGC AGATCAAACC TTGGAGGGCC 
GGCTTTTCAA GATGGGGAAA CGAAAGGGTC TACAATGGTA GATCGGGTGA GATGATGCGT 
TCTCTGATAT TCATGGGCCC AACTTTCTAC CAGCGACTTG TCCACATGTC AGAGGACAAA 
GTCAAGTTCA GGAACACCGG ACCAGTCCAC CCGCTCACAC GCCAGCAAGT CGCAGACAGG 
AAGAGGTTTG GCGGGATAAG GTTTGGAGAA ATGGAGCGAG ACTGCCTAAT AGCTCACGGT 
GCATCTGCTA ATCTGCACGA GCGTCTCTTC ACTCTAAGTG ACTCTTCTCA GATGCACATC 
TGCAGAAAAT GTAAGACCTA TGCGAATGTG ATCGAGAGGA CTCCAAGCAG TGGAAGAAAG 
ATCAGAGGGC CATATTGTAG AGTCTGCGTA TCCTCAGACC ATGTGGTTAG AGTCTATGTT 
CCGTATGGAG CTAAACTTCT GTGTCAGGAG CTGTTCAGCA TGGGCATCAC TCTCAACTTC 
GACACCAAGC TCTGCTGA


1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
>NRPDB21203 | Arabidopsis lyrata | orf sequence | RNA Polymerase IV Small Subunit
ATGGACATTG ATGAGATGGA TATTGAAGAG ATCGAGGCTA CTGCGGAGAT CAATCTATCT 
GAGCTAGGAG AAAGTTTTCT CCAGAGTTTC TGCAAGAAAG CTGCAACTTC CTTCTTTGAT 
AAGTATGGAC TTATAAGTCA TCAGCTCAAT TCCTACAACT TCTTCATTCA ACACGGGCTT 
CAGGATGTGT TTGAATCCTT TGGTGATATG CTTGTGGAAC CGTCGTTTGA TGTGATAAAG 
AAGAAGGATA ACGATTGGAG ATACGCTACG GTGAAATTCG GAAAAGTCAC TGTGGAGAAG 
CCCACTTTCT TTTCCGATGA CAAGGAGCTT GAGTTTCTCC CATGGCATGC CAGGCTTCAG 
AACATGACAT ATTCAGCAAG GATCAAAGTC AATGTCCAAG TTGAGGTGTT CATAAAAACT 
GTTGTTAAAA GCGACAAATT CAAGACGGGA CAAGACGAAT ATGTCGAGAA GAAGATACTT 
GAGGTCAAAA AGCAGGACAT TCTAATTGGT AGCATTCCTG TCATGGTGAA ATCTGTCCTT 
TGCAAAACAA GCGAGAAAGG AAAAGAAAAC TGCAGAAAGG GGGATTGTGC CTTTGATCAG 
GGTGGCTATT TTGTGATAAA GGGGGCTGAG AAGGTGTTTA TAGCTCAAGA ACAGATGTGC 
ACAAAGAGAC TGTGGATTTC TAACTCACCA TGGACAGTCT CTTTCAGGTC CGAAAATAAA 
AGAAATAGGT TCATTGTGCG CCTCTCGGAG AATGAGAAAT CAGAAGACTA TAAGAAAAGG 
GAGAAAGTAC TGACAGTGTA CTTCTTGTCG ACTGAGATTC CAGTCTGGCT CCTGTTCTTT 
GCGCTGGGTG TTTCGTCAGA CAAAGAAGCC ATGAATCTGA TTGCTTTTGA TGGTGATGAT 
GCAAGCATTA CCAACAGTCT CATAGCTTCT ATCCATGAAG CTGATGCAGT TTGTGAAGCT 
TTTCGCTGTG GGAACAATGC TTTAAGTTAT GTTGAACAGC AGATCAAACC TTGGAGGGCC 
GGCTTTTCAA GATGGGGAAA CGAAAGGGTC TACAATGGTA GATCGGGTGA GATGATGCGT 
TCTCTGATAT TCATGGGCCC AACTTTCTAC CAGCGACTTG TCCACATGTC AGAGGACAAA 
GTCAAGTTCA GGAACACCGG ACCAGTCCAC CCGCTCACAC GCCAGCAAGT CGCAGACAGG 
AAGAGGTTTG GCGGGATAAG GTTTGGAGAA ATGGAGCGAG ACTGCCTAAT AGCTCACGGT 
GCATCTGCTA ATCTGCACGA GCGTCTCTTC ACTCTAAGTG ACTCTTCTCA GATGCACATC 
TGCAGAAAAT GTAAGACCTA TGCGAATGTG ATCGAGAGGA CTCCAAGCAG TGGAAGAAAG 
ATCAGAGGGC CATATTGTAG AGTCTGCGTA TCCTCAGACC ATGTGGTTAG AGTCTATGTT 
CCGTATGGAG CTAAACTTCT GTGTCAGGAG CTGTTCAGCA TGGGCATCAC TCTCAACTTC 
GACACCAAGC TCTGCTGA


1
61
121
181
241
301
361
421
481
>NRPDB21203 | Arabidopsis lyrata | protein sequence | RNA Polymerase IV Small Subunit
MDIDEMDIEE IEATAEINLS ELGESFLQSF CKKAATSFFD KYGLISHQLN SYNFFIQHGL 
QDVFESFGDM LVEPSFDVIK KKDNDWRYAT VKFGKVTVEK PTFFSDDKEL EFLPWHARLQ 
NMTYSARIKV NVQVEVFIKT VVKSDKFKTG QDEYVEKKIL EVKKQDILIG SIPVMVKSVL 
CKTSEKGKEN CRKGDCAFDQ GGYFVIKGAE KVFIAQEQMC TKRLWISNSP WTVSFRSENK 
RNRFIVRLSE NEKSEDYKKR EKVLTVYFLS TEIPVWLLFF ALGVSSDKEA MNLIAFDGDD 
ASITNSLIAS IHEADAVCEA FRCGNNALSY VEQQIKPWRA GFSRWGNERV YNGRSGEMMR 
SLIFMGPTFY QRLVHMSEDK VKFRNTGPVH PLTRQQVADR KRFGGIRFGE MERDCLIAHG 
ASANLHERLF TLSDSSQMHI CRKCKTYANV IERTPSSGRK IRGPYCRVCV SSDHVVRVYV 
PYGAKLLCQE LFSMGITLNF DTKLC
FASTA view