SDG20130
(Micromonas pusilla NOUM17)

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
4141
4201
4261
4321
4381
4441
4501
4561
4621
4681
4741
4801
4861
4921
4981
5041
5101
5161
5221
5281
5341
5401
5461
5521
5581
5641
>SDG20130 | Micromonas pusilla NOUM17 | genomic sequence | SET domain proteins
gtcgcgcaat atcttctcct cgtcatcccc ggcctcgtcg tcctcgaaca agggattggc 
gtggctgctc acggagctat cggcgtcggc gcccggtccg tcggctgtga tttcaaccgc 
cccggaaccc tcgccgtcgt cgttgaacag ggggttgtcg aagctgccgg tggaatccct 
cagcggcgac ttgttgacgc cgagcggcga cgggctgggg ggtatcgggc tggaggctga 
cggcgtcgga ccgtccgtcg ccaaggcggg agcttggaag ggcaccggcg actggtgcac 
cgagggagga ccggttacac cctcggtcgg gaggtcgaaa agtgtgtcat cactctccga 
atctggagcg cccttgggag gagtgccgaa caggccgctc atggcctcgg cgggcatgac 
ggtgtatccg ttgtgcccgc cgccgccgtt ttggagagcg gactccggcg ggagttcgcc 
tgcgcccatg gtgcggttgg gatatgcccc ggcagtcagc taaaatcgat cttccgagat 
gcgaactccc gctcgccttt ttgaccggag cgagccgact ggcgctcaga ttcaaactcg 
ATGGAGAAGC GCCCCGCCGA GGACCCGCCC GCGGGGGAGG ATCCTCCCAA GGCCCCTCGC 
GTGGGTCCCA CGGCCGACGC GGACACGCAA ACCACGGGAG GGTACCTCCC GCCCGCCCCG 
CCCGCGAAGG TCGACGCCTC CGTCGCGGCG TCGGAGGGTG TCGCCTCGGC TTCGCCCTAC 
GGCCCCACCG CGCGCCTGGT GCCCGAGGCG ATGATCACCG CGGCGGTGGC GGCGATGCGC 
CGGGAGGACT TCCCGCGGAA AACGACGGAA AACGACGAGG ACGAGGCGCC CGCGATGCGC 
ACGGAGACGC CGGACGAGAA GCCCCCTCCT CCCGCCAACG AGGAGACCCC TCCGGAGACG 
GAGCCGACGC CGATGGAGGA TGATGGCGAG GAGGTTGTCA TCGAGGAGCG CGCGTGGACG 
GGAGGAGAGA CGCTTTCGAC GGCGCCCGCG TCCTCGTCGC CTCACGCGCG CCCCGCGCCG 
CCGCACCCCC CGATATCCGC TCCCGCGATC GTGCACGAGC CTCCCGCGCG CGCTCCGCCG 
CCGAAGGTGT TGCTCGAAAG TAGGCGAGGA CCTACAACCC AGGAAGCCCT GGATGCGCTG 
AAAGAAGAGT TGAGAACGCA TCGGGAGCGG GACCCGCACT GGCGCTTCCT GCAAGCTCAC 
CAGTTCAAGA CGGCGTGGGC GGAGGAGTGG CCAGGCGGTT TGTGGATTGC TAACCGTGTC 
GGTGATGGTG AAAGAGCGAG GAGATACCGT GCGTCGATGT CGGAGGCAGA GCTCGAGTTG 
CTCAACGAGC TGATCTCATC CACCGCCGCG GCGCCCCGCA AGCCCCTCTG GACCGTGCCC 
GGCCTGGACC CGGCGTCGCT GTGGAAAGCC ATCCAGCGCG ACCCGGATCC GTTCGCGCGC 
GACATCCTAC GGGACCTCGA ACGCGCCGAA CGCACCGAAC CCTACCTCCA GGTGGGAGAC 
CGATGGCGAA GAGAACCGTG GGAGGTGAAC CCCTACGAGC ACATCGGTCC GGGGTTCGTC 
GTCGCCGACG ACTTGCCGGA ACCCGACGCT TCGGAGGTTT GCGAGAAACT TCACACGACG 
ATGCCGCGCG GGTGGACGGA CTTACCCCCG GAGCCGCCGA GCGCGATGCT GAACGGCGCG 
AAGATGGCCG CCGCTGTGGA CGATGTCAAA TGCCTCGAGT GCGGTCGCGC CGACGGCGAA 
GCTGATTTCG TGCTCTGCGA CGGGTGCCCC GACGACGACG TCAGGGGCGG GCACTGGCGG 
TGCCTCGGCA TGGCGCGCTT ACCCACCGGC GATTGGTTCT GCGATCGATG CGTGACGGAC 
GGCAAGGGGA CGAACGACGA CGCGATGTAC GGCGATGCGG GCGACGACGG GGCCGGTGAC 
TCATCGCCCG GTGACTCACC ACCTGTGCCG TCGATCGCCC CCATCGTCCC TCTGCATCCG 
TACGTCCCGA ACGCGAAGCC GCGGGTCCAC GACCGAGTTC CGCTGCGACT GGGCGTGGAC 
GTGGAGGAAC GGCCGATGTG GGGGTGCGAC TGCTACACGC GCGTCGCCGT CGACGCGGCG 
CTGTCGCGAG CGCCCGGGTA CGCGGGCGAC TGCGTGGACG CTCGACGGAG ACGCGATTTG 
TTTTTTTCCA AGTTGCTCAT GCCCGCGGTG CACACGATGG GGGCGGACGG GTGGGATCTC 
GCGTTGGCGG TGCAGAAGCT CGCGGCGGGG ACGTCGCCGA GGGAACCGTA CGACGACGCG 
GCGGCGCGGA ACGAGGCTGG AGCGGACGGG GGCGGTGACT CACCGGGCGG TGACTCACCG 
GCTCCGTCGG CGTTTGAGAG GGACTTTGCG ACGATCAAGG AGGGCTGCGA CGCCATCCTT 
CGGGCGATTC GCGAGGTTGA CGAGGCTGCG CTGCCCACGG TGCCCCCGCC CAAGCCCGCG 
AAGGGTCAGA AGACCAAGCT ACAGATGGAA GGGACGGGCG TCGCGGGGAT TAAGGGTTCG 
AAGGAGTCGC GAGACGCCGC CGCCAACGCC GCCGGGATCA AGCGACCCAT GACGGCTTTC 
TTCATATTCT CGCAGGAGCA ACGGGCGTTG TTAATCGAGC AACGCCCCGA ACTCCGCACG 
AACATCTCGG CGGTGGGTAA GCTGATGGGC GAGCGGTGGC GGAAGCTGTC CGACGAGGAG 
AAGTTTCCCT ACGCAATCAA AGCCGAGGAG GCGAGACACG AGTACGAAAT CGCCGCGACA 
AAGGCGGAGG AGGAGGCTCA CGCGGCGGCG AAGGCTCGCG AGGAGGCGGA GATGGCCGCG 
CTGGCGCAGG AACGCGCGGA CGCCGAGGCT AAGAAGGCGG AGGCGGCAGC CGCGCTGGAA 
CGCGAGATGG CGGAGGCTGC GGAGAAGGGG ATCATACTGC AGGTGTACGG AGCGGGGCGA 
AAGCCTCGAA AACCGAACCA GTCATCGCTC AAGTCACACA AGCGCCGACA CTTTCGCATG 
CACCCAAAGG GAATCGGGAT CGTGTGCATA CGTCCCGAGG GGTTACCCCC CGGGACGTAC 
ATTCAGGATT ACCTCGGCGA GCTGTACTCG CCGTGGCGGT GGTTCGAGCG ACAGGACGCC 
ATCAAGAAGA GGGAGCCCGA CAAGGAGCTC CCGGATTTTT TCAACATCAC TCTGGAGAGA 
CCCGCGGAGG ACGCCGCCGG TCACGACGTG CTGTTCGTGG AGGCGGCGCA CAGGTGCACG 
TTCGCGTCTC GGCTCTCGCA CTCGTGCGCG CCCAACTGCC AGACGGTGGG CGTCGCGGTG 
GCGGACCAGA CGGACCAAAA GTTGGACCAA AAGTTGGACC AAAATAATTT GGACCAAAAG 
TTGGGCCAAA CCGCCGACCC GCCGCGGACG AAGCTGTCCA TCGCGCAGTA CACGACGAGG 
CACGTGTCGT ACGGAGAGGA ACTCTGCTGG AACTACAGCT GCGTCACCGA GTCCGAGAAG 
GAGTACCGGG CCGCCATATG CCTGTGCTCG TCGACGACGT GCAAAGGCGC CTTTTTGGAC 
TACGCCGGAT CATCCGCGTT CACCGCGGTG ATGAACGTCC GGCACAATTT CCTCGACCGC 
AACGCCCTGT TGATCCGCGC GTGCTCCGAG CCCCTCACCT CAGACGATCG CGCGAGGCTG 
GCGACGGCGG GGATCAAGAG CGCGGCGCTG ACGATGCCGG GGGAGCGGAC GCGAACCGGC 
GAGCGGGTAG AGTGCCCGGA ATGGCTCATC AAGTGGGCGT CGCTCACCTT GGAGTACATC 
GAGATGGAGA AGGAGCTGCT GCCGGCGGCG CTGACGGCCA AACCCATCGA CGGCATCGTG 
TACGACGCCG GGTTCGCCGC GGCGACCGCG GCGGGAGTCG TCGCCACGAG GATATCCAAC 
CTGGTGGTGA CGCTCGACAA GATCAAGTAC GTGATGCGGC AGCCCGGTCA GAACCGCGCG 
CCTTTTCTCC GACACCTGTC CGACAACGAG GTCGTGGACC ACCTGTTGGG GGACATCCTC 
AAGCGAGCGG CGGACACGTT CGCGAAGAAG GTTGGCGTCA AAGCCGGGTT GCCGTTTTTC 
GGAGGAAAAG GCGCGAGAAA CGCGGGCGCC GAGGCTAAGA TGCCAGCCGC GGTTGGACAG 
AGGGAGGGGG ACGTCCTGAG GTTCATCCTC GGCGTGTTGG CCAAGCCCCC GTCCGAGTTT 
ACCCCCCAGG AGGCTTCGCA AACCCTGGAA ACGTGCTCGC GGAAGATTCG CGATCTCGGC 
GCGGTGCACT GCGCGATGGC GGATCTGCTG CTCCTCTACG CGAGGACCGC GCACTGGTGC 
ACTCCCGAGG CGTACGCGGG ATTCCAATCG CCTCCCGTGC GACTGGTGCC GCTGCCCAAG 
GACAAGCTGG GTGGGCGGGA TCGACGGCTT AAGGACGGGA ACGACGCAGG GGACGCAGCA 
GAGGGGACGA CTGTTCAAAT TCCCGAGGGG ACGACTGTTC CAATTCCCGA GGGGACGGTT 
CAAATTCCGC ATCCGCCTGA CGGCGCTTCC GACGCCGCGC CGACGAAATC GACGCTCGCG 
AACGGCCGCA AGCTCCCGGC GGTGTTCAAG GGCAACATCG ACAACGTCAT GAAGAAGAAG 
TACCAGCCGC ACTTCGCGTG GGGCCAGCTC GTGTCGTGGT TCAAGCAGAC CATCTACGAC 
CCCTCGGCGT CTTTATCCGC CGAGCGAAGG GGCGCCATGT CCTTACCCGA CCCCGAGAGC 
GCGTACGGCG ACAAGAACTA CGTCACCGGG GACAGGCGAT CGATGCTGCG ACAGATCGCG 
AGGGATCCGA GCAAGATGTG GCCGACGACG TGGGCGTGGT CGTTTCGTAA CCCCGGCAAG 
GTGTACGGAT CTCCGTTCAT AGACGACGCG ATTAGAGCGG CCAAGGGGGA GGAGCGGACG 
CTGCCGGGGC TGCTGGAGGA GCTGAGGGCG GTTTTAGCGG AGGAAGGGGG GGGAGCGGGG 
GATGGCGCGA CGGGGGCGGC GGCGGCGGGA GGGAAGAAGA GGAAGAAATA Gagcgtcgac 
ggcgcgcgac gtgacgacga gtgacgcttc aactactggt acttttacca tccatcgtat 
tacaagataa agtttagggc ggtcatggta aggaggagga agaagccttg gcgtgccgat 
gcgtggtcgt cgtcgtcttt gatgaggtct ggcggcggag gcggcggtgg cggcgaagag 
acggggggtg gaggcggcga cgggggcatc gtcgctgcag cagcggtcgc ctgtgccttg 
aatgtctcca gtgtgctcga gtccacacct tcgatcgtgc cgagctcgac aatgggatca 
acatcgttca cctcgacgga aatgccttcc gcctttaagg cgtttgttgc ggctgtcagc 
gtcgagtcgt cgacctcggc gtcgctgaaa aacaccgaga catcgtacgt cgtcgccgcg 
agcgcacggc ctcggcgcct ggatgccacc gtcgcgatgc acgcgccgag tgagcttgag 
agtccggcct tcgcatagta gtccgagcaa gcggtgtcct cgttcggggc tgtgagcttg 
gcggacatct ttctcacctt cttcccggtg gtggcggcgt cggcaagtag c
Transcript Splice Model

601-5091



1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
4141
4201
4261
4321
4381
4441
>SDG20130 | Micromonas pusilla NOUM17 | transcript sequence | SET domain proteins
ATGGAGAAGC GCCCCGCCGA GGACCCGCCC GCGGGGGAGG ATCCTCCCAA GGCCCCTCGC 
GTGGGTCCCA CGGCCGACGC GGACACGCAA ACCACGGGAG GGTACCTCCC GCCCGCCCCG 
CCCGCGAAGG TCGACGCCTC CGTCGCGGCG TCGGAGGGTG TCGCCTCGGC TTCGCCCTAC 
GGCCCCACCG CGCGCCTGGT GCCCGAGGCG ATGATCACCG CGGCGGTGGC GGCGATGCGC 
CGGGAGGACT TCCCGCGGAA AACGACGGAA AACGACGAGG ACGAGGCGCC CGCGATGCGC 
ACGGAGACGC CGGACGAGAA GCCCCCTCCT CCCGCCAACG AGGAGACCCC TCCGGAGACG 
GAGCCGACGC CGATGGAGGA TGATGGCGAG GAGGTTGTCA TCGAGGAGCG CGCGTGGACG 
GGAGGAGAGA CGCTTTCGAC GGCGCCCGCG TCCTCGTCGC CTCACGCGCG CCCCGCGCCG 
CCGCACCCCC CGATATCCGC TCCCGCGATC GTGCACGAGC CTCCCGCGCG CGCTCCGCCG 
CCGAAGGTGT TGCTCGAAAG TAGGCGAGGA CCTACAACCC AGGAAGCCCT GGATGCGCTG 
AAAGAAGAGT TGAGAACGCA TCGGGAGCGG GACCCGCACT GGCGCTTCCT GCAAGCTCAC 
CAGTTCAAGA CGGCGTGGGC GGAGGAGTGG CCAGGCGGTT TGTGGATTGC TAACCGTGTC 
GGTGATGGTG AAAGAGCGAG GAGATACCGT GCGTCGATGT CGGAGGCAGA GCTCGAGTTG 
CTCAACGAGC TGATCTCATC CACCGCCGCG GCGCCCCGCA AGCCCCTCTG GACCGTGCCC 
GGCCTGGACC CGGCGTCGCT GTGGAAAGCC ATCCAGCGCG ACCCGGATCC GTTCGCGCGC 
GACATCCTAC GGGACCTCGA ACGCGCCGAA CGCACCGAAC CCTACCTCCA GGTGGGAGAC 
CGATGGCGAA GAGAACCGTG GGAGGTGAAC CCCTACGAGC ACATCGGTCC GGGGTTCGTC 
GTCGCCGACG ACTTGCCGGA ACCCGACGCT TCGGAGGTTT GCGAGAAACT TCACACGACG 
ATGCCGCGCG GGTGGACGGA CTTACCCCCG GAGCCGCCGA GCGCGATGCT GAACGGCGCG 
AAGATGGCCG CCGCTGTGGA CGATGTCAAA TGCCTCGAGT GCGGTCGCGC CGACGGCGAA 
GCTGATTTCG TGCTCTGCGA CGGGTGCCCC GACGACGACG TCAGGGGCGG GCACTGGCGG 
TGCCTCGGCA TGGCGCGCTT ACCCACCGGC GATTGGTTCT GCGATCGATG CGTGACGGAC 
GGCAAGGGGA CGAACGACGA CGCGATGTAC GGCGATGCGG GCGACGACGG GGCCGGTGAC 
TCATCGCCCG GTGACTCACC ACCTGTGCCG TCGATCGCCC CCATCGTCCC TCTGCATCCG 
TACGTCCCGA ACGCGAAGCC GCGGGTCCAC GACCGAGTTC CGCTGCGACT GGGCGTGGAC 
GTGGAGGAAC GGCCGATGTG GGGGTGCGAC TGCTACACGC GCGTCGCCGT CGACGCGGCG 
CTGTCGCGAG CGCCCGGGTA CGCGGGCGAC TGCGTGGACG CTCGACGGAG ACGCGATTTG 
TTTTTTTCCA AGTTGCTCAT GCCCGCGGTG CACACGATGG GGGCGGACGG GTGGGATCTC 
GCGTTGGCGG TGCAGAAGCT CGCGGCGGGG ACGTCGCCGA GGGAACCGTA CGACGACGCG 
GCGGCGCGGA ACGAGGCTGG AGCGGACGGG GGCGGTGACT CACCGGGCGG TGACTCACCG 
GCTCCGTCGG CGTTTGAGAG GGACTTTGCG ACGATCAAGG AGGGCTGCGA CGCCATCCTT 
CGGGCGATTC GCGAGGTTGA CGAGGCTGCG CTGCCCACGG TGCCCCCGCC CAAGCCCGCG 
AAGGGTCAGA AGACCAAGCT ACAGATGGAA GGGACGGGCG TCGCGGGGAT TAAGGGTTCG 
AAGGAGTCGC GAGACGCCGC CGCCAACGCC GCCGGGATCA AGCGACCCAT GACGGCTTTC 
TTCATATTCT CGCAGGAGCA ACGGGCGTTG TTAATCGAGC AACGCCCCGA ACTCCGCACG 
AACATCTCGG CGGTGGGTAA GCTGATGGGC GAGCGGTGGC GGAAGCTGTC CGACGAGGAG 
AAGTTTCCCT ACGCAATCAA AGCCGAGGAG GCGAGACACG AGTACGAAAT CGCCGCGACA 
AAGGCGGAGG AGGAGGCTCA CGCGGCGGCG AAGGCTCGCG AGGAGGCGGA GATGGCCGCG 
CTGGCGCAGG AACGCGCGGA CGCCGAGGCT AAGAAGGCGG AGGCGGCAGC CGCGCTGGAA 
CGCGAGATGG CGGAGGCTGC GGAGAAGGGG ATCATACTGC AGGTGTACGG AGCGGGGCGA 
AAGCCTCGAA AACCGAACCA GTCATCGCTC AAGTCACACA AGCGCCGACA CTTTCGCATG 
CACCCAAAGG GAATCGGGAT CGTGTGCATA CGTCCCGAGG GGTTACCCCC CGGGACGTAC 
ATTCAGGATT ACCTCGGCGA GCTGTACTCG CCGTGGCGGT GGTTCGAGCG ACAGGACGCC 
ATCAAGAAGA GGGAGCCCGA CAAGGAGCTC CCGGATTTTT TCAACATCAC TCTGGAGAGA 
CCCGCGGAGG ACGCCGCCGG TCACGACGTG CTGTTCGTGG AGGCGGCGCA CAGGTGCACG 
TTCGCGTCTC GGCTCTCGCA CTCGTGCGCG CCCAACTGCC AGACGGTGGG CGTCGCGGTG 
GCGGACCAGA CGGACCAAAA GTTGGACCAA AAGTTGGACC AAAATAATTT GGACCAAAAG 
TTGGGCCAAA CCGCCGACCC GCCGCGGACG AAGCTGTCCA TCGCGCAGTA CACGACGAGG 
CACGTGTCGT ACGGAGAGGA ACTCTGCTGG AACTACAGCT GCGTCACCGA GTCCGAGAAG 
GAGTACCGGG CCGCCATATG CCTGTGCTCG TCGACGACGT GCAAAGGCGC CTTTTTGGAC 
TACGCCGGAT CATCCGCGTT CACCGCGGTG ATGAACGTCC GGCACAATTT CCTCGACCGC 
AACGCCCTGT TGATCCGCGC GTGCTCCGAG CCCCTCACCT CAGACGATCG CGCGAGGCTG 
GCGACGGCGG GGATCAAGAG CGCGGCGCTG ACGATGCCGG GGGAGCGGAC GCGAACCGGC 
GAGCGGGTAG AGTGCCCGGA ATGGCTCATC AAGTGGGCGT CGCTCACCTT GGAGTACATC 
GAGATGGAGA AGGAGCTGCT GCCGGCGGCG CTGACGGCCA AACCCATCGA CGGCATCGTG 
TACGACGCCG GGTTCGCCGC GGCGACCGCG GCGGGAGTCG TCGCCACGAG GATATCCAAC 
CTGGTGGTGA CGCTCGACAA GATCAAGTAC GTGATGCGGC AGCCCGGTCA GAACCGCGCG 
CCTTTTCTCC GACACCTGTC CGACAACGAG GTCGTGGACC ACCTGTTGGG GGACATCCTC 
AAGCGAGCGG CGGACACGTT CGCGAAGAAG GTTGGCGTCA AAGCCGGGTT GCCGTTTTTC 
GGAGGAAAAG GCGCGAGAAA CGCGGGCGCC GAGGCTAAGA TGCCAGCCGC GGTTGGACAG 
AGGGAGGGGG ACGTCCTGAG GTTCATCCTC GGCGTGTTGG CCAAGCCCCC GTCCGAGTTT 
ACCCCCCAGG AGGCTTCGCA AACCCTGGAA ACGTGCTCGC GGAAGATTCG CGATCTCGGC 
GCGGTGCACT GCGCGATGGC GGATCTGCTG CTCCTCTACG CGAGGACCGC GCACTGGTGC 
ACTCCCGAGG CGTACGCGGG ATTCCAATCG CCTCCCGTGC GACTGGTGCC GCTGCCCAAG 
GACAAGCTGG GTGGGCGGGA TCGACGGCTT AAGGACGGGA ACGACGCAGG GGACGCAGCA 
GAGGGGACGA CTGTTCAAAT TCCCGAGGGG ACGACTGTTC CAATTCCCGA GGGGACGGTT 
CAAATTCCGC ATCCGCCTGA CGGCGCTTCC GACGCCGCGC CGACGAAATC GACGCTCGCG 
AACGGCCGCA AGCTCCCGGC GGTGTTCAAG GGCAACATCG ACAACGTCAT GAAGAAGAAG 
TACCAGCCGC ACTTCGCGTG GGGCCAGCTC GTGTCGTGGT TCAAGCAGAC CATCTACGAC 
CCCTCGGCGT CTTTATCCGC CGAGCGAAGG GGCGCCATGT CCTTACCCGA CCCCGAGAGC 
GCGTACGGCG ACAAGAACTA CGTCACCGGG GACAGGCGAT CGATGCTGCG ACAGATCGCG 
AGGGATCCGA GCAAGATGTG GCCGACGACG TGGGCGTGGT CGTTTCGTAA CCCCGGCAAG 
GTGTACGGAT CTCCGTTCAT AGACGACGCG ATTAGAGCGG CCAAGGGGGA GGAGCGGACG 
CTGCCGGGGC TGCTGGAGGA GCTGAGGGCG GTTTTAGCGG AGGAAGGGGG GGGAGCGGGG 
GATGGCGCGA CGGGGGCGGC GGCGGCGGGA GGGAAGAAGA GGAAGAAATA G

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
4141
4201
4261
4321
4381
4441
>SDG20130 | Micromonas pusilla NOUM17 | orf sequence | SET domain proteins
ATGGAGAAGC GCCCCGCCGA GGACCCGCCC GCGGGGGAGG ATCCTCCCAA GGCCCCTCGC 
GTGGGTCCCA CGGCCGACGC GGACACGCAA ACCACGGGAG GGTACCTCCC GCCCGCCCCG 
CCCGCGAAGG TCGACGCCTC CGTCGCGGCG TCGGAGGGTG TCGCCTCGGC TTCGCCCTAC 
GGCCCCACCG CGCGCCTGGT GCCCGAGGCG ATGATCACCG CGGCGGTGGC GGCGATGCGC 
CGGGAGGACT TCCCGCGGAA AACGACGGAA AACGACGAGG ACGAGGCGCC CGCGATGCGC 
ACGGAGACGC CGGACGAGAA GCCCCCTCCT CCCGCCAACG AGGAGACCCC TCCGGAGACG 
GAGCCGACGC CGATGGAGGA TGATGGCGAG GAGGTTGTCA TCGAGGAGCG CGCGTGGACG 
GGAGGAGAGA CGCTTTCGAC GGCGCCCGCG TCCTCGTCGC CTCACGCGCG CCCCGCGCCG 
CCGCACCCCC CGATATCCGC TCCCGCGATC GTGCACGAGC CTCCCGCGCG CGCTCCGCCG 
CCGAAGGTGT TGCTCGAAAG TAGGCGAGGA CCTACAACCC AGGAAGCCCT GGATGCGCTG 
AAAGAAGAGT TGAGAACGCA TCGGGAGCGG GACCCGCACT GGCGCTTCCT GCAAGCTCAC 
CAGTTCAAGA CGGCGTGGGC GGAGGAGTGG CCAGGCGGTT TGTGGATTGC TAACCGTGTC 
GGTGATGGTG AAAGAGCGAG GAGATACCGT GCGTCGATGT CGGAGGCAGA GCTCGAGTTG 
CTCAACGAGC TGATCTCATC CACCGCCGCG GCGCCCCGCA AGCCCCTCTG GACCGTGCCC 
GGCCTGGACC CGGCGTCGCT GTGGAAAGCC ATCCAGCGCG ACCCGGATCC GTTCGCGCGC 
GACATCCTAC GGGACCTCGA ACGCGCCGAA CGCACCGAAC CCTACCTCCA GGTGGGAGAC 
CGATGGCGAA GAGAACCGTG GGAGGTGAAC CCCTACGAGC ACATCGGTCC GGGGTTCGTC 
GTCGCCGACG ACTTGCCGGA ACCCGACGCT TCGGAGGTTT GCGAGAAACT TCACACGACG 
ATGCCGCGCG GGTGGACGGA CTTACCCCCG GAGCCGCCGA GCGCGATGCT GAACGGCGCG 
AAGATGGCCG CCGCTGTGGA CGATGTCAAA TGCCTCGAGT GCGGTCGCGC CGACGGCGAA 
GCTGATTTCG TGCTCTGCGA CGGGTGCCCC GACGACGACG TCAGGGGCGG GCACTGGCGG 
TGCCTCGGCA TGGCGCGCTT ACCCACCGGC GATTGGTTCT GCGATCGATG CGTGACGGAC 
GGCAAGGGGA CGAACGACGA CGCGATGTAC GGCGATGCGG GCGACGACGG GGCCGGTGAC 
TCATCGCCCG GTGACTCACC ACCTGTGCCG TCGATCGCCC CCATCGTCCC TCTGCATCCG 
TACGTCCCGA ACGCGAAGCC GCGGGTCCAC GACCGAGTTC CGCTGCGACT GGGCGTGGAC 
GTGGAGGAAC GGCCGATGTG GGGGTGCGAC TGCTACACGC GCGTCGCCGT CGACGCGGCG 
CTGTCGCGAG CGCCCGGGTA CGCGGGCGAC TGCGTGGACG CTCGACGGAG ACGCGATTTG 
TTTTTTTCCA AGTTGCTCAT GCCCGCGGTG CACACGATGG GGGCGGACGG GTGGGATCTC 
GCGTTGGCGG TGCAGAAGCT CGCGGCGGGG ACGTCGCCGA GGGAACCGTA CGACGACGCG 
GCGGCGCGGA ACGAGGCTGG AGCGGACGGG GGCGGTGACT CACCGGGCGG TGACTCACCG 
GCTCCGTCGG CGTTTGAGAG GGACTTTGCG ACGATCAAGG AGGGCTGCGA CGCCATCCTT 
CGGGCGATTC GCGAGGTTGA CGAGGCTGCG CTGCCCACGG TGCCCCCGCC CAAGCCCGCG 
AAGGGTCAGA AGACCAAGCT ACAGATGGAA GGGACGGGCG TCGCGGGGAT TAAGGGTTCG 
AAGGAGTCGC GAGACGCCGC CGCCAACGCC GCCGGGATCA AGCGACCCAT GACGGCTTTC 
TTCATATTCT CGCAGGAGCA ACGGGCGTTG TTAATCGAGC AACGCCCCGA ACTCCGCACG 
AACATCTCGG CGGTGGGTAA GCTGATGGGC GAGCGGTGGC GGAAGCTGTC CGACGAGGAG 
AAGTTTCCCT ACGCAATCAA AGCCGAGGAG GCGAGACACG AGTACGAAAT CGCCGCGACA 
AAGGCGGAGG AGGAGGCTCA CGCGGCGGCG AAGGCTCGCG AGGAGGCGGA GATGGCCGCG 
CTGGCGCAGG AACGCGCGGA CGCCGAGGCT AAGAAGGCGG AGGCGGCAGC CGCGCTGGAA 
CGCGAGATGG CGGAGGCTGC GGAGAAGGGG ATCATACTGC AGGTGTACGG AGCGGGGCGA 
AAGCCTCGAA AACCGAACCA GTCATCGCTC AAGTCACACA AGCGCCGACA CTTTCGCATG 
CACCCAAAGG GAATCGGGAT CGTGTGCATA CGTCCCGAGG GGTTACCCCC CGGGACGTAC 
ATTCAGGATT ACCTCGGCGA GCTGTACTCG CCGTGGCGGT GGTTCGAGCG ACAGGACGCC 
ATCAAGAAGA GGGAGCCCGA CAAGGAGCTC CCGGATTTTT TCAACATCAC TCTGGAGAGA 
CCCGCGGAGG ACGCCGCCGG TCACGACGTG CTGTTCGTGG AGGCGGCGCA CAGGTGCACG 
TTCGCGTCTC GGCTCTCGCA CTCGTGCGCG CCCAACTGCC AGACGGTGGG CGTCGCGGTG 
GCGGACCAGA CGGACCAAAA GTTGGACCAA AAGTTGGACC AAAATAATTT GGACCAAAAG 
TTGGGCCAAA CCGCCGACCC GCCGCGGACG AAGCTGTCCA TCGCGCAGTA CACGACGAGG 
CACGTGTCGT ACGGAGAGGA ACTCTGCTGG AACTACAGCT GCGTCACCGA GTCCGAGAAG 
GAGTACCGGG CCGCCATATG CCTGTGCTCG TCGACGACGT GCAAAGGCGC CTTTTTGGAC 
TACGCCGGAT CATCCGCGTT CACCGCGGTG ATGAACGTCC GGCACAATTT CCTCGACCGC 
AACGCCCTGT TGATCCGCGC GTGCTCCGAG CCCCTCACCT CAGACGATCG CGCGAGGCTG 
GCGACGGCGG GGATCAAGAG CGCGGCGCTG ACGATGCCGG GGGAGCGGAC GCGAACCGGC 
GAGCGGGTAG AGTGCCCGGA ATGGCTCATC AAGTGGGCGT CGCTCACCTT GGAGTACATC 
GAGATGGAGA AGGAGCTGCT GCCGGCGGCG CTGACGGCCA AACCCATCGA CGGCATCGTG 
TACGACGCCG GGTTCGCCGC GGCGACCGCG GCGGGAGTCG TCGCCACGAG GATATCCAAC 
CTGGTGGTGA CGCTCGACAA GATCAAGTAC GTGATGCGGC AGCCCGGTCA GAACCGCGCG 
CCTTTTCTCC GACACCTGTC CGACAACGAG GTCGTGGACC ACCTGTTGGG GGACATCCTC 
AAGCGAGCGG CGGACACGTT CGCGAAGAAG GTTGGCGTCA AAGCCGGGTT GCCGTTTTTC 
GGAGGAAAAG GCGCGAGAAA CGCGGGCGCC GAGGCTAAGA TGCCAGCCGC GGTTGGACAG 
AGGGAGGGGG ACGTCCTGAG GTTCATCCTC GGCGTGTTGG CCAAGCCCCC GTCCGAGTTT 
ACCCCCCAGG AGGCTTCGCA AACCCTGGAA ACGTGCTCGC GGAAGATTCG CGATCTCGGC 
GCGGTGCACT GCGCGATGGC GGATCTGCTG CTCCTCTACG CGAGGACCGC GCACTGGTGC 
ACTCCCGAGG CGTACGCGGG ATTCCAATCG CCTCCCGTGC GACTGGTGCC GCTGCCCAAG 
GACAAGCTGG GTGGGCGGGA TCGACGGCTT AAGGACGGGA ACGACGCAGG GGACGCAGCA 
GAGGGGACGA CTGTTCAAAT TCCCGAGGGG ACGACTGTTC CAATTCCCGA GGGGACGGTT 
CAAATTCCGC ATCCGCCTGA CGGCGCTTCC GACGCCGCGC CGACGAAATC GACGCTCGCG 
AACGGCCGCA AGCTCCCGGC GGTGTTCAAG GGCAACATCG ACAACGTCAT GAAGAAGAAG 
TACCAGCCGC ACTTCGCGTG GGGCCAGCTC GTGTCGTGGT TCAAGCAGAC CATCTACGAC 
CCCTCGGCGT CTTTATCCGC CGAGCGAAGG GGCGCCATGT CCTTACCCGA CCCCGAGAGC 
GCGTACGGCG ACAAGAACTA CGTCACCGGG GACAGGCGAT CGATGCTGCG ACAGATCGCG 
AGGGATCCGA GCAAGATGTG GCCGACGACG TGGGCGTGGT CGTTTCGTAA CCCCGGCAAG 
GTGTACGGAT CTCCGTTCAT AGACGACGCG ATTAGAGCGG CCAAGGGGGA GGAGCGGACG 
CTGCCGGGGC TGCTGGAGGA GCTGAGGGCG GTTTTAGCGG AGGAAGGGGG GGGAGCGGGG 
GATGGCGCGA CGGGGGCGGC GGCGGCGGGA GGGAAGAAGA GGAAGAAATA G

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
>SDG20130 | Micromonas pusilla NOUM17 | protein sequence | SET domain proteins
MEKRPAEDPP AGEDPPKAPR VGPTADADTQ TTGGYLPPAP PAKVDASVAA SEGVASASPY 
GPTARLVPEA MITAAVAAMR REDFPRKTTE NDEDEAPAMR TETPDEKPPP PANEETPPET 
EPTPMEDDGE EVVIEERAWT GGETLSTAPA SSSPHARPAP PHPPISAPAI VHEPPARAPP 
PKVLLESRRG PTTQEALDAL KEELRTHRER DPHWRFLQAH QFKTAWAEEW PGGLWIANRV 
GDGERARRYR ASMSEAELEL LNELISSTAA APRKPLWTVP GLDPASLWKA IQRDPDPFAR 
DILRDLERAE RTEPYLQVGD RWRREPWEVN PYEHIGPGFV VADDLPEPDA SEVCEKLHTT 
MPRGWTDLPP EPPSAMLNGA KMAAAVDDVK CLECGRADGE ADFVLCDGCP DDDVRGGHWR 
CLGMARLPTG DWFCDRCVTD GKGTNDDAMY GDAGDDGAGD SSPGDSPPVP SIAPIVPLHP 
YVPNAKPRVH DRVPLRLGVD VEERPMWGCD CYTRVAVDAA LSRAPGYAGD CVDARRRRDL 
FFSKLLMPAV HTMGADGWDL ALAVQKLAAG TSPREPYDDA AARNEAGADG GGDSPGGDSP 
APSAFERDFA TIKEGCDAIL RAIREVDEAA LPTVPPPKPA KGQKTKLQME GTGVAGIKGS 
KESRDAAANA AGIKRPMTAF FIFSQEQRAL LIEQRPELRT NISAVGKLMG ERWRKLSDEE 
KFPYAIKAEE ARHEYEIAAT KAEEEAHAAA KAREEAEMAA LAQERADAEA KKAEAAAALE 
REMAEAAEKG IILQVYGAGR KPRKPNQSSL KSHKRRHFRM HPKGIGIVCI RPEGLPPGTY 
IQDYLGELYS PWRWFERQDA IKKREPDKEL PDFFNITLER PAEDAAGHDV LFVEAAHRCT 
FASRLSHSCA PNCQTVGVAV ADQTDQKLDQ KLDQNNLDQK LGQTADPPRT KLSIAQYTTR 
HVSYGEELCW NYSCVTESEK EYRAAICLCS STTCKGAFLD YAGSSAFTAV MNVRHNFLDR 
NALLIRACSE PLTSDDRARL ATAGIKSAAL TMPGERTRTG ERVECPEWLI KWASLTLEYI 
EMEKELLPAA LTAKPIDGIV YDAGFAAATA AGVVATRISN LVVTLDKIKY VMRQPGQNRA 
PFLRHLSDNE VVDHLLGDIL KRAADTFAKK VGVKAGLPFF GGKGARNAGA EAKMPAAVGQ 
REGDVLRFIL GVLAKPPSEF TPQEASQTLE TCSRKIRDLG AVHCAMADLL LLYARTAHWC 
TPEAYAGFQS PPVRLVPLPK DKLGGRDRRL KDGNDAGDAA EGTTVQIPEG TTVPIPEGTV 
QIPHPPDGAS DAAPTKSTLA NGRKLPAVFK GNIDNVMKKK YQPHFAWGQL VSWFKQTIYD 
PSASLSAERR GAMSLPDPES AYGDKNYVTG DRRSMLRQIA RDPSKMWPTT WAWSFRNPGK 
VYGSPFIDDA IRAAKGEERT LPGLLEELRA VLAEEGGGAG DGATGAAAAG GKKRKK
FASTA view