ATGCTGACCA GCGACTGTGC AGCCACTCCT CGTTGGTGTG AGCTAGCAGG CACGCTTTGC
CTCAGCGCGT GTCTCTGGGA CGGAGGGGTG GCACGCAACG CTGCGGCCGG GCCTCGTGCC
GTGCATGCAG GGTGCCTGCA GCTCCAGGGG TGGCGAATCA GCGATCGTTC CTCTCCTCAG
GAGAAAGGCA GCGAGGTATG CGCTGGCAGA GGTACTCCTA CCTTGTTAGG AAACCTCTTC
CGAGAGGATT T
GCTGCTGT TAGCAGACAT AGTGCTCTTT
CTAACTGGAA AAGCTGCTTT GTCCTTGAGT TCTGCAAAGG TTGAGAGTCA TCGTCATTTT
AGCCCTGTTA ATAGACCTAT TCATTTTGAC AAAGGCTTAG ATATTTATCA GGCGTCACAC
AAACTGAATC CATACCGAGC AAACA
gtacaa
atctctgtat tatcccattc attttttcac tttcccacag tcattctcat ctttctcact
tacatatcca tacccatcta tctctagcaa ctctctgcat cctgtctcct cacctattct
tcctagacat tctccctata tgtttcccct ggtccttccc tctcttccta tca
ATAGGAATGG AATTACATGG GAATGGTAAT GAATGTGGTC GCAGGCCTTT ATCTGTTGAA
CTTTCCTCTC CTACAAAGCT GTG
TACATGGGGC
AACTTCCACT TCAAGACTTT TGATGGAGAC ATCTTCCGCT ACCCTGGGCT CTGTAATTAT
GTTTTTGCAT CCCATTGCAA CGCTCCCTAT GAAGAGTTCA ACATCCAAAT TAGGCGTACA
CTGGTGGAAA ATATTCTGAC AATCCACCGC ATTGCCATAA AACTCGAAGG CGTAGCTATT
GAGCTAACAA AGGATGTTGT
CCTGATCAAC AGCAACAG
AGTT CAACTGCCAT ACAGTCAAGC TGGAATTACT
ATAGAGAGAA GCAGCATTTA TGTGAAAGTT GTCAGCAAAA CGGATATTGT ATTTATGTGG
AATCAACGTG ACAGTATCTT G
CTAGA
ACTGCATGAG AAATATGCCA ATCAGACTTG TGGTCTTTGT GGGGACTTCA ATGGCATTCC
AGTTTACAAT GAGTTCTATT
CAAACA
AT
ATCAGAATGA CAGCCTTACA GTTTGGGAAT ATGCAGAAAA TGGATGGGCC GACAGAGTAC
TGTGAAGACC CAACTCCTTC
CTCACTGGAT AATTGCACAT ATAACTTT
CAC TGTGAAGATG AAATATTCAC CATCTTGGTA GAACTACGCA AATGTGGCAT
GACGAACACT GAAACGTGCT
TAAAGACAGT GACCCTCAGC ATGAATAAAG GACAAACG
CT CATTGAGGTC AAAACTGATG GTGGTGTGTT TATGAACTCA ATCTACACCC
AGCTGCCGAT CTCAGCAG
C CAACGTCACC
ATCTTTAGAC CTTCATCATT TTTCGTTATC ATGCACACAA ATATTGGCAT CCAGGTTGAA
ATCCAGATCA CACCCATCAT GCAAGTGTTT GTGCGATTGG ACCCTGTGTT CAAAGACCAA
ACCTGTG
GTCTTTGTG GAAATTTCAA TAATGTCCAA
ACTGATGACT TTAAGGCCAT AAGTGGAGTA GTTGAAGGAA CAGCAGCAGC ATTTGCTAAT
ACATGGAAAA CTCAGGCTTC TTGCCCTGAT ATCCAGAGCA GTTTTGAGAA CCCTTGTGCA
CTCAGCATTG ATAATG
AAA AATATGCTAA GCATTGGTGT
GGACTACTGA CTAACAGTGA AGGACCTTTT GCTGATTGTC ATTATCTACT GAACCCCTCT
ACTTACTACA CG
AACTGCATGT TTGACACGTG TAACTGTCAA AACAGTGAGG
GCTGTCTTTG TGCAGCCCTT TCTTCCTATG TGAGAGCTTG TGCTGCAAGA GGAATCCAAA
TAAATGGATG GAGGACTAAC
GTCTGCT
C AAAGTACACC ACCTCATGTC CCAAATCCCT AAGTTACAGC TATACCATCA
ATTCATGTCA ACCCACCTGT CGCTCTCTGA GTGAGCCTGA TGTCACATGC AAAATTAAGT
TTGTCCCAGT TGATGGCTGC ACCTGTGTAA ATGGAACCTA CATGGATGAA AGTGGCAAAT
GCGTGCCTGC TAATGAATGC CCCTGCTACT ACCGAGGATC TCCCATACCA CTTGGAGAAG
TTGTCCATGA AAATGGGCTT
GTATG
CACACAGTG TGTATCTGGC TGTGTGTGCC CATCTGGGCT TGTTTTAGAT
GGGAAGGGTG GTTGTATTCC TGAAGAGGAA TGTCCATGTA TTCATAACGG AGCGATGTAT
CAACCCGGGG AAAAGATCAA
TGCTGACTGT AATACCTG
TGTGTG CAAGGATAGG
ATGTGGGAAT GCACCAAAAA ACAATGTCTG GGTACATGTG CTGTTTATGG AGATGGACAT
TATACAACCT TTGATGACCA AAGGTTCAGC TTCAGTGGAA ACTGTGAATA CACACTAGTC
CAG
AAC CATTGCGGGA AGAGTGACAC AGTGAATGGG
ACCTTCCAGG TTATCACTGA AAATATTCCC TGTGGCAGCA CTGGGACAAC TTGCTCAAAA
TCTATCAAAG TTTTCTTAGA G
CACTATG AACTGGTACT CAGTGAGGAG
CGTGTCAGTG TGATAAAGAG AGGACAAGAT GGTGAAGTAC CATTCACAGT CCGCTATATA
GGTATGTACA CAGTGATTGA CACCACCAGC GGACTGATCC TCATGTGGGA CAAGAAAACC
AGCATCTTCA TCAAGCTGAG CCCTGAATTT
AAG
G GCCAGATCTG TGGCCTCTGT GGAAATTATG
ATGGCAACGG CATCAATGAC TTCACTACAA GGAGTCTGTC TGTAGTGGAA AACGTCCTAG
AGTTTGGAAA CAGCTGGAAA GTATCCTCTA CATGTCCTGA TGCTAGCTTC GTAAAGGACC
CATGCTCTAC CAACCCTTAT CGAAAGTCCT GGTCAGAGAA GCAGTGTAGC ATCATCAACA
GCAATGTCTT TGCTGCCTGT
CACTCACAG
GTTGAACCAG CAAAATATTA TCAAGCATGT GTGACAGATG CTTGTGCCTG
TGACTCCGGA GGAGACTGTG ATTGCTTTTG TACAGCAGTA GCTGCCTATG CTCAAGCATG
TAATGAAGTT GGTGTCTGTA
TAGCCTGGAG AACCCCATCA ATCTGTC
G CTGCTACCCC AACTGCCCAC CAGACAAGCC
TTATTTTCAC GAGGATCAGA TGAAGTGTGT CAGTCTCTGT GACTGTTATG ATAATGAAGA
CGGGAAGATA TATATGCGGG
ATAAATGTCA ATTATG
CGAG TGCACATCAG AAGGTTTACA
GTGCAAGTTT GATGATAAAG
CA TGTAACTGCA TATATGAAAA CAACACCTAT
AAATACAATG AACTCATCTA CACCACTACA GATGGAACAG GATGGTGTTT CAGTGCAACA
TGTGATGTCA ATGGAACAAT AAACAGAAAT ATATTTAAAT GTGAAATCAC AACTACCCCA
TTTACATTTT CTACCAGTCA
ACCTGCAACA G
AAACCA CAACACCAGC ACAAACTGTA TGCGTGAAGT
ATGTATGTGA GTGGTCACAA TGGTATGATG GGACACAGCT TGAAAATGAA AAGGAAACAA
TTCAAAATAT CGCAGGGAAA GGAATAAACA CTTGTAATAA CACGAGCATA ATTGAATGCA
GAGTACATGA ACGCTCAGAT AAGACAATAA ACTATCTAAA CCAAACGATA CAATGCAATA
AAACCAATGA AGTGAAATGC AATAACGAAG ACCAAGATGA CAAAAAATGC TACAATTACG
AAATCAGA
ACCAC CACCACAAGC ACCACGGTAC TCACCAAGAC CACCAGCCCT CCGACAACAG
GCCCAACCAC AACCACAGCT GGAAGTACAA CCAAGTCTGT TCCCGTTTCA ACAGGAGGAA
CGAGCACCAT CACATCCACA ATTTTACAGA GCACGGTACC TACCAGCAGT GCCAGCACCA
CCAAGACCTC ACCCCCTCCA
ACAAAAG
ATTTCCC AAGCGCTGGT AGCCAGGGTG GAGACTTTGA GACCTTTGCA AATATCCGTG
CCGCTGGTTA TGACCTGTGC TCCCAGCCAC AGGACATCAG GTGCCGTGCT GAAAACTACC
CTGACAGGAG CCTGCAGAGT TTGGACCAGA AACTCAAGTG CAGCCTCGAT GTTGGACTTG
AGTGCAGAAA CAAGGATCAG AACAGCAAGT ACCCCATGTG CTTCAACTAC CAGGTCAGTG
TGCTGTGCTG TGACCACCAG CACTGCTTGA CAACATTAAC AACGCCCGGC ACCACGACAA
GCACCACAAC TGGCACCACT ACAGCAACGC CGACAACTGG ACTACCGACT ACAACAATAC
CTACTGTCTC TGCTGGCTCC ACGCTTTCCG AGACCACCAC CACAAGCACC ACGGTAGTCA
CCAAGACCAC CAGCCCTCCG ACAACAGGCC CAACCACAAC CACAGCTGGA AGTACAACCA
AGTCTGTTCC CGTTTCAACA GAAGGAACGA GCACCATCGC ATCCACAATT TTACAGAGCA
CTGTATCTAC CAGCAGTGCC AGTAGCCCCG TGACCAGTCC TGTTACAGAA AGCACAGCCA
CCACTATCCT CGCACCTACA CAGACTACTG CAGCTGCCAG CAGCTCTGCC ACTACCAGGG
TTTCTCCTAT TTTGACAGGA CATTCAACTG CAATTGCAAG TACCCGTACC AACAATGCCA
GCACTACCAA AATATCACCC TCTGTGACAG TATTCACTAT CCCTTCATCT GCCAGCTCAC
TGCCCAGCTC TTCAACTATA CTCACTCCTA CATCAATCTC CACTGTAGTA AAACAGACTA
CAGCACAGTC CCCTACTTCA
AGCCGTTCTG
C
TGGTACTTCA CAGCCAAGCC ACATTACAAG CACTTCTGCA TTCTCCAGTT CCTCCCTCAC
CACACCTTCC ACAACAGTGG CAAGTGGAAC GACTCCATGT TTTTGTCACG TGTCTGGTGG
TTTGTTTTCT CCCG
G CGAAGTTGTA
TACAACAGAA CAGATAAGGC TGGATGTAAT TTTTATGCAC TTTGCAGCAA TGAATGTGAA
ATTGCCCCAT TTCAGGGGCC ATGTCCTTCA ACAACTCCTT TAACTTCGGT GTCTTCAGAG
ACTACAACAC AAGCCTCCTC ATCCACAGAA CATATATCTT CTTCTATTTC ATCAACCACA
GCAACATCTC TCACAACTTT ACTAGAAAGA AACTGTACTG ATGTCAATCC TCCACGAAAG
CCT GGAGAGAAAT GGACAAGTCC GTGCCAGGAG
TGTGTCTGTG ATCCACTCAC TGGTACAGCG CAATGTCGGC CACTCCTATG CCAAACACCT
CGGGAAATAG TTTGTGACCC AGGATTTATT GCCTTGCCTG TAGTACCAGT AAAAGACCTG
TGCTGTCCTG AACTTGAATG CC
GGGAA AGTTCCCACA
GCTGTGTGAT TAATGGTAAA ACATATATG
AATGGA ACATCTGTAA TTGATTCATG
TAGAAATTGT ACCTGCAGTT CTGAGAAAGA TCCATTGACT AAGAAAAACA TTGTGAAGTG
TGAAGCACTT AAATGTGAAT
CATCTTGCCC ACTG
gg ttccgaatac aagattgagg atggagagtg
ctgtgggagg tgtattgaag tagcatgcag aattcagcta agtaacaaca ctgtttatgt
gctcgaa
GTTG GTAAGACTCT GCCAATTGAT CTCTGCAGCC AGTACAAGTG
TGAAATAATT GAAGATCAAT ATGTTGCAGT TCAAAGTAAA AAAGTATGTC CCGACTACGG
TTCACTGGAG TGCGATCCT
GATGAAGCAG
AGACTACTCC TGATGGCTGC
TGCAAGATAT GTAAAC
CTAC TAACTGCAAA GCTTACAGCA CAAAAACTAT AATTCAACAC
GAAGACTGTG AATCTTCTGA AGCTGTTGAG TTGACATATT GTGAAGGGAC TTGTCCTGGC
TCTTCCAT
GTATTCTCT
TAAAGCAAAC CAGATGCAAC ATGACTGTAC CTGCTGCCAA GAGCTCAGCA GCCACAGAAG
AGATGTGACC CTGAACTGCC AAAATGGAAC AAGCATAAAT TATAGCTATG TCTATGTGGA
TCAATGCCAG TGTATAAATG CATGCAACCC ACAAACCCTT GAAGCAGATG ATTACCAAGC
AGAAATCAGC CAAAGATGGA CA
Note: The red part (3rd exon
other than 1st nucleotide in it) might be different. This exon is just a hypothetical one to prevent frameshift which both follows gt-ag
rule and keeps same amino acid with GENSCAN00000071260 (ensemble chicken
version 31.5k). The first red
nucleotide T might be C but cannot be A or G for Ser can only be coded with AGT
or AGC (if the first nucleotide of the triplet has to be A).