>HOMSA|gi|5729790|ref|NP_006556.1| transcriptional repressor CTCF isoform 1 [Homo sapiens] -----------------MEGDAVEAIVEESETFIKGKERKTYQRRRE------------- ---------------GGQEEDACHLPQNQTDGGEVV------------------------ -QDVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVAP---EAE- -----------------AAVDDTQIITLQVVN--------------------------ME EQ----PINIGELQLVQVPVPVTV---PVATTSVE----------------------EL- QGA-------YENEVSKEGLAESEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELP----------------------------PQEDPSWQKDP-------DYQP---PAK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCAGPD------------------GVEGENGGE--------T----KKSKRGRKRKMRS KK-EDSSDS-ENA---E-------PDLDD--N------EDEEE--PAV-----EIEPEPE P--QPV-TPAPPPAKKRRGRPPGR-TNQPK------------QNQPTAIIQVEDQNTGAI ENIIVEVKKEPD---------------AEPAEGEEEEAQPAAT----------------- ---------DAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >PELSI|ENSPSIP00000003796 pep:novel scaffold:PelSin_1.0:JH205822.1:16124:37080:-1 gene:ENSPSIG00000003593 transcript:ENSPSIT00000003816 gene_biotype:protein_coding transcript_biotype:protein_coding -----------MEQGDKMEGEAIEAIGEESETFIKGKERKTYQRRRE------------- ---------------GGQEEDACHMPPNQADGAEVV------------------------ -QEVSGGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAE- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL- QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCSGPD------------------GVEGENGGE--------P----KKGKRGRKRKMRS KK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPE V--EPV-APTPPPAKKRRGRPPGK-ANQPK------------QPQPAAIIQVEDQNTGAI ENIIVEVKKEPD---------------AETV-GEEEEAQPAAV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >ORNAN|ENSOANP00000007536 pep:known supercontig:OANA5:Contig28397:1521:4779:1 gene:ENSOANG00000004762 transcript:ENSOANT00000007538 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ -----------------------------------------------------QVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCTGPD------------------GVEGENGGE--------T----KKGKRGRKRKMRS KK-EDSSDS--------------------------------------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >gi|171474905|gb|ACB47393.1| CCCTC-binding factor [Pogona vitticeps] -----------------MEGEVVEAVGEESETFIKGKERKTYQRRRE------------- ---------------GGQEEDACPMPPNQADGSEVV------------------------ -QDVNAGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVQQ---EAE- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVSV---PVATTSVE----------------------EL- QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQS ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PTFVPAAFVCSKCGKTFTRRNTMARHA DNCTGPD------------------GVEGENGGE--------P----KKGKRGRKRKMRS KK-EDSSDSEENA---E-------PELDN--N------EEEEE--TAI-----EIEAEPE V--EPV-APVPPPAKKRRGRPPGK-SNQPK------------QTQPTTIIQVEDQNTDAI ENIIVEVKKEPE---------------AETV-GATAGTQPAAA----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|327281289|ref|XP_003225381.1| PREDICTED: transcriptional repressor CTCF-like [Anolis carolinensis] -----------------MEGEVVEAIGEESETFIKGKERKTYQRRRE------------- ---------------GGQEEDVCSMPPNQADGTEVV------------------------ -QDVNTGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVQQ---EAE- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------DL- QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQA ELQ----------------------------PQEDPGWQKDP-------DYQP---PAK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCTGPD------------------GVEGENGGE--------P----KKGKRGRKRKMRS KK-ENSSDSEENA---E-------PELYDIEE------EDEEE--TAV-----EIEAEPE IEAEPV-APPPPPAKKRRGRPPGK-ANQPK------------QPQPTAIIQVEDESTGTI ENIIVEVKKEPE---------------AETV-GVAAGAQPEAV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >MONDO|ENSMODP00000007129 pep:novel chromosome:BROADO5:1:685282646:685300408:1 gene:ENSMODG00000005757 transcript:ENSMODT00000007273 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIMEESETFIKGKERKTYQRRRE------------- ---------------GGQDEDACHISQTQADGSEVV------------------------ -QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAD- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL- QGA-------YENEVAKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRS KK-EESSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPE V--QPV-TPAPPPAKKRRGRPPGK-SSQPK------------QTQPTAIIQVEDQNTGAI ENIIVEVKKEPD---------------AETVEGEEEEPQSAVV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >SARHA|ENSSHAP00000004296 pep:novel scaffold:DEVIL7.0:GL834762.1:75014:90492:1 gene:ENSSHAG00000003783 transcript:ENSSHAT00000004340 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIMEESETFIKGKERKTYQRRRE------------- ---------------GGQDEDACHISQTQADGSEVV------------------------ -QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAD- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL- HGA-------YENEVSKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRS KK-EESSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPE V--QPV-TPAPPPAKKRRGRPPGK-ASQPK------------QTQPTAIIQVEDQNTGAI ENIIVEVKKEPD---------------AEAVEGEEEEPPSAVV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >MACEU|ENSMEUP00000005228 pep:novel genescaffold:Meug_1.0:GeneScaffold_6565:184:14145:1 gene:ENSMEUG00000005728 transcript:ENSMEUT00000005742 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAVMEESETFIKGKERKTYQRRRE------------- ---------------GGQDEDACHISQTQADGSEVV------------------------ -QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVPQ---EAD- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL- QGA-------YENEVSKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXXXXXXXXXXXXXXXX-------XXXXXXX-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFV-AAFVCSKCGKTFTRRNTMARHA DNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRS KK-EESSDS-ENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPE V--QPV-TPAPPPAKKRRGRPPGK-ASQPK------------QAQPTAIIQVEDQNTGAI ENIIVEVKKEPD---------------AEAAEGEEEEPQSAVV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|396094|emb|CAA80319.1| CTCF protein [Gallus gallus] -----------------MEGEAVEAIVEESETFIKGKERKTYQRRRE------------- ---------------GGQEDEACHIAPNQADGGEVV------------------------ -QDVNSGVQ-MVMMEHLDP-TLLQMKTEVME-----------------GAVPQ---ETE- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL- QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKNKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCSGLD------------------GGEGENGGE--------T----KKGKRGRKRKMRS KK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPE V--SAE-APAPPPSKKRRGRPPGKAATQTK------------QSQPAAIIQVEDQNTGEI ENIIVEVKKEPD---------------AETVE-EEEEAQPAVV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|326927215|ref|XP_003209788.1| PREDICTED: transcriptional repressor CTCF-like [Meleagris gallopavo] -----------------MEGEAVEAIVEESETFIKGKERKTYQRRRE------------- ---------------GGQEDEACHIAPNQADGGEVV------------------------ -QDVNSGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVPQ---ETE- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL- QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKNKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCSGLD------------------GGEGENGGE--------T----KKGKRGRKRKMRS KK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPE V--EPE-APAPPPSKKRRGRPPGKAATQTK------------QSQPAAIIQVEDQNTGEI ENIIVEVKKEPD---------------AETVE-EEEEAQPAVV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >MELGA|ENSMGAP00000001773 pep:novel chromosome:UMD2:13:1721989:1735383:-1 gene:ENSMGAG00000002200 transcript:ENSMGAT00000002439 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIVEESETFIKGKERKTYQRRRE------------- ---------------GGQEDEACHIAPNQADGGEVV------------------------ -QDVNSGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVPQ---ETE- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL- QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKNKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKKGLG --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHSATVGEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCSGLD------------------GGEGENGGE--------T----KKGKRGRKRKMRS KK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPE V--EPE-APAPPPSKKRRGRPPGKAATQTK------------QSQPAAIIQVEDQNTGEI ENIIVEVKKEPD---------------AETVE-EEEEAQPAVV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >SARHA|ENSSHAP00000004295 pep:novel scaffold:DEVIL7.0:GL834762.1:75014:90492:1 gene:ENSSHAG00000003783 transcript:ENSSHAT00000004339 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIMEESETFIKGKERKTYQRRRE------------- ---------------GGQDEDACHISQTQADGSEVV------------------------ -QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAD- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL- HGA-------YENEVSKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQGERNGFCIRI------------TVDTIKN FRVSVSSGAFDRQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHA DNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRS KK-EESSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPE V--QPV-TPAPPPAKKRRGRPPGK-ASQPK------------QTQPTAIIQVEDQNTGAI ENIIVEVKKEPD---------------AEAVEGEEEEPPSAVV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|224063911|ref|XP_002196108.1| PREDICTED: CCCTC-binding factor (zinc finger protein) [Taeniopygia guttata] -----------------MEGEAVDAIVEESETFIKGKERKTYQRRRE------------- ---------------GGQEDDACHIPPNQADGSEVV------------------------ -QDVSSGVQ-MVMMDQLDP-TLLQMKTEVME-----------------GAVSQ---ETE- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL- QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQG ELQ----------------------------PQEDPNWQKDP-------DYQP---PAK- KTKKNKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACR------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------PVSWENA---E-------PNLDD--N------EDEEE--TAV-----EIEAEPE V--EQE-APAPPPSKKRRGRPPGKAAAQPK------------QSQPAAIIQVEDQNTGEI ENIIVEVKKEPD---------------AETAE-EEEEAQPAVV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|34785484|gb|AAH57697.1| Ctcf protein [Xenopus laevis] -----------------MEGEMAEDIVEDSETFMKRKETKTYQRRRE------------- ---------------GGVDEENCVIVQSQTDICEVP------------------------ -HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GMVSQ---EGD- -----------------PTVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL- HAA-------FENDVSKEVLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQA ELQ----------------------------PQEEPGWQKDP-------DYVP---PMK- KSKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYAL IQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHA DSCTGPD------------------GTDGENGEE--------GEVIHKKGKRGRKRKMRS KK-EGSTDSEDNA---E-------PELDD--DDEDEDDDEEEE--TPV-----EIEADPE P-EEPV-SPIPPPAKKRRGRPPGK-ANQAK--------------QNAAVIQVEDHNTRAI ENIIVQVKKESD---------------LEAEVVVEAPVLTPAV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|11878220|gb|AAG40852.1|AF305695_1 transcriptional repressor [Xenopus laevis] -----------------MEGEMAEDIVEDSETFMKRKETKTYQRRRE------------- ---------------GGVDEENCVIVQSQTDICEVP------------------------ -HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GMVSQ---EGD- -----------------PTVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL- HAA-------FENDVSKEVLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQA ELQ----------------------------PQEEPGWQKDP-------DYVP---PMK- KSKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYAL IQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHA DSCTGPD------------------GTDGENGEE--------GEVIHKKGKRGRKRKMRS KK-EGSTDSEDNA---E-------PELDD--DDEDEDDDEEEE--TPV-----EIEADPE P-EEPV-SPIPPPAKKRRGRPPGK-ANQAR--------------QNAAVIQVEDHNTRAI ENIIVQVKKESD---------------LEAEVVVEAPVLTPAV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >XENTR|ENSXETP00000062683 pep:known scaffold:JGI_4.2:GL172782.1:1468935:1488905:1 gene:ENSXETG00000015615 transcript:ENSXETT00000060905 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MESEMAEAVVEDSETFMKRKETKTYQRRRE------------- ---------------GGVDEDNCVIVQSQTDISEVP------------------------ -HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GVVSQ---EGD- -----------------PTVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL- HAA-------FENEVSKEGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQA ELQ----------------------------QQEEPGWQKDP-------DYVP---PIK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYAL IQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHA DNCTGPD------------------GTDGENGGE--------SEVVHKKGKRGRKRKMRS KK-EGSSDSDHNN---E-------PFFQD--N------AEPEQ--TPV-----EIEADPE P-EEPL-TPLPPPAKKRRGRPPGK-ANQAK--------------QNAAVIQVDDHSNRAI ENIIVQVKKESD---------------LEAEGGVEAAVPTPAV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|170284950|gb|AAI61099.1| ctcf protein [Xenopus (Silurana) tropicalis] -----------------MESEMAEAVVEDSETFMKRKETKTYQRRRE------------- ---------------GGVDEDNCVIVQSQTDISEVP------------------------ -HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GVVSQ---EGD- -----------------PTVDDTQIITLQVVN--------------------------ME EQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL- HAA-------FENEVSKEGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQA ELQ----------------------------QQEEPGWQKDP-------DYVP---PIK- KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYAL IQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKP YACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHA DNCTGPD------------------GTDGENGGE--------SEVVHKKGKRGRKRKMRS KK-EGSSDSEDNA---E-------PELED--DD-DEDEDDEDE--TPV-----EIEADPE P-EEPL-TPLPPPAKKRRGRPPGK-ANQAK--------------QNAAVIQVDDHSNRAI ENIIVQVKKESD---------------LEAEGGVEAAVPTPAV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >LATCH|ENSLACP00000011174 pep:novel scaffold:LatCha1:JH126699.1:452823:489575:1 gene:ENSLACG00000009833 transcript:ENSLACT00000011258 gene_biotype:protein_coding transcript_biotype:protein_coding MQIYCTFLFLRERGREKMENEPSEVILEENETFSKGKERKTYQRRRE------------- ---------------GGQEEDNGAVIQNHPDGIEVVQDLQNQPDGTEEAQDLQNQSDGIE AQDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------AGVNQ---EGE- -----------------ATVDDTQIITLQVVN--------------------------ME EQ----PISLGELQLVQVPVPVSV---PVTATTVG----------------------QL- QGT-------YENDVSKEGLQ-GEPVICHTLPLPE------GFQVVKVGANGEVETLEQE ELQPP-------------------------PPQEDPNWAKDP-------EFQP---PAK- K-KKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKP YACSHCEKTF-RQKQLLDMHFKRYHD-------PTFVPATFVCTKCGKTFTRRNTMARHA ENCTGPD------------------SVEGENGGE--------P----KKSKRGRKKKMRS KR-DDSSGSDENA---E-------PELDD--I------DEEEE--EAVVINDEEMEGGPE A------LPAPPPAKKKRGRPPGK-SNQAK------------STQTAAIIQVEDQNAGTI ENIIVEVKKEPD---------------TEEEEGEVEQAQPVVV----------------- ---------EAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >ORENI|ENSONIP00000007230 pep:novel scaffold:Orenil1.0:GL831150.1:2536970:2540553:1 gene:ENSONIG00000005736 transcript:ENSONIT00000007235 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ --------------------------EGGEALTQ-------------------------- -GEVAGNME-MMVMDALDP-TLLQMKTEVLE-----------------GGGT----MTV- -----------------SGGDEGQIITLQVVN--------------------------ME EQ-AGAALGLGQLQLVQ------V---PVTTTTVD----------------------GL- QAT-------YVETSAAN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQE ELQ----------------------------PQDDPEWTKDP-------DYQPIT-AVR- KGKKGKKSRLR----Y-AEG---------------------------DRDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKP FACSQCDKTF-RQKQLLDMHFKRYHD-------PSFIPTAFVCDKCSKTFTRRNTMLRHA DNCTGDA------------------TLE-ENGTP--------PP---KKGRRGRKRKMQS RR-DDDDDDTVNI---E-------GELDE--------AEEEEDMLTEI-----EVEQAPS V--VPIPAPVEPPVKRKRGRPPKSKPDSK-------------RIIAAAIIRVEDETTGEV DDIIV--KKEVG------------ADQDDD---GNEAAQEVVV----------------- ---------APPNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|348509710|ref|XP_003442390.1| PREDICTED: transcriptional repressor CTCF-like [Oreochromis niloticus] -----------------MISCQGNCLNFVSQAICLQGKKARKRAPTLTEADFIGFLKQPR ICWPIAMEAEVVSMESAQATDGKVLPEGGEALTQ-------------------------- -GEVAGNME-MMVMDALDP-TLLQMKTEVLE-----------------GGGT----MTV- -----------------SGGDEGQIITLQVVN--------------------------ME EQ-AGAALGLGQLQLVQ------V---PVTTTTVD----------------------GL- QAT-------YVETSAAN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQE ELQAAHEELQGTRVEEEEEDEEPAEVETSVPPQDDPEWTKDP-------DYQPIT-AVR- KGKKGKKSRLR----Y-AEG---------------------------DRDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKP FACSQCDKTF-RQKQLLDMHFKRYHD-------PSFIPTAFVCDKCSKTFTRRNTMLRHA DNCTGDA------------------TLE-ENGTP--------PP---KKGRRGRKRKMQS RR-DDDDDDT------E-------GELDE--------AEEEEDMLTEI-----EVEQAPS V--VPIPAPVEPPVKRKRGRPPKSKPDSK----------------PAAIIRVEDETTGEV DDIIV--KKEVG------------ADQDDD---GNEAAQEVVV----------------- ---------APPNGDLTPE------------MILSMMDR--------------------- ------------------------- >ORENI|ENSONIP00000007229 pep:novel scaffold:Orenil1.0:GL831150.1:2536889:2541050:1 gene:ENSONIG00000005736 transcript:ENSONIT00000007234 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------MEAEVVSMESAQATDGKVLPEGGEALTQ-------------------------- -GEVAGNME-MMVMDALDP-TLLQMKTEVLE-----------------GGGT----MTV- -----------------SGGDEGQIITLQVVN--------------------------ME EQ-AGAALGLGQLQLVQ------V---PVTTTTVD----------------------GL- QAT-------YVETSAAN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQE ELQAAHEELQGTRVEEEEEDEEPAEVETSVPPQDDPEWTKDP-------DYQPIT-AVR- KGKKGKKSRLR----Y-AEG---------------------------DRDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKP FACSQCDKTF-RQKQLLDMHFKRYHD-------PSFIPTAFVCDKCSKTFTRRNTMLRHA DNCTGDA------------------TLE-ENGTP--------PP---KKGRRGRKRKMQS RR-DDDDDDT------E-------GELDE--------AEEEEDMLTEI-----EVEQAPS V--VPIPAPVEPPVKRKRGRPPKSKPD------------------TAAIIRVEDETTGEV DDIIV--KKEVG------------ADQDDD---GNEAAQEVVVGEGKSTIQMEELSQGEG VAQAGQLSEAPPNGDLTPE------------MILSMMDR--------------------- ------------------------- >TAKRU|ENSTRUP00000045998 pep:novel scaffold:FUGU4:scaffold_14:1533474:1536803:-1 gene:ENSTRUG00000017943 transcript:ENSTRUT00000046152 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ --EVTGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV- -----------------TGGDEGQIITLQVVN--------------------------ME EQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL- QAT-------YVDTSTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQE ELQAA----------------------------EDPDWSKDP-------DYQPIT-TVR- KGKKGKKSRLR----Y-GEG---------------------------DRDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKP FACSQCEKTF-RQKQLLDMHFKRYHD-------PTFVPTAFVCSKCSKTFTRRNTMLRHA ENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQS RK-DDDDDDDDDT---E-------PDQED--------MDDEDEMLSEI-----EVEQAPP V--VPIPAPVEPPVKRKRGRPPKNKPAGEF------------QDQPAAIIRVEDEVTGEV DDIIV--KKEVG------------ADQDDQEICNEEAVEQVVV----------------- ---------APPNGDLTPE------------MILSMMDR--------------------- ------------------------- >TAKRU|ENSTRUP00000045999 pep:novel scaffold:FUGU4:scaffold_14:1533474:1536800:-1 gene:ENSTRUG00000017943 transcript:ENSTRUT00000046153 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ---VTGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV- -----------------TGGDEGQIITLQVVN--------------------------ME EQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL- QAT-------YVDTSTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQE ELQAAHEELQGTREEEEEEEEEPA--------EEDPDWSKDP-------DYQPIT-TVR- KGKKGKKSRLR----Y-GEG---------------------------DRDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKP FACSQCEKTF-RQKQLLDMHFKRYHD-------PTFVPTAFVCSKCSKTFTRRNTMLRHA ENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQS RK-DDDDDDDDDTVS-E-------PDQED--------MDDEDEMLSEI-----EVEQAPP V--VPIPAPVEPPVKRKRGRPPKNKPAVAKIVFFGG------FFPAAAIIRVEDEVTGEV DDIIV--KKEVG------------ADQDDQEICNEEAVEQVM----------EELAQEEA AAQEVPLSEAPPNGDLTPE------------MILSMMDR--------------------- ------------------------- >TETNI|ENSTNIP00000012180 pep:novel chromosome:TETRAODON8:5:5277710:5281213:1 gene:ENSTNIG00000009314 transcript:ENSTNIT00000012371 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ --EVAGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV- -----------------TGGDEGQIITLQVVN--------------------------ME EQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL- QAT-------YVDASTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQE ELQAAHEEL-----------------------QEDPDWSKDP-------DYQPIT-TVR- KGKKGKKSRLR----Y-GEG---------------------------DRDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKP FACSQCEKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCSKTFTRRNTMLRHA ENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQS RK-DDDDDDTGS----E-------PEPEE--------MDEEDEMLSEI-----EVEQAPP V--VPIPAPVEPPVKRKRGRPPKNKPA------------------TAAIIRVEDEATGEV DDIIV--KKEVG------------ADQDDQEICSEEAVERCSC----------------- -------LRRRRNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|47230373|emb|CAF99566.1| unnamed protein product [Tetraodon nigroviridis] ------------------------------------------------------------ -------MEDDVVSMETTQADGKVLPEGVDSLIQGS--------------------AIAQ QAEVAGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV- -----------------TGGDEGQIITLQVVN--------------------------ME EQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL- QAT-------YVDASTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQE ELQAAHEELQGTREEEEEEEEEAADVEPVVSQQEDPDWSKDP-------DYQPIT-TVR- KGKKGKKSRLR----Y-GEG---------------------------DRDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKP FACSQCEKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCSKTFTRRNTMLRHA ENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQS RK-DDDDDDT------E-------PEPEE--------MDEEDEMLSEI-----EVEQAPP V--VPIPAPVEPPVKRKRGRPPKNKPA------------------TAAIIRVEDEATGEV DDIIV--KKEVG------------ADQDDQEICSEEAVEQVVVGGGKSTIQMEELAQEEV AGQEVQLSEAPPKRRPNPRDDPQHDGPVMDASVKSVNTRLGFKKNLFLFFFFFFFFFSFS FGFRLTNACILKRNKHHCCNLRFHF >GASAC|ENSGACP00000020939 pep:novel group:BROADS1:groupII:11613372:11616719:-1 gene:ENSGACG00000015865 transcript:ENSGACT00000020979 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ -GEVVDDMG-MVVMDALDP-TLLQMKTEVLE-----------------GGGT----VTV- -----------------TGGDEGQIITLQVVN--------------------------ME EQ-TGAALGLGQLQLVQ------V---PVTRATVE----------------------GL- QAT-------YVDASTAN--KDADPVICHTLPLPE------GFQVVKVGANGEVETVEQ- ------------------------EVEATVPLEEDPEWSKDP-------DYQPIS-SLRN KGKKGKKSRLR----Y-GEG---------------------------NRDMDVSVYDFEE EQQ-------------------------EGMLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYCSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFLEKGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCEQ-------CDYCCRQ-ERHMVMHKR------------THTGEKP FACSQCDKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCNKTFTRRNTMLRHT ENCSGEI------------------E-E-ENGTP--------AP---KKARRGRKRKMQT RR-DDDDTGSNAK---E-------DELDE--------VEEEEE-LSEL-----EVEQDPP V--VPIPAPVEPPVKRKRGRPPKNKPNIPKSDLK--------LLTAAAIIRVEDEVTGEV DDIIV--KKEVG------------VDRDDQEEATDGAVEE-----------------EAV AAPEV--SEAPPNGDLTPE------------MILSMMDR--------------------- ------------------------- >GASAC|ENSGACP00000020941 pep:novel group:BROADS1:groupII:11613372:11616695:-1 gene:ENSGACG00000015865 transcript:ENSGACT00000020981 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ----------MVVMDALDP-TLLQMKTEVLE-----------------GGGT----VTV- -----------------TGGDEGQIITLQVVN--------------------------ME EQ-TGAALGLGQLQLVQ------V---PVTRATVE----------------------GL- QAT-------YVDASTAN--KDADPVICHTLPLPE------GFQVVKVGANGEVETVEQE EMEADHDELLEVR----------AEVEATVPLEEDPEWSKDP-------DYQPIS-SLRN KGKKGKKSRLR----Y-GEG---------------------------NRDMDVSVYDFEE EQQ-------------------------EGMLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYCSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFLEKGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCEQ-------CDYCCRQ-ERHMVMHKR------------THTGEKP FACSQCDKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCNKTFTRRNTMLRHT ENCSGEI------------------E-E-ENGTP--------AP---KKARRGRKRKMQT RR-DDDDT--------E-------DELDE--------VEEEEE-LSEL-----EVEQDPP V--VPIPAPVEPPVKRKRGRPPKNKPNIPKSDLK--------LLTAAAIIRVEDEVTGEV DDIIV--KKEVG------------VDRDDQEEATDGAVEEVVVGEGKSTIQLEELPQEAV AAPEV--SEAPPNGDLTPE------------MILSMMDR--------------------- ------------------------- >ORYLA|ENSORLP00000011017 pep:known chromosome:MEDAKA1:3:20410268:20415910:-1 gene:ENSORLG00000008771 transcript:ENSORLT00000011018 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------METGQATAL---------------------------------- ------------------ASDGKVLSEGGEALIQTG------------------------ QGDEAGTME-MMVMDALDP-ALLQMKTEVLE-----------------GGGT----VTV- -----------------TGGDEGQIITLQVVN--------------------------ME EQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL- QAT-------YVEASAAN--KDA--VICHTLPLPE------GFQVVKVGANGEVETVEQD ELQAAQEDLQGQEGEEVEEDEEAAEIVTSV-PQDDPEWTKDP-------DYQPIT-AVR- KGKKGKKSRLR----Y-AEG---------------------------DRDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKYHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYAL IQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKP FACEQCEKTF-RQKQLLDMHFKRYHD-------PTFVPTAFVCTKCSKTFTRRNTMLRHA EGCTGEA------------------SGD-ENGTP--------TP---KKGRRGRKRKMQA RE-KKPDKVDSDT---E-------GELDE--------IEEEDDLLTEI-----EVEQAAP V--IPIPAPIEPPVKRKRGRPPKNKPEVCPCFSGIS------NLSVAAIIHVEDEVQ-EV EELV---KKEVG---------------AEQVNCTDETTEQVITGGGKPGAQSEELSQADA AAQEVQLSAAPSNGDLTPE------------MILSMMDR--------------------- ------------------------- >ORENI|ENSONIP00000005164 pep:novel scaffold:Orenil1.0:GL831206.1:2751803:2762795:1 gene:ENSONIG00000004097 transcript:ENSONIT00000005168 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MDGRPTDGV---GVVDVPTKEFPSIQAVHSQDAMVADLLQQAA EAG-------------GHGEGMAAATQSQQQLME------------------------GV GVEGGTGVE-MMVMDSLDP-TLLQMKTEVIDAAVGGSSAAVGVVGGVPGSAHQ---ATV- -----------------TTVDQTQIITLQVVN--------------------------ME EQ---AALGLGELQLVQ------V---PVSATTVE----------------------ALQ QGN-------FVDTTAMP--KDGDPVICHTLPLPE------GFQVVKVGANGEVETVEQE EEG-----AETQPDEEEDEEEEPVQ-----PPNDDPNWAKDP-------DYQPPSGVVK- KIKKGKKSRLR----Y-AEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKP YACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPATFVCPKCNKTFTRRNTMARHA ENCSGEV------------------E-DAENGAT--------IP---KKGRRGRKRKMRS RRDDDDDSDEDHA---EQDDDEEEGEGEE--ESSLLQEEEDP---ESM-----ELDQAPA A--IPVPAPDEPPVKRKRGRPPKNAPK-PPTPSKSVRVATKTTASAAAIIQVEDESTGAV ENIIV--KKEEGDASAATPLDQGVALTVEGVGLD-EGVETVEL----------------P VNEET--AAASANGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|348523553|ref|XP_003449288.1| PREDICTED: transcriptional repressor CTCF-like [Oreochromis niloticus] -----------------MDGRPTDGV---GVVDVPTKEFPSIQAVHSQDAMVADLLQQAA EAGGVVEGQAGVVVEQGHGEGMAAATQSQQQLME------------------------GV GVEGGTGVE-MMVMDSLDP-TLLQMKTEVIDAAVGGSSAAVGVVGGVPGSAHQ---ATV- -----------------TTVDQTQIITLQVVN--------------------------ME EQ---AALGLGELQLVQ------V---PVSATTVE----------------------ALQ QGN-------FVDTTAMP--KDGDPVICHTLPLPE------GFQVVKVGANGEVETVEQE EEG-----AETQPDEEED-EEEPVQ-----PPNDDPNWAKDP-------DYQPPSGVVK- KIKKGKKSRLR----Y-AEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKP YACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPATFVCPKCNKTFTRRNTMARHA ENCSGEV------------------E-DAENGAT--------IP---KKGRRGRKRKMRS RRDDDDDSDEDHA---EQDDDEEEGEGEE--ESSLLQEEEDP---ESM-----ELDQAPA A--IPVPAPDEPPVKRKRGRPPKNAPK-PPTPSKSVRVATKTTASAAAIIQVEDESTGAV ENIIV--KKEEGDASAATPLDQGVALTVEGVGLD-EGVETVEL----------------P VNEET--AAASANGDLTPE------------MILSMMDR--------------------- ------------------------- >ORYLA|ENSORLP00000022986 pep:novel ultracontig:MEDAKA1:ultracontig72:341588:352749:1 gene:ENSORLG00000018357 transcript:ENSORLT00000022987 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ----------------------MVAGQTQQQLMDAG---------------------VGV AVDGGAGVD-MMVMDSLDP-TLLQMKTEVMDAAVGASSPSAAVVGGVAGAAHQ---ATV- -----------------TTVDQTQIITLQVVN--------------------------ME EQ---AALGIGELQLVQ------V---PVSATTVE----------------------ALQ QGT-------FVDASSIP--KDGDPVICHTLPLPE------GFQVLEDQKG--------- --------FSRSVRAHQHSAGAPMQ-----PPNNDPSWAKDP-------DYQPPSGVVK- KVKKGKKSRLR----Y-AEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKP FGCSQCEKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCPKCSKTFTRRNTMARHA ENCSGEV------------------D-DAENGAP--------TP---KKGRRGRKKKMRS RR-DEDDSDEDQL---E-PDDEDEAEEEE--EASLLLEEDEP---ESL-----ELDQAPA A--VPVPAPEEPPVKRKRGRPPKNAPK-VPAPSKPVRTPSK-TSSAAAVIQVEDESTGAV -DIIV--KKEEADGAAEAPLQGGVALAVEDAAMDAEGAETVEL----------------A DGEET---VAAANGDLTPE------------MILSMMDR--------------------- ------------------------- >GADMO|ENSGMOP00000006796 pep:novel genescaffold:gadMor1:GeneScaffold_4125:40745:45236:1 gene:ENSGMOG00000006386 transcript:ENSGMOT00000006994 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ----------------------MEAVHNQQQLLE-------------------------- --EGGGGVE-MMVMESLDP-ALLQMKTE----------------GGVAGGAHQ---ATV- -----------------TTVDQTQIITLQVVN--------------------------ME EQ---AALGLGELQLVQ------V---PVSASTVE----------------------ALQ QGT-------FVDATAMP--KDGDPVICHTLPLPE------GFQVGKQ------------ ------------------------------VQNDDSAWSKDP-------DYQPPSAALK- KSKKGKKSRLR----Y-AEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNIKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDLGGVHLRKQHSFIETGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKP YACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPTSFVCPKCSKTFTRRNTMARHA ENCNGEI------------------D-DAENGTP--------TP---KRGRRGRKRKMRS RR-DEEEDSEDHA---D--------------PDLLLQEEEEQ---DAM-----ELDQAPA T--VPVPAPEEPPVKRKRGRPPKNAPKPAPTPTKSPRVAAKAAATAAAIIQVEDESTGAV ENIIV--KKED-------PRAPGAGLAVEAVGLEAEEVEAVEV----------------Q GTEDEAGAAAAANGDLTPE------------MILSMMDR--------------------- ------------------------- >GASAC|ENSGACP00000003270 pep:novel group:BROADS1:groupXIX:1433175:1439803:1 gene:ENSGACG00000002504 transcript:ENSGACT00000003281 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ -----------------------MVAAAQHQLMEAA---------------------VGV GVDGGASVE-MMVMDSLDPIPLLQMKTEVIDSAVGGSSATLGVVGGVAGAAHQ---ATV- -----------------TTVDQTQIITLQVVN--------------------------ME EQ---AALGLGELQLVQ------V---PVSATTVE----------------------ALQ QGT-------FVDTTAMP--KDGDPVICHTLPLPE------GFQVNKQTHS--------- --------QSTFGEERKEPPAALLH-----PQNDDASWAKDP-------DYQPPHGAFK- KPKKGKKSRLR----Y-GEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDLGGVHLRKQHSFIETGKKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKP YACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPTAFVCPKCSKTFTRRNTMVRHS ENCNGEV------------------E-DAENGAP--------AP---KKGRRGRKRKMRS RR-DEEDSEDDNA---E--------FGEE--ETSLLQEEEEEEEPESM-----ELDQAPA A--IPVPAPDEPPVKRKRGRPPKNAPKPPPTPSRSARVAAKAAASAAALVQLEDESTGAV ENITV--KKEDSQAPEATPAEQGAAPA-------AEGAETVEL----------------P VNEDTASAAAAANGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|126632718|emb|CAM56716.1| CCCTC-binding factor (zinc finger protein) [Danio rerio] -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAV IEQ--------------AQAEVEPVVEAQQQLVESV------------------------ -VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV- -----------------TTVDDTQIITLQVVN--------------------------ME EQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL- QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQD ELQ-----PQDDQPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK- KVKKTKKSKLR----YNTEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGRKCRYCDAVFHERYAL IQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKP YACSQCEKTF-RQKQLLDMHFRRYHD-------PNFVPTSFVCTKCGKTFTRRNTMARHA ENCTGMD------------------SADGENGTP--------P----KRGRGGRKRKMRS RK-DDDDDDDSDEHGEP--------------DLDDIDEEDEDDLLDEDQMG--LLDQAPP S--VPIPAPAEPPIKRKRGRPPKNAPKVSPTKSITK------TTTAAAIIQVEDESTGAI ENIIV--KKEPE--------------------------GTDAVVAAQPIIEEVEAVEADV ETVQLTVPEAAPNGDLTPE------------MILSMMDR--------------------- ------------------------- >gi|111306380|gb|AAI21783.1| Ctcf protein [Danio rerio] -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAV IEQ--------------AQAEVEPVVEAQQQLVESV------------------------ -VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV- -----------------TTVDDTQIITLQVVN--------------------------ME EQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL- QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQD ELQ-----PQDDQPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK- KVKKTKKSKLR----YNTEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKKKKK--- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >gi|120538162|gb|AAI29308.1| Ctcf protein [Danio rerio] -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAV IEQ--------------AQAEVEPVVEAQQQLVESV------------------------ -VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV- -----------------TTVDDTQIITLQVVN--------------------------ME EQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL- QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQD ELQ-----PQDDQPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK- KVKKTKKSKLR----YNTEG---------------------------DKDMDVSVYNFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKKKKK--- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >gi|66911889|gb|AAH97009.1| Ctcf protein, partial [Danio rerio] -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAV IEQ--------------AQAEVEPVVEAQQQLVESV------------------------ -VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV- -----------------TTVDDTQIITLQVVN--------------------------ME EQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL- QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQD ELQ-----PQDDRPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK- KVKKTKKSKLR----YNTEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKKKKK--- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >gi|53734069|gb|AAH83236.1| Ctcf protein, partial [Danio rerio] -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAV IEQ--------------AQAEVEPVVEAQQQLVESV------------------------ -VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV- -----------------TTVDDTQIITLQVVN--------------------------ME EQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL- QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQD ELQ-----PQDDQPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK- KVKKTKKSKLR----YNTEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVRWCRE CIAVQ------------------------------------------------------- --------LIKKYCPK-------------------------------------------- ------------------------------------------------------------ -----------KKKK--------------------------------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >DANRE|ENSDARP00000059912 pep:known chromosome:Zv9:18:22066704:22074092:1 gene:ENSDARG00000056621 transcript:ENSDART00000059913 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAV IEQ--------------AQAEVEPVVEAQQQLVESV------------------------ -VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV- -----------------TTVDDTQIITLQVVN--------------------------ME EQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL- QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQD ELQ-----PQDDQPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK- KVKKTKKSKLR----YNTEG---------------------------DKDMDVSVYDFEE EQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G --------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLR NHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVRWCRE CIAVQ------------------------------------------------------- --------LIK-YCP--------------------------------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >SARHA|ENSSHAP00000015681 pep:novel scaffold:DEVIL7.0:GL834666.1:1222728:1242523:1 gene:ENSSHAG00000013360 transcript:ENSSHAT00000015809 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ---------------------------RMGTEASASPEQFTKIKGTDVIQEKA-KENDVD KVSKLKERQ-SSCGLEVDC-SYGVLQAKIVE-------------------GELELA-PS- -----------------NEENEKHILTLQTVH--------------------------FA TD-ETDHQEMSQLTVQP---AEGM---HVMVQQGE----------------------SGL QSL-------LVLQQDIN-----VQAELNEIPHQN------LHQCVAISIQEEVFSLHEM EVMEINVVEESVEVSSEEDKLTVNS-----PLDENTELIK----------------LCEE REFTDQKEEIF----T-FEKLREGEK---------------------EEIILLPANSEIE EHE-------------------------DVHSS--EQDIDEVSGTAK----NQAKSK--G --------M-KRTFHCEICIFTSSRISSFNRHMKTHSDEKP--HMCHLCLKAFRTVTLLR NHVNTHTGTRPYKCS--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCKYASVEASKLKR HIRSHTGERPFHCCLCSYASKDTYKLKRHMRTHS---GEKPYECYVCHARFTQSGTMKIH ILQKHSENVPKHQCPH--CSTVIARKSDL-RVHLRNLHSYKATEMKCRYCPAVFHERYAL IQHQKTHRNEKRFKCDD-------CNYACKQ-ERHMTVHKR------------THTGEKP FTCLSCNKCF-RQKQLLNVHFKKYHD-------KNFIPTVYECPKCGKGFSRWNNMRKHS EHCEAVK------------------GKSI-------------PS---AKGRKNKKKKQKD PK-QDAKEEGRQT---------------RNFRSDKVVEQMPIEDTSIVNIEHHPNEIVPV VYGMAA-DV-EE------------------------------------------------ ------------------------------------------------------------ -----------PKTEVTCE------------MILNMMDK--------------------- ------------------------- >SARHA|ENSSHAP00000015680 pep:novel scaffold:DEVIL7.0:GL834666.1:1220619:1244690:1 gene:ENSSHAG00000013360 transcript:ENSSHAT00000015808 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ --------------MEILK-SY-------------------------------------- ---------------------RRQI----------------------------------- RD-ETDHQEMSQLTVQP---AEGM---HVMVQQGE----------------------SGL QSL-------LVLQQDIN-----VQAELNEIPHQN------LHQCVAISIQEEVFSLHEM EVMEINVVEESVEVSSEEDKLTVNS-----PLDENTELIKDKDKGIYGAEEGEVQQLCEE REFTDQKEEIF----T-FEKLREGEK---------------------EEIILLPANSEIE EHE-------------------------DVHSS--EQDIDEVSGTAK----NQAKSK--G --------M-KRTFHCEICIFTSSRISSFNRHMKTHSDEKP--HMCHLCLKAFRTVTLLR NHVNTHTGTRPYKCS--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCKYASVEASKLKR HIRSHTGERPFHCCLCSYASKDTYKLKRHMRTHS---GEKPYECYVCHARFTQSGTMKIH ILQKHSENVPKHQCPH--CSTVIARKSDL-RVHLRNLHSYKATEMKCRYCPAVFHERYAL IQHQKTHRNEKRFKCDD-------CNYACKQ-ERHMTVHKR------------THTGEKP FTCLSCNKCF-RQKQLLNVHFKKYHD-------KNFIPTVYECPKCGKGFSRWNNMRKHS EHCEAVK------------------GKSI-------------PS---AKGRKNKKKKQKD PK-QDAKEE--------------------------VVEQMPIEDTSIVNIEHHPNEIVPV VYGMAA-DV-EE------------------------------------------------ ------------------------------------------------------------ -----------PKTEVTCE------------MILNMMDK--------------------- ------------------------- >MONDO|ENSMODP00000020611 pep:novel chromosome:BROADO5:1:486248077:486266900:1 gene:ENSMODG00000016490 transcript:ENSMODT00000020976 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ---------------------------RMGTEASASPEHFTKIKGTDLIQEKA-KESDVD KVSRLKERQ-SSCGLEVDC-SYGVLQAKIVE-------------------GELELT-PQ- -----------------TQENEKHILTLQTVH--------------------------FA TD-EMDHQEM---TVEP---AEGM---HVMVQQGE----------------------SGL QSL-------L----------------LNEIPHQN------LHHCVAISIQEEVFSLHEL EVMEINVVEESVEISSEEDKLTVNP-----PLDENTESVKVE-------KNYEVPQLCEE REITDQKEGLF----T-FDKLREGEK---------------------EEIILLPANSEIE EHE-------------------------DIPSS--EQDTDEVSGTAK----NQAKTK--D --------V-KQTFHCEICIFTSSKISCFNRHMKTHSDEKP--HMCHLCLKAFRTVTLLR NHVNTHTGTRPYKCS--DCDMAFVTSGELVRHRRYKHTHEKPFKCTMCKYASVEASKLKR HIRSHTGERPFHCCLCNYASKDTYKLKRHMRTHS---GEKPYECYVCHARFTQSGTMKIH ILQKHSENVPKHQCPH--CATVIARKSDL-RVHLRNLHSYKAAEMKCRYCTDVFHERYAL IQHQKTHRNEKRFKCDD-------CSYACKQ-ERHMRVHKR------------THTGEKP FTCLSCNKCF-RQKQLLNVHFKKYHD-------KNFIPTVYECPKCGKGFSRWNNMRKHS ELCEVIR------------------GKAV-------------QS---AKGRKTKKKKQKG PK-QDVKEEGK-------------------------FEQMPIEDISNVNIERHTNEIVPV GYGIAT-DVAEE------------------------------------------------ ------------------------------------------------------------ -----------QKTEVTCE------------MILNMMDK--------------------- ------------------------- >ORNAN|ENSOANP00000025255 pep:known ultracontig:OANA5:Ultra516:6791142:6804936:-1 gene:ENSOANG00000009260 transcript:ENSOANT00000029059 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ -------------------------------------SCQDWYKSHL----KAYFIS--G --------V-NKTFHCDICKFTTSRQSSLNRHLKIHSDVKP--HVCHLCLKAFRSATLLR NHVNTHTGTKPYKCG--DCTMAFVTSGELVRHRRYKHTHEKPFQCTICKYASVEASKLKR HIRSHTGERPFRCRLCSYASRDTYKLKRHMRTHS---GEKPYECSVCQTKFTQRGTMKIH MLQKHTENAPKHQCPH--CGTMIARKSDL-RVHLKNLHSYKTTEIKCHYCSAAFHERYLL LQHQKTHRDEKRFKCGD-------CDYACKQ-ERHMIVHKR------------THTGEKP FSCLHCNKRF-RQKRLLSVHFRKYHD-------ENFTPIVYECPKCGKAFSRLWILYERS -HFRYVE------------------WNSIFLCCRCARLFQCPSV---CNLRRTKRS---- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >ORNAN|ENSOANP00000014738 pep:known ultracontig:OANA5:Ultra516:6791279:6804666:-1 gene:ENSOANG00000009260 transcript:ENSOANT00000014741 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ---------------------------------KIHSDVKP--HVCHLCLKAFRSATLLR NHVNTHTGTKPYKCG--DCTMAFVTSGELVRHRRYKHTHEKPFQCTICKYASVEASKLKR HIRSHTGERPFRCRLCSYASRDTYKLKRHMRTHS---GEKPYECSVCQTKFTQRGTMKIH MLQKHTENAPKHQCPH--CGTMIARKSDL-H----------------------------- -EHQKTHRDEKRFKCGD-------CDYACKQ-ERHMIVHKR------------THTGEKP FSCLHCNKRF-RQKRLLSVHFRKYHD-------ENFTPIVYECPKCGKAFSR-------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >HOMSA|gi|29570785|ref|NP_542185.2| transcriptional repressor CTCFL [Homo sapiens] ------------------------------------------------------------ ---------------------------MAATEISVLSEQFTKIKELELMPEKGLKEEEKD GVCREKDHR-SPSELEAER-TSGAFQDSVLE-------------------EEVELVLAP- -----------------SEESEKYILTLQTVH--------------------------FT SE-AVELQDMSLLSIQQ---QEGV---QVVVQQPG----------------------PGL -------------------------LWLEEGPRQS------LQQCVAISIQQELYSPQEM EVLQFHALEENVMVASEDSKLAVS-------LAETTGLIKLE-------EEQEKNQLLAE R----TKEQLF----F-VETMSGDERS--------------------DEIVLTVSNSNVE EQE-------------------------DQPTA--GQADAEKAKSTK----NQRKTK--G --------A-KGTFHCDVCMFTSSRMSSFNRHMKTHTSEKP--HLCHLCLKTFRTVTLLR NHVNTHTGTRPYKCN--DCNMAFVTSGELVRHRRYKHTHEKPFKCSMCKYASVEASKLKR HVRSHTGERPFQCCQCSYASRDTYKLKRHMRTHS---GEKPYECHICHTRFTQSGTMKIH ILQKHGENVPKYQCPH--CATIIARKSDL-RVHMRNLHAYSAAELKCRYCSAVFHERYAL IQHQKTHKNEKRFKCKH-------CSYACKQ-ERHMTAHIR------------THTGEKP FTCLSCNKCF-RQKQLLNAHFRKYHD-------ANFIPTVYKCSKCGKGFSRWINLHRHS EKCGSGE------------------AKSA-------------AS---GKGRRTRKRKQTI LK-EATKGQKEAA---------------KGWKEAANGDEAAAEEASTTKGEQFPGEMFPV ACRETTARVKEE------------------------------------------------ ------------------------------------------------------------ -----------VDEGVTCE------------MLLNTMDK--------------------- ------------------------- >PELSI|ENSPSIP00000012493 pep:novel scaffold:PelSin_1.0:JH209331.1:1517019:1535688:1 gene:ENSPSIG00000011195 transcript:ENSPSIT00000012554 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ---------------------------MMAAQDSHLPEPFTKIKGAERIWDRAREDDGGD RLPWVKERN-SICDPDVEV-LNGAPPAKALE-----------------GGRNLELS-PS- -----------------LIQSEKHLIMLQTVR--------------------------LK EG-EEDLQAVSQLNIQQ---QSGL---HMVVQRGA----------------------SVL QPL-------VVVQQGV-------------GAQQN------IPTGVAISLQDGVYTFHDM EVMQINVLQEKVQAKDEENKS----------MDKSPGMLLIK---------KLVPKNLKN SVKIDRTKDLH----A-VEEILSCTA---------------------KDDISVSLNEPKE QGE---------------------------QSV--VKKTDTLEAHTN----TQHRKK--G --------E-KVTIHCDLCAFTSLRMSSLNRHMKTHSDEKP--HLCHLCLKAFRTVTLLR NHVNTHTGTRPYKCS--DCEMAFVTSGELARHRRYKHTLEKPFKCSVCKYSSVEASKLKR HIRSHTGERPYNCCLCSYASKDTYKLKRHMVTHS---GEKPYECYVCQARFTQSGTMKIH ILQKHSENVPRYQCPH--CNAFIARKSDL-GVHLRNLHSYLAVAMKCSYCEAVFHERYAL IQHKKTHRNEKRFKCDR-------CSYACKQ-ERHLIVHKR------------THTGEKP FTCVSCSKCF-RQKQLLTVHFRKHHD-------SNFKPTVYECPKCGKGYSRWNNMHKHA ENCGLAR------------------AKVV-------------TR---HKGSKGKKKRWNS LK-QDVKQEGCSE-----------VSCGGLWESAGTVDLGSFQDVSVVNTECCASEIVPV EYGIET-STPRE------------------------------------------------ ------------------------------------------------------------ -----------QKTEMTCE------------MILNMMDK--------------------- ------------------------- >TAEGU|ENSTGUP00000008541 pep:novel chromosome:taeGut3.2.4:20:12851742:12856852:1 gene:ENSTGUG00000008275 transcript:ENSTGUT00000008632 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ --------------------FTSAKLSSRRCHLRSHSDEKR--HTCHLCPKAFHTAALLH NHVNAHTGNKPHKCS--ECDAAFVTRGELSRHRRYKHTLEKPFKCTICEYSSVEASKMRR HVRSHTGERPYPCHLCSYASKDAYQLKRHMLTHT---GEKRYECYICQARFTQSGTMKIH VLQKHGENVPKHQCPH--CSTFLSRKRDL-GVHLRNLHSYMEEAVKCRDCGAAFHERYAF LQHRRTHRSEERFRCAQ-------CSCTCNQ----------------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >gi|171474913|gb|ACB47397.1| brother of regulator of imprinted sites [Pogona vitticeps] ------------------------------------------------------------ ------------------------------------------------------------ ----------------------------------------------------------A- -----------------LGEGEKHLVLLKTVH--------------------------LK IE-ENDAQGPSVAN-QH---DGVL---HAVMQRER----------------------CIL EPL-------EVMTQSI-------------GIRNN------LEEVVAVGLPEGIYTVQEM EVMHINCLKE-MQAFNEDEKS----------TRKTLDALRIE-----------RDRSIDV ATDGDKGQPLL----V-AEEDRISPF-------------------------------CLA EAA-------------------------KNLSS--RSKEDPVSFHNS----EKKDQS--N --------EGDVPQHCPFCTFTCFSIAGLRRHMKKHSEERP--HMCHLCLKAFRTVSLLR NHVNTHTGTKPHKCG--ECDMAFVTSGELSRHRRYKHTLEKPFKCTFCSYCSVEASKLKR HIRSHTGERPYHCTLCSYASRDTYKLKRHMVTHS---GEKPFECLICKARFTQAGTLKFH ILHKHETNVPKHQCPH--CQTSVARKGDL-SIHLRNLHSYIEVPLRCNYCDAAFHERYAF RQHKKTHRNEKRFKCDQ-------CNYACKQ-ERHMVIHKW------------THTGEKP FVCVACSKCF-RQKQLLRVHFKKHHD-------SSFKPKVYECSKCSKEYSRWSNMHKHA EKCEDRR------------------AI---------------QP---SKGSKGKKKADKR SS-HNRQREGSN------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >ANOCA|ENSACAP00000016003 pep:novel scaffold:AnoCar2.0:GL343217.1:1034665:1047855:1 gene:ENSACAG00000016243 transcript:ENSACAT00000016323 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ --------------HCRFCTYTSSSVTGLNRHMKRHSDKNP--HMCHLCLKVCRTVALLR NHMNTHTGTKPYKCS--ECDMAFVTGGELSRHRRYKHTHEKPFKCTFCNYSSVEASKLKR HIRSHTGERPYNCTLCSYASRDTYKLKRHMLIHS---GEKPFECLICKARFTQAGTLKFH KLHKHGTNVPKYQCPH--CNTAVARKGDL-RIHLQNLHSYIKVPLKCNFCEDAFHERHAF KQHKKTHINEKKFKCDQ-------CNYACKQ-GRHMVMHKR------------THTGEKP FVCISCSKCF-RQKQLLTIHAKKYHD-------SSFQPKVFECPQCGKEYSRWNNMRKHA ANCKGKS------------------VV---------------QP---SKGSKKRKKEEKR S----------------------------------------------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >MELGA|ENSMGAP00000009338 pep:novel chromosome:UMD2:22:12488611:12494479:1 gene:ENSMGAG00000009067 transcript:ENSMGAT00000010157 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ -------------FSCELCTYTSLKISSLNRHRKIHSKEKC--HVCHICLKAFRTAALLQ NHLNVHTGTRPYKCS--DCDMAFVTSGELARHRRYKHTFEKPFKCSVCKYSSVEASKLKR HVRSHTGERPYACYLCTYAGKDAYKLKRHMATHS---GEKPHECYICHTKFAQSG----- -------------------------------VHLQNVHSHMAVAGQCSHCEAAFHNHYAL TQHKKIHRDGRSFECDQ-------CSSACKQ-ECHLIVHKR------------THSGEKS FTCSCCSKTF-QQKQLFTVHSKKHCD-------SSV------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >SARHA|ENSSHAP00000001830 pep:novel scaffold:DEVIL7.0:GL867782.1:14413:31463:1 gene:ENSSHAG00000001630 transcript:ENSSHAT00000001851 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------EEGELDWH-PS- -----------------NEENERHLLALQMLP--------------------------FT TD-ETG-QEMSQLAVQP---AEGM---HIMEPQGE----------------------SGL QSL-------LGWQQYIN-----VQPELNEMPHQD------LSRCVELNIPEEIFSLEEM EMIENYIRDESADFFNED-KWIFDF-----PFDE--ELLKDQDKGIYG-AEWEAQELCEE REYTDPKEEIV----N-FETPRDGEK---------------------EEIILLLANSEIE EHE-------------------------DIHSP--PQDLDEVSQTAA----NQAETK--G --------T-KWTFHCEICKFTSSKMSSLTRHMKTHTAEKP--HMCHLCPKAFRTGTLLR NHLNTHTGTRPYKCS--DCEMAFVTSGELGRHRRYKHTHEKRFKCSMCNYASVEASKLKR HIRSHTGERPFPCSFCNYASKDIFKLKRHMTSHS---GEKPYECSFCSARFSQSGTLKIH VLQKHSGNAPKHQCPH--CATLITRKSDL-RVHLRNLHSYSAAEMKCRYCTAAFHERYAL IQHQKTHRDEKRFKCDV-------CSYACKQ-AQHMTIHKR------------IHTGEKP FTCLSCNKSF-RQKQLLKVHFKKYHD-------ETSVPPVHECPKCGKGFSRLNNMRKHS EHCEVVR------------------GKAV-------------PS---A-----KEKDQKG PE-QGAREEVLIG-----------FQTAQGLRNYKVIEEMPIEDISIVNIENATVEMVPV VYGTTS--DVQE------------------------------------------------ ------------------------------------------------------------ -----------PQTEITFE------------MILNMIEK--------------------- ------------------------- >PETMA|ENSPMAP00000004689 pep:putative scaffold:Pmarinus_7.0:GL483954:4616:13063:1 gene:ENSPMAG00000004257 transcript:ENSPMAT00000004708 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ --------------------EDTHIITLHPVS--------------------------LE ETGEGGGTSIGEITLVQVQADLTV---IIHNVKL-----------------------QLN SVL-------KIDVRRTL--LQGVP-ITHTLPLPE------GVQVVKVGPNGELE-VERA PMG----------------------------SQDERSKEKDP-------DYQL---PVKK PVRKGRKNKLR----Y-KQE---------------------------AADADISVYDFEE QEE-------------------------GRLVSSQDVGVEKAIAP-KPPKPTRIKKK--G --------A-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HCCHLCDRAFRTVTLLR NHVNTHTGTKPHKCM--ECDMAFVTSGELVRHRRYRHTHEKPFKCSMCDYASVEVSKLKR HIRSHTGERPFQCGLCSYASRDTYKLKRHMRTHS---GEKPYECHVCHARFTQSGTMKMH VLQKHTDNVPKYHCPH--CDAVIARKSDL-GVHLRKQHAVLERELRCRYCRAIFHERYAL MQHQRTHRNEKRFKCDQ-------CEYACKQ-ERHMIMHKR------------VHTGEKP FECTLCDKTF-RQKQLLDFHFKRYHD-------PSFVPTTYECSKCHRNFTRRSTMMKHF DMCDGEL------------------ESGEQNGK---------AR---RGRRRGRKRKMQS RK-HGSSSESDEMPTDE---------------------DEEEEELNEE-----SVEVADE E----VEEPEPPPMKRRRGRPPKAKPGRPA------KKVAGSDSVACGIIEIIPVTVGGP DGPDDDEEEE--------------EEEEEAAEGEEGAVETAADEG--------------- -----------PKNDITPE------------MILSMMDQ--------------------- ------------------------- >BRAFL|gi|260819198|ref|XP_002604924.1| hypothetical protein BRAFLDRAFT_217118 [Branchiostoma floridae] ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ---------MR------------------------------------------------- ------------------------------------------------------------ --------VGGKTYQCYKCDYTCQRMAFLERHMKVHTDERP--FKCGTCEREFRTMQSLQ NHINSHNGVKPHKCD--QCPMSFVTSGELMRHRRYKHTHEKPHKCTMCDYASVEISKLKR HMRSHTGERPFQCGMCSYASPDSYKLKRHMRTHT---GEKPYECSVCLATFTQSGSLKMH -MQRHLGTAPSYVCDI--CGTALTRKSDL-KSHVRKLHTGDKL-LTCKYCDSAFPDKYNL TKHLKTHQGEKRFRCED-------CNYCCTQ-ERHLINHKR------------CHTGEKP FVCVQCDHTF-RQEQLLKQHIKVHHT-------PGYTPPRYACTNCDKSFTRKGNLR--- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >MACEU|ENSMEUP00000007622 pep:novel genescaffold:Meug_1.0:GeneScaffold_3588:549:7519:-1 gene:ENSMEUG00000008337 transcript:ENSMEUT00000008359 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ------------------------------TEASALAEQFTKRKGPDLIQEKDAESDGGL SSRRGSGRG-LEVGC-----PYGVLQAKLVE---------------------GELQLVP- -----------------SQEGEKHVLTLQTVH--------------------------LA PE------ERGRVAVPP---AEGM---HVVVQPAE----------------------GGF PAV-------LVLQQDLS-----------VLARPS------VRRCVAISLQEELFSLHEM ELVEIDVVEDSTDVSCEDRKLE---------------------------KQGMNQPLSEE RELPDQREHLF----T-FEALGEGEE---------------------EEIILLPASSEME ERE-------------------------DVPAS--EQDLDEGSETAE----TQAEST--X --------X-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--XXXXXXXXXXXXXXXXX XXXXXXXXXXXXXXX--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXASKLKR HMRSHTGERPFHCCLCSYASKDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMH ILQKHSENVPKHQCPH--CATVIARKSDL-RVHLRNLHSYKATEMKCRYC-AVFHERYAL IQHQKTHRNEKRFKCDD-------CSYACKQ----------------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >OIKDI|GSOIDT00001753001 ------------------------------------------------------------ ------------------------------------------------------------ ---MSQQYEEEPIMEE-------------------------------------------- --------------------EDGNTTTVQFIA--------------------------VA PEEHERMVREGELPETIQGADVQM---TIGGTESRPVRLIAVDSNGIPVTDPTILQAAAE QAG-------IMFVSKDENDHENQISIDEAMQLRN---------------EENQQVPGYQ DQ----------------------------PPADPYDYQAEP-----------------E YHDSEKGVPLR----NYTSQDVVYEEAPENG----------------GIKQQENIYDYQD VPIQN-----------------------EPKIDNHYKKSQSNTQRTKYPGTIQVGAN--G --------EKRKVYQCSECAFYSHRHSNLIRHMKIHTDERP--YKCHLCARAFRTNTLLR NHINTHLGVKPYKCPEANCEMAFVTSGELTRHRRYKHTHEKPFKCTLCEYASVEISKLRR HFRSHTGERPFSCDICGKAFADSFHLKRHKFSHT---GEKPYECPHCKARFTQHGSLKMH VMQQHTKTAPKFECEREICHTMLGRKSDL-NVHLRKQHSYQEIPMQCRYCEEVFHDRWSL MQHQKTHRS-GRYRIDEDGNQIFDPDYDSEM-EDMEGGHGSGNLVYTEDGHVIGQDGSPN VIVQRVQHIDGQQGVPIEDHQQMQHDEHNMHAVEHQMDHSQEAPPIHQGHHRQAHAEAKS ERFEQEQQHDPFAFDDDQNMGNGEYMHAPQMGQTMAQPVE-------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >CIOIN|ENSCINP00000013892 pep:known scaffold:KH:HT000097.1:44901:51453:1 gene:ENSCING00000006765 transcript:ENSCINT00000013892 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ----------------------------MADDGKDSVEVTSQIESVNGTVAPPNDSIEPE ENEVSEKNE-EKIIEEVSNEAPAEAPV--------------------------------- --------------------EDGTTTTVQFIA--------------------------VS ADEHARMVTAGELPESVHGADVHM---QIPGAEAQQVRLIAVDSNGVPVTESAILQAAAE QAG-------IVFITKDDNGHDSQITIDQAMQLSM------RENPPMITATTAAVVLEQK EK-----------------EVHQDT-----DGNDTEDESETK-----------------K PSVRKPRLKLR----VYSQKDGGVDEVATSFLDDGSLALTDEVETLEEKPNDESVYEFQ- ----------------------------DPP----TGGDTEQPSDVTMLDPTMFKKN--S KRATDKSRDRRKIYQCRECSFYSHRHSNLVRHMKIHTDERP--YKCHLCERSFRTNTLLR NHINTHTGVKPYKCTVDGCVMAFVTSGELTRHTRYIHTHEKPFRCTLCDYASVEISKLRR HFRSHTGERPYSCEECGKAFADSFHLKRHRMSHT---GEKPYECPECNQRFTQRGSVKMH IMQQHTKTAPKFKCEI--CRTLLGRKSDL-NVHMRKQHAFQ------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >CIOIN|ENSCINP00000031196 pep:known scaffold:KH:HT000097.1:44901:51453:1 gene:ENSCING00000006765 transcript:ENSCINT00000036493 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------ ----------------------------MADDGKDSVEVTSQIESVNGTVAPPNDSIEPE ENEVSEKNE-EKIIEEVSNEAPAEAPV--------------------------------- --------------------EDGTTTTVQFIA--------------------------VS ADEHARMVTAGELPESVHGADVHM---QIPGAEAQQVRLIAVDSNGVPVTESAILQAAAE QAG-------IVFITKDDNGHDSQITIDQAMQLSM------RENPPMITATTAAVVLEQK EK-----------------EVHQDT-----DGNDTEDESETK-----------------K PSVRKPRLKLR----VYSQKDGGVDEVATSFLDDGSLALTDEVETLEEKPNDESVYEFQQ RQRGTMEEWRDRECEIILKDILQTLKDEDPP----TGGDTEQPSDVTMLDPTMFKKN--S KRATDKSRDRRKIYQCRECSFYSHRHSNLVRHMKIHTDERP--YKCHLCERSFRTNTLLR NHINTHTGVKPYKCTVDGCVMAFVTSGELTRHTRYIHTHEKPFRCTLCDYASVEISKLRR HFRSHTGERPYSCEECGKAFADSFHLKRHRMSHT---GEKPYECPECNQRFTQRGSVKMH IMQQHTKTAPKFKCEI--CRTLLGRKSDL-NVHMRKQHAFQ------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------- >STRPU|gi|72028083|ref|XP_797592.1| PREDICTED: uncharacterized protein LOC593001 [Strongylocentrotus purpuratus] ---------------MDENTDQPGSQTEEPVAPADQGEAEGTDAASMNVLETYLQNFNEE LSSGPATAAGAVQQAAASTVAATEEDDTESPLAVHVEEVADAEEDVQLEEGAEGDNVAVV KQEIGEEEEMEEEVGEASQPAPDQATIDITHTLQLLANASANISNPNALQENQEGEMGTE NTFMQQLDTSNLVDENGAKVDPSRIAGIQTVNGEQVVMVHNMEDGTSGIGQQQVLMVAMQ DNGQLSSMDQGVAQIAFSGAPVNMVGDNIVYQTTQ-------NTQYVPVSHNGTTQLAMT QSGGEPGTEVYTILQTVDGAETTTITTPTTMVVSSGGVHHLGDEQTHVIATTSGDHPSYA ELQ---------------------------PVSEDAAQQEEGALQMAAAEHEEVALIVQK PVKRKRGRPRKDEAAKMQTQIVIVREVVEG-----------------EDGQDPSVYDFYA GED-------------------------DTAPV--AGGDEK----------SGIEVG--G -------AKKKTRYVVPKFDDGRLLDQVLNRAKKGGPGRRPKVHECHLCGRIFRTSTLLR NHENTHSGTKPYKCE--LCPKAFGTSGELGRHMKYMHTHEKPHKCPLCDYLSVEASKIKR HMRSHTGEKPYKCTLCEYASTDNYKLKRHMRVHT---GERPFNCSQCDQSFSQKSSLKEH -EWKHVGNRPSHKCDH--CDTTFGRYADM-KTHVRKMHTAGE-PMICKICENAFTDRFTY MQHVRGHRGEKIYKCGE-------CGYSAPQ-KRHLVIHMR------------VHTGERP YECEECHETF-KHKQTLINHQRSKHNLIQEADGTKKRKATDEITSPSKRITRRQRMQIQE EEETEEMVEPHTLTTADGNTIQVSMAQGGEGTVQLVQTSDGTMPVILTVGGDGQNVDEAL QMMNGSLAAVQGHQDGEGQLMMAVPQGDGDNLHLEGQQASQQLQTSDD------------ --------------IADDSQPPELQQEGQVQEVSAPRESSSITQEQAAALQQQMVSQGII SEGSVIAAMEEDEDGTGDGTIYLFVEEQ-------------------------------- ------------------------------------------------------------ -------------------------