>HOMSA|gi|5729790|ref|NP_006556.1| transcriptional repressor CTCF isoform 1 [Homo sapiens] -----------------MEGDAVEAIVEESETFIKGKERKTYQRRRE----------------------------GGQEEDACHLPQNQTDGGEVV-------------------------QDVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVAP---EAE------------------AAVDDTQIITLQVVN--------------------------MEEQ----PINIGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKEGLAESEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELP----------------------------PQEDPSWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCAGPD------------------GVEGENGGE--------T----KKSKRGRKRKMRSKK-EDSSDS-ENA---E-------PDLDD--N------EDEEE--PAV-----EIEPEPEP--QPV-TPAPPPAKKRRGRPPGR-TNQPK------------QNQPTAIIQVEDQNTGAIENIIVEVKKEPD---------------AEPAEGEEEEAQPAAT--------------------------DAPNGDLTPE------------MILSMMDR-------------------------------------------------- >PELSI|ENSPSIP00000003796 pep:novel scaffold:PelSin_1.0:JH205822.1:16124:37080:-1 gene:ENSPSIG00000003593 transcript:ENSPSIT00000003816 gene_biotype:protein_coding transcript_biotype:protein_coding -----------MEQGDKMEGEAIEAIGEESETFIKGKERKTYQRRRE----------------------------GGQEEDACHMPPNQADGAEVV-------------------------QEVSGGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCSGPD------------------GVEGENGGE--------P----KKGKRGRKRKMRSKK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--EPV-APTPPPAKKRRGRPPGK-ANQPK------------QPQPAAIIQVEDQNTGAIENIIVEVKKEPD---------------AETV-GEEEEAQPAAV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|171474905|gb|ACB47393.1| CCCTC-binding factor [Pogona vitticeps] -----------------MEGEVVEAVGEESETFIKGKERKTYQRRRE----------------------------GGQEEDACPMPPNQADGSEVV-------------------------QDVNAGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVQQ---EAE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVSV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQSELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PTFVPAAFVCSKCGKTFTRRNTMARHADNCTGPD------------------GVEGENGGE--------P----KKGKRGRKRKMRSKK-EDSSDSEENA---E-------PELDN--N------EEEEE--TAI-----EIEAEPEV--EPV-APVPPPAKKRRGRPPGK-SNQPK------------QTQPTTIIQVEDQNTDAIENIIVEVKKEPE---------------AETV-GATAGTQPAAA--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|327281289|ref|XP_003225381.1| PREDICTED: transcriptional repressor CTCF-like [Anolis carolinensis] -----------------MEGEVVEAIGEESETFIKGKERKTYQRRRE----------------------------GGQEEDVCSMPPNQADGTEVV-------------------------QDVNTGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVQQ---EAE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------DL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQAELQ----------------------------PQEDPGWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCTGPD------------------GVEGENGGE--------P----KKGKRGRKRKMRSKK-ENSSDSEENA---E-------PELYDIEE------EDEEE--TAV-----EIEAEPEIEAEPV-APPPPPAKKRRGRPPGK-ANQPK------------QPQPTAIIQVEDESTGTIENIIVEVKKEPE---------------AETV-GVAAGAQPEAV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >MONDO|ENSMODP00000007129 pep:novel chromosome:BROADO5:1:685282646:685300408:1 gene:ENSMODG00000005757 transcript:ENSMODT00000007273 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIMEESETFIKGKERKTYQRRRE----------------------------GGQDEDACHISQTQADGSEVV-------------------------QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAD------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVAKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRSKK-EESSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--QPV-TPAPPPAKKRRGRPPGK-SSQPK------------QTQPTAIIQVEDQNTGAIENIIVEVKKEPD---------------AETVEGEEEEPQSAVV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >SARHA|ENSSHAP00000004296 pep:novel scaffold:DEVIL7.0:GL834762.1:75014:90492:1 gene:ENSSHAG00000003783 transcript:ENSSHAT00000004340 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIMEESETFIKGKERKTYQRRRE----------------------------GGQDEDACHISQTQADGSEVV-------------------------QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAD------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-HGA-------YENEVSKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRSKK-EESSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--QPV-TPAPPPAKKRRGRPPGK-ASQPK------------QTQPTAIIQVEDQNTGAIENIIVEVKKEPD---------------AEAVEGEEEEPPSAVV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >MACEU|ENSMEUP00000005228 pep:novel genescaffold:Meug_1.0:GeneScaffold_6565:184:14145:1 gene:ENSMEUG00000005728 transcript:ENSMEUT00000005742 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAVMEESETFIKGKERKTYQRRRE----------------------------GGQDEDACHISQTQADGSEVV-------------------------QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVPQ---EAD------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-------XXXXXXX-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFV-AAFVCSKCGKTFTRRNTMARHADNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRSKK-EESSDS-ENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--QPV-TPAPPPAKKRRGRPPGK-ASQPK------------QAQPTAIIQVEDQNTGAIENIIVEVKKEPD---------------AEAAEGEEEEPQSAVV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|396094|emb|CAA80319.1| CTCF protein [Gallus gallus] -----------------MEGEAVEAIVEESETFIKGKERKTYQRRRE----------------------------GGQEDEACHIAPNQADGGEVV-------------------------QDVNSGVQ-MVMMEHLDP-TLLQMKTEVME-----------------GAVPQ---ETE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKNKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCSGLD------------------GGEGENGGE--------T----KKGKRGRKRKMRSKK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--SAE-APAPPPSKKRRGRPPGKAATQTK------------QSQPAAIIQVEDQNTGEIENIIVEVKKEPD---------------AETVE-EEEEAQPAVV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|326927215|ref|XP_003209788.1| PREDICTED: transcriptional repressor CTCF-like [Meleagris gallopavo] -----------------MEGEAVEAIVEESETFIKGKERKTYQRRRE----------------------------GGQEDEACHIAPNQADGGEVV-------------------------QDVNSGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVPQ---ETE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKNKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCSGLD------------------GGEGENGGE--------T----KKGKRGRKRKMRSKK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--EPE-APAPPPSKKRRGRPPGKAATQTK------------QSQPAAIIQVEDQNTGEIENIIVEVKKEPD---------------AETVE-EEEEAQPAVV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >MELGA|ENSMGAP00000001773 pep:novel chromosome:UMD2:13:1721989:1735383:-1 gene:ENSMGAG00000002200 transcript:ENSMGAT00000002439 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIVEESETFIKGKERKTYQRRRE----------------------------GGQEDEACHIAPNQADGGEVV-------------------------QDVNSGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVPQ---ETE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKNKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKKGLG--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHSATVGEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCSGLD------------------GGEGENGGE--------T----KKGKRGRKRKMRSKK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--EPE-APAPPPSKKRRGRPPGKAATQTK------------QSQPAAIIQVEDQNTGEIENIIVEVKKEPD---------------AETVE-EEEEAQPAVV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >SARHA|ENSSHAP00000004295 pep:novel scaffold:DEVIL7.0:GL834762.1:75014:90492:1 gene:ENSSHAG00000003783 transcript:ENSSHAT00000004339 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIMEESETFIKGKERKTYQRRRE----------------------------GGQDEDACHISQTQADGSEVV-------------------------QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAD------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-HGA-------YENEVSKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQGERNGFCIRI------------TVDTIKNFRVSVSSGAFDRQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRSKK-EESSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--QPV-TPAPPPAKKRRGRPPGK-ASQPK------------QTQPTAIIQVEDQNTGAIENIIVEVKKEPD---------------AEAVEGEEEEPPSAVV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|224063911|ref|XP_002196108.1| PREDICTED: CCCTC-binding factor (zinc finger protein) [Taeniopygia guttata] -----------------MEGEAVDAIVEESETFIKGKERKTYQRRRE----------------------------GGQEDDACHIPPNQADGSEVV-------------------------QDVSSGVQ-MVMMDQLDP-TLLQMKTEVME-----------------GAVSQ---ETE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKNKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACR------------------------------------------------------------------------------------------------------------------------------------------------------------PVSWENA---E-------PNLDD--N------EDEEE--TAV-----EIEAEPEV--EQE-APAPPPSKKRRGRPPGKAAAQPK------------QSQPAAIIQVEDQNTGEIENIIVEVKKEPD---------------AETAE-EEEEAQPAVV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|34785484|gb|AAH57697.1| Ctcf protein [Xenopus laevis] -----------------MEGEMAEDIVEDSETFMKRKETKTYQRRRE----------------------------GGVDEENCVIVQSQTDICEVP-------------------------HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GMVSQ---EGD------------------PTVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL-HAA-------FENDVSKEVLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQAELQ----------------------------PQEEPGWQKDP-------DYVP---PMK-KSKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYALIQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHADSCTGPD------------------GTDGENGEE--------GEVIHKKGKRGRKRKMRSKK-EGSTDSEDNA---E-------PELDD--DDEDEDDDEEEE--TPV-----EIEADPEP-EEPV-SPIPPPAKKRRGRPPGK-ANQAK--------------QNAAVIQVEDHNTRAIENIIVQVKKESD---------------LEAEVVVEAPVLTPAV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|11878220|gb|AAG40852.1|AF305695_1 transcriptional repressor [Xenopus laevis] -----------------MEGEMAEDIVEDSETFMKRKETKTYQRRRE----------------------------GGVDEENCVIVQSQTDICEVP-------------------------HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GMVSQ---EGD------------------PTVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL-HAA-------FENDVSKEVLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQAELQ----------------------------PQEEPGWQKDP-------DYVP---PMK-KSKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYALIQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHADSCTGPD------------------GTDGENGEE--------GEVIHKKGKRGRKRKMRSKK-EGSTDSEDNA---E-------PELDD--DDEDEDDDEEEE--TPV-----EIEADPEP-EEPV-SPIPPPAKKRRGRPPGK-ANQAR--------------QNAAVIQVEDHNTRAIENIIVQVKKESD---------------LEAEVVVEAPVLTPAV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >XENTR|ENSXETP00000062683 pep:known scaffold:JGI_4.2:GL172782.1:1468935:1488905:1 gene:ENSXETG00000015615 transcript:ENSXETT00000060905 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MESEMAEAVVEDSETFMKRKETKTYQRRRE----------------------------GGVDEDNCVIVQSQTDISEVP-------------------------HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GVVSQ---EGD------------------PTVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL-HAA-------FENEVSKEGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQAELQ----------------------------QQEEPGWQKDP-------DYVP---PIK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYALIQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHADNCTGPD------------------GTDGENGGE--------SEVVHKKGKRGRKRKMRSKK-EGSSDSDHNN---E-------PFFQD--N------AEPEQ--TPV-----EIEADPEP-EEPL-TPLPPPAKKRRGRPPGK-ANQAK--------------QNAAVIQVDDHSNRAIENIIVQVKKESD---------------LEAEGGVEAAVPTPAV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|170284950|gb|AAI61099.1| ctcf protein [Xenopus (Silurana) tropicalis] -----------------MESEMAEAVVEDSETFMKRKETKTYQRRRE----------------------------GGVDEDNCVIVQSQTDISEVP-------------------------HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GVVSQ---EGD------------------PTVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL-HAA-------FENEVSKEGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQAELQ----------------------------QQEEPGWQKDP-------DYVP---PIK-KTKKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYALIQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHADNCTGPD------------------GTDGENGGE--------SEVVHKKGKRGRKRKMRSKK-EGSSDSEDNA---E-------PELED--DD-DEDEDDEDE--TPV-----EIEADPEP-EEPL-TPLPPPAKKRRGRPPGK-ANQAK--------------QNAAVIQVDDHSNRAIENIIVQVKKESD---------------LEAEGGVEAAVPTPAV--------------------------EAPNGDLTPE------------MILSMMDR-------------------------------------------------- >LATCH|ENSLACP00000011174 pep:novel scaffold:LatCha1:JH126699.1:452823:489575:1 gene:ENSLACG00000009833 transcript:ENSLACT00000011258 gene_biotype:protein_coding transcript_biotype:protein_coding MQIYCTFLFLRERGREKMENEPSEVILEENETFSKGKERKTYQRRRE----------------------------GGQEEDNGAVIQNHPDGIEVVQDLQNQPDGTEEAQDLQNQSDGIEAQDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------AGVNQ---EGE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PISLGELQLVQVPVPVSV---PVTATTVG----------------------QL-QGT-------YENDVSKEGLQ-GEPVICHTLPLPE------GFQVVKVGANGEVETLEQEELQPP-------------------------PPQEDPNWAKDP-------EFQP---PAK-K-KKTKKSKLR----Y-TEE---------------------------GKDVDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCEKTF-RQKQLLDMHFKRYHD-------PTFVPATFVCTKCGKTFTRRNTMARHAENCTGPD------------------SVEGENGGE--------P----KKSKRGRKKKMRSKR-DDSSGSDENA---E-------PELDD--I------DEEEE--EAVVINDEEMEGGPEA------LPAPPPAKKKRGRPPGK-SNQAK------------STQTAAIIQVEDQNAGTIENIIVEVKKEPD---------------TEEEEGEVEQAQPVVV--------------------------EAPNGDLTPE------------MILSMMDR----------------------------------------------KKKK >ORENI|ENSONIP00000007230 pep:novel scaffold:Orenil1.0:GL831150.1:2536970:2540553:1 gene:ENSONIG00000005736 transcript:ENSONIT00000007235 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------EGGEALTQ---------------------------GEVAGNME-MMVMDALDP-TLLQMKTEVLE-----------------GGGT----MTV------------------SGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVD----------------------GL-QAT-------YVETSAAN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQ----------------------------PQDDPEWTKDP-------DYQPIT-AVR-KGKKGKKSRLR----Y-AEG---------------------------DRDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCDKTF-RQKQLLDMHFKRYHD-------PSFIPTAFVCDKCSKTFTRRNTMLRHADNCTGDA------------------TLE-ENGTP--------PP---KKGRRGRKRKMQSRR-DDDDDDTVNI---E-------GELDE--------AEEEEDMLTEI-----EVEQAPSV--VPIPAPVEPPVKRKRGRPPKSKPDSK-------------RIIAAAIIRVEDETTGEVDDIIV--KKEVG------------ADQDDD---GNEAAQEVVV--------------------------APPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|348509710|ref|XP_003442390.1| PREDICTED: transcriptional repressor CTCF-like [Oreochromis niloticus] -----------------MISCQGNCLNFVSQAICLQGKKARKRAPTLTEADFIGFLKQPRICWPIAMEAEVVSMESAQATDGKVLPEGGEALTQ---------------------------GEVAGNME-MMVMDALDP-TLLQMKTEVLE-----------------GGGT----MTV------------------SGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVD----------------------GL-QAT-------YVETSAAN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQAAHEELQGTRVEEEEEDEEPAEVETSVPPQDDPEWTKDP-------DYQPIT-AVR-KGKKGKKSRLR----Y-AEG---------------------------DRDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCDKTF-RQKQLLDMHFKRYHD-------PSFIPTAFVCDKCSKTFTRRNTMLRHADNCTGDA------------------TLE-ENGTP--------PP---KKGRRGRKRKMQSRR-DDDDDDT------E-------GELDE--------AEEEEDMLTEI-----EVEQAPSV--VPIPAPVEPPVKRKRGRPPKSKPDSK----------------PAAIIRVEDETTGEVDDIIV--KKEVG------------ADQDDD---GNEAAQEVVV--------------------------APPNGDLTPE------------MILSMMDR-------------------------------------------------- >ORENI|ENSONIP00000007229 pep:novel scaffold:Orenil1.0:GL831150.1:2536889:2541050:1 gene:ENSONIG00000005736 transcript:ENSONIT00000007234 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------------MEAEVVSMESAQATDGKVLPEGGEALTQ---------------------------GEVAGNME-MMVMDALDP-TLLQMKTEVLE-----------------GGGT----MTV------------------SGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVD----------------------GL-QAT-------YVETSAAN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQAAHEELQGTRVEEEEEDEEPAEVETSVPPQDDPEWTKDP-------DYQPIT-AVR-KGKKGKKSRLR----Y-AEG---------------------------DRDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCDKTF-RQKQLLDMHFKRYHD-------PSFIPTAFVCDKCSKTFTRRNTMLRHADNCTGDA------------------TLE-ENGTP--------PP---KKGRRGRKRKMQSRR-DDDDDDT------E-------GELDE--------AEEEEDMLTEI-----EVEQAPSV--VPIPAPVEPPVKRKRGRPPKSKPD------------------TAAIIRVEDETTGEVDDIIV--KKEVG------------ADQDDD---GNEAAQEVVVGEGKSTIQMEELSQGEGVAQAGQLSEAPPNGDLTPE------------MILSMMDR-------------------------------------------------- >TAKRU|ENSTRUP00000045998 pep:novel scaffold:FUGU4:scaffold_14:1533474:1536803:-1 gene:ENSTRUG00000017943 transcript:ENSTRUT00000046152 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------------------------------------------EVTGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL-QAT-------YVDTSTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQAA----------------------------EDPDWSKDP-------DYQPIT-TVR-KGKKGKKSRLR----Y-GEG---------------------------DRDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCEKTF-RQKQLLDMHFKRYHD-------PTFVPTAFVCSKCSKTFTRRNTMLRHAENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQSRK-DDDDDDDDDT---E-------PDQED--------MDDEDEMLSEI-----EVEQAPPV--VPIPAPVEPPVKRKRGRPPKNKPAGEF------------QDQPAAIIRVEDEVTGEVDDIIV--KKEVG------------ADQDDQEICNEEAVEQVVV--------------------------APPNGDLTPE------------MILSMMDR-------------------------------------------------- >TAKRU|ENSTRUP00000045999 pep:novel scaffold:FUGU4:scaffold_14:1533474:1536800:-1 gene:ENSTRUG00000017943 transcript:ENSTRUT00000046153 gene_biotype:protein_coding transcript_biotype:protein_coding ---------------------------------------------------------------------------------------------------------------------------VTGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL-QAT-------YVDTSTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQAAHEELQGTREEEEEEEEEPA--------EEDPDWSKDP-------DYQPIT-TVR-KGKKGKKSRLR----Y-GEG---------------------------DRDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCEKTF-RQKQLLDMHFKRYHD-------PTFVPTAFVCSKCSKTFTRRNTMLRHAENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQSRK-DDDDDDDDDTVS-E-------PDQED--------MDDEDEMLSEI-----EVEQAPPV--VPIPAPVEPPVKRKRGRPPKNKPAVAKIVFFGG------FFPAAAIIRVEDEVTGEVDDIIV--KKEVG------------ADQDDQEICNEEAVEQVM----------EELAQEEAAAQEVPLSEAPPNGDLTPE------------MILSMMDR-------------------------------------------------- >TETNI|ENSTNIP00000012180 pep:novel chromosome:TETRAODON8:5:5277710:5281213:1 gene:ENSTNIG00000009314 transcript:ENSTNIT00000012371 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------------------------------------------EVAGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL-QAT-------YVDASTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQAAHEEL-----------------------QEDPDWSKDP-------DYQPIT-TVR-KGKKGKKSRLR----Y-GEG---------------------------DRDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCEKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCSKTFTRRNTMLRHAENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQSRK-DDDDDDTGS----E-------PEPEE--------MDEEDEMLSEI-----EVEQAPPV--VPIPAPVEPPVKRKRGRPPKNKPA------------------TAAIIRVEDEATGEVDDIIV--KKEVG------------ADQDDQEICSEEAVERCSC------------------------LRRRRNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|47230373|emb|CAF99566.1| unnamed protein product [Tetraodon nigroviridis] -------------------------------------------------------------------MEDDVVSMETTQADGKVLPEGVDSLIQGS--------------------AIAQQAEVAGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL-QAT-------YVDASTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQAAHEELQGTREEEEEEEEEAADVEPVVSQQEDPDWSKDP-------DYQPIT-TVR-KGKKGKKSRLR----Y-GEG---------------------------DRDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCEKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCSKTFTRRNTMLRHAENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQSRK-DDDDDDT------E-------PEPEE--------MDEEDEMLSEI-----EVEQAPPV--VPIPAPVEPPVKRKRGRPPKNKPA------------------TAAIIRVEDEATGEVDDIIV--KKEVG------------ADQDDQEICSEEAVEQVVVGGGKSTIQMEELAQEEVAGQEVQLSEAPPKRRPNPRDDPQHDGPVMDASVKSVNTRLGFKKNLFLFFFFFFFFFSFSFGFRLTNACILKRNKHHCCNLRFHF---- >GASAC|ENSGACP00000020939 pep:novel group:BROADS1:groupII:11613372:11616719:-1 gene:ENSGACG00000015865 transcript:ENSGACT00000020979 gene_biotype:protein_coding transcript_biotype:protein_coding -------------------------------------------------------------------------------------------------------------------------GEVVDDMG-MVVMDALDP-TLLQMKTEVLE-----------------GGGT----VTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-TGAALGLGQLQLVQ------V---PVTRATVE----------------------GL-QAT-------YVDASTAN--KDADPVICHTLPLPE------GFQVVKVGANGEVETVEQ-------------------------EVEATVPLEEDPEWSKDP-------DYQPIS-SLRNKGKKGKKSRLR----Y-GEG---------------------------NRDMDVSVYDFEEEQQ-------------------------EGMLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYCSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFLEKGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCEQ-------CDYCCRQ-ERHMVMHKR------------THTGEKPFACSQCDKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCNKTFTRRNTMLRHTENCSGEI------------------E-E-ENGTP--------AP---KKARRGRKRKMQTRR-DDDDTGSNAK---E-------DELDE--------VEEEEE-LSEL-----EVEQDPPV--VPIPAPVEPPVKRKRGRPPKNKPNIPKSDLK--------LLTAAAIIRVEDEVTGEVDDIIV--KKEVG------------VDRDDQEEATDGAVEE-----------------EAVAAPEV--SEAPPNGDLTPE------------MILSMMDR-------------------------------------------------- >GASAC|ENSGACP00000020941 pep:novel group:BROADS1:groupII:11613372:11616695:-1 gene:ENSGACG00000015865 transcript:ENSGACT00000020981 gene_biotype:protein_coding transcript_biotype:protein_coding ----------------------------------------------------------------------------------------------------------------------------------MVVMDALDP-TLLQMKTEVLE-----------------GGGT----VTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-TGAALGLGQLQLVQ------V---PVTRATVE----------------------GL-QAT-------YVDASTAN--KDADPVICHTLPLPE------GFQVVKVGANGEVETVEQEEMEADHDELLEVR----------AEVEATVPLEEDPEWSKDP-------DYQPIS-SLRNKGKKGKKSRLR----Y-GEG---------------------------NRDMDVSVYDFEEEQQ-------------------------EGMLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYCSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFLEKGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCEQ-------CDYCCRQ-ERHMVMHKR------------THTGEKPFACSQCDKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCNKTFTRRNTMLRHTENCSGEI------------------E-E-ENGTP--------AP---KKARRGRKRKMQTRR-DDDDT--------E-------DELDE--------VEEEEE-LSEL-----EVEQDPPV--VPIPAPVEPPVKRKRGRPPKNKPNIPKSDLK--------LLTAAAIIRVEDEVTGEVDDIIV--KKEVG------------VDRDDQEEATDGAVEEVVVGEGKSTIQLEELPQEAVAAPEV--SEAPPNGDLTPE------------MILSMMDR-------------------------------------------------- >ORYLA|ENSORLP00000011017 pep:known chromosome:MEDAKA1:3:20410268:20415910:-1 gene:ENSORLG00000008771 transcript:ENSORLT00000011018 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------METGQATAL----------------------------------------------------ASDGKVLSEGGEALIQTG------------------------QGDEAGTME-MMVMDALDP-ALLQMKTEVLE-----------------GGGT----VTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL-QAT-------YVEASAAN--KDA--VICHTLPLPE------GFQVVKVGANGEVETVEQDELQAAQEDLQGQEGEEVEEDEEAAEIVTSV-PQDDPEWTKDP-------DYQPIT-AVR-KGKKGKKSRLR----Y-AEG---------------------------DRDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKYHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACEQCEKTF-RQKQLLDMHFKRYHD-------PTFVPTAFVCTKCSKTFTRRNTMLRHAEGCTGEA------------------SGD-ENGTP--------TP---KKGRRGRKRKMQARE-KKPDKVDSDT---E-------GELDE--------IEEEDDLLTEI-----EVEQAAPV--IPIPAPIEPPVKRKRGRPPKNKPEVCPCFSGIS------NLSVAAIIHVEDEVQ-EVEELV---KKEVG---------------AEQVNCTDETTEQVITGGGKPGAQSEELSQADAAAQEVQLSAAPSNGDLTPE------------MILSMMDR-------------------------------------------------- >ORENI|ENSONIP00000005164 pep:novel scaffold:Orenil1.0:GL831206.1:2751803:2762795:1 gene:ENSONIG00000004097 transcript:ENSONIT00000005168 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MDGRPTDGV---GVVDVPTKEFPSIQAVHSQDAMVADLLQQAAEAG-------------GHGEGMAAATQSQQQLME------------------------GVGVEGGTGVE-MMVMDSLDP-TLLQMKTEVIDAAVGGSSAAVGVVGGVPGSAHQ---ATV------------------TTVDQTQIITLQVVN--------------------------MEEQ---AALGLGELQLVQ------V---PVSATTVE----------------------ALQQGN-------FVDTTAMP--KDGDPVICHTLPLPE------GFQVVKVGANGEVETVEQEEEG-----AETQPDEEEDEEEEPVQ-----PPNDDPNWAKDP-------DYQPPSGVVK-KIKKGKKSRLR----Y-AEG---------------------------DKDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKPYACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPATFVCPKCNKTFTRRNTMARHAENCSGEV------------------E-DAENGAT--------IP---KKGRRGRKRKMRSRRDDDDDSDEDHA---EQDDDEEEGEGEE--ESSLLQEEEDP---ESM-----ELDQAPAA--IPVPAPDEPPVKRKRGRPPKNAPK-PPTPSKSVRVATKTTASAAAIIQVEDESTGAVENIIV--KKEEGDASAATPLDQGVALTVEGVGLD-EGVETVEL----------------PVNEET--AAASANGDLTPE------------MILSMMDR-------------------------------------------------- >gi|348523553|ref|XP_003449288.1| PREDICTED: transcriptional repressor CTCF-like [Oreochromis niloticus] -----------------MDGRPTDGV---GVVDVPTKEFPSIQAVHSQDAMVADLLQQAAEAGGVVEGQAGVVVEQGHGEGMAAATQSQQQLME------------------------GVGVEGGTGVE-MMVMDSLDP-TLLQMKTEVIDAAVGGSSAAVGVVGGVPGSAHQ---ATV------------------TTVDQTQIITLQVVN--------------------------MEEQ---AALGLGELQLVQ------V---PVSATTVE----------------------ALQQGN-------FVDTTAMP--KDGDPVICHTLPLPE------GFQVVKVGANGEVETVEQEEEG-----AETQPDEEED-EEEPVQ-----PPNDDPNWAKDP-------DYQPPSGVVK-KIKKGKKSRLR----Y-AEG---------------------------DKDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKPYACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPATFVCPKCNKTFTRRNTMARHAENCSGEV------------------E-DAENGAT--------IP---KKGRRGRKRKMRSRRDDDDDSDEDHA---EQDDDEEEGEGEE--ESSLLQEEEDP---ESM-----ELDQAPAA--IPVPAPDEPPVKRKRGRPPKNAPK-PPTPSKSVRVATKTTASAAAIIQVEDESTGAVENIIV--KKEEGDASAATPLDQGVALTVEGVGLD-EGVETVEL----------------PVNEET--AAASANGDLTPE------------MILSMMDR-------------------------------------------------- >ORYLA|ENSORLP00000022986 pep:novel ultracontig:MEDAKA1:ultracontig72:341588:352749:1 gene:ENSORLG00000018357 transcript:ENSORLT00000022987 gene_biotype:protein_coding transcript_biotype:protein_coding ----------------------------------------------------------------------------------MVAGQTQQQLMDAG---------------------VGVAVDGGAGVD-MMVMDSLDP-TLLQMKTEVMDAAVGASSPSAAVVGGVAGAAHQ---ATV------------------TTVDQTQIITLQVVN--------------------------MEEQ---AALGIGELQLVQ------V---PVSATTVE----------------------ALQQGT-------FVDASSIP--KDGDPVICHTLPLPE------GFQVLEDQKG-----------------FSRSVRAHQHSAGAPMQ-----PPNNDPSWAKDP-------DYQPPSGVVK-KVKKGKKSRLR----Y-AEG---------------------------DKDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKPFGCSQCEKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCPKCSKTFTRRNTMARHAENCSGEV------------------D-DAENGAP--------TP---KKGRRGRKKKMRSRR-DEDDSDEDQL---E-PDDEDEAEEEE--EASLLLEEDEP---ESL-----ELDQAPAA--VPVPAPEEPPVKRKRGRPPKNAPK-VPAPSKPVRTPSK-TSSAAAVIQVEDESTGAV-DIIV--KKEEADGAAEAPLQGGVALAVEDAAMDAEGAETVEL----------------ADGEET---VAAANGDLTPE------------MILSMMDR-------------------------------------------------- >GADMO|ENSGMOP00000006796 pep:novel genescaffold:gadMor1:GeneScaffold_4125:40745:45236:1 gene:ENSGMOG00000006386 transcript:ENSGMOT00000006994 gene_biotype:protein_coding transcript_biotype:protein_coding ----------------------------------------------------------------------------------MEAVHNQQQLLE----------------------------EGGGGVE-MMVMESLDP-ALLQMKTE----------------GGVAGGAHQ---ATV------------------TTVDQTQIITLQVVN--------------------------MEEQ---AALGLGELQLVQ------V---PVSASTVE----------------------ALQQGT-------FVDATAMP--KDGDPVICHTLPLPE------GFQVGKQ------------------------------------------VQNDDSAWSKDP-------DYQPPSAALK-KSKKGKKSRLR----Y-AEG---------------------------DKDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNIKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDLGGVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKPYACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPTSFVCPKCSKTFTRRNTMARHAENCNGEI------------------D-DAENGTP--------TP---KRGRRGRKRKMRSRR-DEEEDSEDHA---D--------------PDLLLQEEEEQ---DAM-----ELDQAPAT--VPVPAPEEPPVKRKRGRPPKNAPKPAPTPTKSPRVAAKAAATAAAIIQVEDESTGAVENIIV--KKED-------PRAPGAGLAVEAVGLEAEEVEAVEV----------------QGTEDEAGAAAAANGDLTPE------------MILSMMDR-------------------------------------------------- >GASAC|ENSGACP00000003270 pep:novel group:BROADS1:groupXIX:1433175:1439803:1 gene:ENSGACG00000002504 transcript:ENSGACT00000003281 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------------------------------------------------------------------------MVAAAQHQLMEAA---------------------VGVGVDGGASVE-MMVMDSLDPIPLLQMKTEVIDSAVGGSSATLGVVGGVAGAAHQ---ATV------------------TTVDQTQIITLQVVN--------------------------MEEQ---AALGLGELQLVQ------V---PVSATTVE----------------------ALQQGT-------FVDTTAMP--KDGDPVICHTLPLPE------GFQVNKQTHS-----------------QSTFGEERKEPPAALLH-----PQNDDASWAKDP-------DYQPPHGAFK-KPKKGKKSRLR----Y-GEG---------------------------DKDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDLGGVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKPYACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPTAFVCPKCSKTFTRRNTMVRHSENCNGEV------------------E-DAENGAP--------AP---KKGRRGRKRKMRSRR-DEEDSEDDNA---E--------FGEE--ETSLLQEEEEEEEPESM-----ELDQAPAA--IPVPAPDEPPVKRKRGRPPKNAPKPPPTPSRSARVAAKAAASAAALVQLEDESTGAVENITV--KKEDSQAPEATPAEQGAAPA-------AEGAETVEL----------------PVNEDTASAAAAANGDLTPE------------MILSMMDR-------------------------------------------------- >gi|126632718|emb|CAM56716.1| CCCTC-binding factor (zinc finger protein) [Danio rerio] -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAVIEQ--------------AQAEVEPVVEAQQQLVESV-------------------------VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV------------------TTVDDTQIITLQVVN--------------------------MEEQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL-QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQDELQ-----PQDDQPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK-KVKKTKKSKLR----YNTEG---------------------------DKDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGRKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKPYACSQCEKTF-RQKQLLDMHFRRYHD-------PNFVPTSFVCTKCGKTFTRRNTMARHAENCTGMD------------------SADGENGTP--------P----KRGRGGRKRKMRSRK-DDDDDDDSDEHGEP--------------DLDDIDEEDEDDLLDEDQMG--LLDQAPPS--VPIPAPAEPPIKRKRGRPPKNAPKVSPTKSITK------TTTAAAIIQVEDESTGAIENIIV--KKEPE--------------------------GTDAVVAAQPIIEEVEAVEADVETVQLTVPEAAPNGDLTPE------------MILSMMDR-------------------------------------------------- >gi|53734069|gb|AAH83236.1| Ctcf protein, partial [Danio rerio] -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAVIEQ--------------AQAEVEPVVEAQQQLVESV-------------------------VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV------------------TTVDDTQIITLQVVN--------------------------MEEQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL-QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQDELQ-----PQDDQPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK-KVKKTKKSKLR----YNTEG---------------------------DKDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVRWCRECIAVQ---------------------------------------------------------------LIKKYCPK-------------------------------------------------------------------------------------------------------------------KKKK-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- >DANRE|ENSDARP00000059912 pep:known chromosome:Zv9:18:22066704:22074092:1 gene:ENSDARG00000056621 transcript:ENSDART00000059913 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAVIEQ--------------AQAEVEPVVEAQQQLVESV-------------------------VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV------------------TTVDDTQIITLQVVN--------------------------MEEQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL-QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQDELQ-----PQDDQPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK-KVKKTKKSKLR----YNTEG---------------------------DKDMDVSVYDFEEEQQ-------------------------EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G--------V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVRWCRECIAVQ---------------------------------------------------------------LIK-YCP-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- >SARHA|ENSSHAP00000015681 pep:novel scaffold:DEVIL7.0:GL834666.1:1222728:1242523:1 gene:ENSSHAG00000013360 transcript:ENSSHAT00000015809 gene_biotype:protein_coding transcript_biotype:protein_coding ---------------------------------------------------------------------------------------RMGTEASASPEQFTKIKGTDVIQEKA-KENDVDKVSKLKERQ-SSCGLEVDC-SYGVLQAKIVE-------------------GELELA-PS------------------NEENEKHILTLQTVH--------------------------FATD-ETDHQEMSQLTVQP---AEGM---HVMVQQGE----------------------SGLQSL-------LVLQQDIN-----VQAELNEIPHQN------LHQCVAISIQEEVFSLHEMEVMEINVVEESVEVSSEEDKLTVNS-----PLDENTELIK----------------LCEEREFTDQKEEIF----T-FEKLREGEK---------------------EEIILLPANSEIEEHE-------------------------DVHSS--EQDIDEVSGTAK----NQAKSK--G--------M-KRTFHCEICIFTSSRISSFNRHMKTHSDEKP--HMCHLCLKAFRTVTLLRNHVNTHTGTRPYKCS--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCKYASVEASKLKRHIRSHTGERPFHCCLCSYASKDTYKLKRHMRTHS---GEKPYECYVCHARFTQSGTMKIHILQKHSENVPKHQCPH--CSTVIARKSDL-RVHLRNLHSYKATEMKCRYCPAVFHERYALIQHQKTHRNEKRFKCDD-------CNYACKQ-ERHMTVHKR------------THTGEKPFTCLSCNKCF-RQKQLLNVHFKKYHD-------KNFIPTVYECPKCGKGFSRWNNMRKHSEHCEAVK------------------GKSI-------------PS---AKGRKNKKKKQKDPK-QDAKEEGRQT---------------RNFRSDKVVEQMPIEDTSIVNIEHHPNEIVPVVYGMAA-DV-EE-----------------------------------------------------------------------------------------------------------------------PKTEVTCE------------MILNMMDK-------------------------------------------------- >SARHA|ENSSHAP00000015680 pep:novel scaffold:DEVIL7.0:GL834666.1:1220619:1244690:1 gene:ENSSHAG00000013360 transcript:ENSSHAT00000015808 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------------------------------------------------------MEILK-SY-----------------------------------------------------------RRQI-----------------------------------RD-ETDHQEMSQLTVQP---AEGM---HVMVQQGE----------------------SGLQSL-------LVLQQDIN-----VQAELNEIPHQN------LHQCVAISIQEEVFSLHEMEVMEINVVEESVEVSSEEDKLTVNS-----PLDENTELIKDKDKGIYGAEEGEVQQLCEEREFTDQKEEIF----T-FEKLREGEK---------------------EEIILLPANSEIEEHE-------------------------DVHSS--EQDIDEVSGTAK----NQAKSK--G--------M-KRTFHCEICIFTSSRISSFNRHMKTHSDEKP--HMCHLCLKAFRTVTLLRNHVNTHTGTRPYKCS--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCKYASVEASKLKRHIRSHTGERPFHCCLCSYASKDTYKLKRHMRTHS---GEKPYECYVCHARFTQSGTMKIHILQKHSENVPKHQCPH--CSTVIARKSDL-RVHLRNLHSYKATEMKCRYCPAVFHERYALIQHQKTHRNEKRFKCDD-------CNYACKQ-ERHMTVHKR------------THTGEKPFTCLSCNKCF-RQKQLLNVHFKKYHD-------KNFIPTVYECPKCGKGFSRWNNMRKHSEHCEAVK------------------GKSI-------------PS---AKGRKNKKKKQKDPK-QDAKEE--------------------------VVEQMPIEDTSIVNIEHHPNEIVPVVYGMAA-DV-EE-----------------------------------------------------------------------------------------------------------------------PKTEVTCE------------MILNMMDK-------------------------------------------------- >MONDO|ENSMODP00000020611 pep:novel chromosome:BROADO5:1:486248077:486266900:1 gene:ENSMODG00000016490 transcript:ENSMODT00000020976 gene_biotype:protein_coding transcript_biotype:protein_coding ---------------------------------------------------------------------------------------RMGTEASASPEHFTKIKGTDLIQEKA-KESDVDKVSRLKERQ-SSCGLEVDC-SYGVLQAKIVE-------------------GELELT-PQ------------------TQENEKHILTLQTVH--------------------------FATD-EMDHQEM---TVEP---AEGM---HVMVQQGE----------------------SGLQSL-------L----------------LNEIPHQN------LHHCVAISIQEEVFSLHELEVMEINVVEESVEISSEEDKLTVNP-----PLDENTESVKVE-------KNYEVPQLCEEREITDQKEGLF----T-FDKLREGEK---------------------EEIILLPANSEIEEHE-------------------------DIPSS--EQDTDEVSGTAK----NQAKTK--D--------V-KQTFHCEICIFTSSKISCFNRHMKTHSDEKP--HMCHLCLKAFRTVTLLRNHVNTHTGTRPYKCS--DCDMAFVTSGELVRHRRYKHTHEKPFKCTMCKYASVEASKLKRHIRSHTGERPFHCCLCNYASKDTYKLKRHMRTHS---GEKPYECYVCHARFTQSGTMKIHILQKHSENVPKHQCPH--CATVIARKSDL-RVHLRNLHSYKAAEMKCRYCTDVFHERYALIQHQKTHRNEKRFKCDD-------CSYACKQ-ERHMRVHKR------------THTGEKPFTCLSCNKCF-RQKQLLNVHFKKYHD-------KNFIPTVYECPKCGKGFSRWNNMRKHSELCEVIR------------------GKAV-------------QS---AKGRKTKKKKQKGPK-QDVKEEGK-------------------------FEQMPIEDISNVNIERHTNEIVPVGYGIAT-DVAEE-----------------------------------------------------------------------------------------------------------------------QKTEVTCE------------MILNMMDK-------------------------------------------------- >ORNAN|ENSOANP00000025255 pep:known ultracontig:OANA5:Ultra516:6791142:6804936:-1 gene:ENSOANG00000009260 transcript:ENSOANT00000029059 gene_biotype:protein_coding transcript_biotype:protein_coding -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------SCQDWYKSHL----KAYFIS--G--------V-NKTFHCDICKFTTSRQSSLNRHLKIHSDVKP--HVCHLCLKAFRSATLLRNHVNTHTGTKPYKCG--DCTMAFVTSGELVRHRRYKHTHEKPFQCTICKYASVEASKLKRHIRSHTGERPFRCRLCSYASRDTYKLKRHMRTHS---GEKPYECSVCQTKFTQRGTMKIHMLQKHTENAPKHQCPH--CGTMIARKSDL-RVHLKNLHSYKTTEIKCHYCSAAFHERYLLLQHQKTHRDEKRFKCGD-------CDYACKQ-ERHMIVHKR------------THTGEKPFSCLHCNKRF-RQKRLLSVHFRKYHD-------ENFTPIVYECPKCGKAFSRLWILYERS-HFRYVE------------------WNSIFLCCRCARLFQCPSV---CNLRRTKRS--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- >HOMSA|gi|29570785|ref|NP_542185.2| transcriptional repressor CTCFL [Homo sapiens] ---------------------------------------------------------------------------------------MAATEISVLSEQFTKIKELELMPEKGLKEEEKDGVCREKDHR-SPSELEAER-TSGAFQDSVLE-------------------EEVELVLAP------------------SEESEKYILTLQTVH--------------------------FTSE-AVELQDMSLLSIQQ---QEGV---QVVVQQPG----------------------PGL-------------------------LWLEEGPRQS------LQQCVAISIQQELYSPQEMEVLQFHALEENVMVASEDSKLAVS-------LAETTGLIKLE-------EEQEKNQLLAER----TKEQLF----F-VETMSGDERS--------------------DEIVLTVSNSNVEEQE-------------------------DQPTA--GQADAEKAKSTK----NQRKTK--G--------A-KGTFHCDVCMFTSSRMSSFNRHMKTHTSEKP--HLCHLCLKTFRTVTLLRNHVNTHTGTRPYKCN--DCNMAFVTSGELVRHRRYKHTHEKPFKCSMCKYASVEASKLKRHVRSHTGERPFQCCQCSYASRDTYKLKRHMRTHS---GEKPYECHICHTRFTQSGTMKIHILQKHGENVPKYQCPH--CATIIARKSDL-RVHMRNLHAYSAAELKCRYCSAVFHERYALIQHQKTHKNEKRFKCKH-------CSYACKQ-ERHMTAHIR------------THTGEKPFTCLSCNKCF-RQKQLLNAHFRKYHD-------ANFIPTVYKCSKCGKGFSRWINLHRHSEKCGSGE------------------AKSA-------------AS---GKGRRTRKRKQTILK-EATKGQKEAA---------------KGWKEAANGDEAAAEEASTTKGEQFPGEMFPVACRETTARVKEE-----------------------------------------------------------------------------------------------------------------------VDEGVTCE------------MLLNTMDK-------------------------------------------------- >PELSI|ENSPSIP00000012493 pep:novel scaffold:PelSin_1.0:JH209331.1:1517019:1535688:1 gene:ENSPSIG00000011195 transcript:ENSPSIT00000012554 gene_biotype:protein_coding transcript_biotype:protein_coding ---------------------------------------------------------------------------------------MMAAQDSHLPEPFTKIKGAERIWDRAREDDGGDRLPWVKERN-SICDPDVEV-LNGAPPAKALE-----------------GGRNLELS-PS------------------LIQSEKHLIMLQTVR--------------------------LKEG-EEDLQAVSQLNIQQ---QSGL---HMVVQRGA----------------------SVLQPL-------VVVQQGV-------------GAQQN------IPTGVAISLQDGVYTFHDMEVMQINVLQEKVQAKDEENKS----------MDKSPGMLLIK---------KLVPKNLKNSVKIDRTKDLH----A-VEEILSCTA---------------------KDDISVSLNEPKEQGE---------------------------QSV--VKKTDTLEAHTN----TQHRKK--G--------E-KVTIHCDLCAFTSLRMSSLNRHMKTHSDEKP--HLCHLCLKAFRTVTLLRNHVNTHTGTRPYKCS--DCEMAFVTSGELARHRRYKHTLEKPFKCSVCKYSSVEASKLKRHIRSHTGERPYNCCLCSYASKDTYKLKRHMVTHS---GEKPYECYVCQARFTQSGTMKIHILQKHSENVPRYQCPH--CNAFIARKSDL-GVHLRNLHSYLAVAMKCSYCEAVFHERYALIQHKKTHRNEKRFKCDR-------CSYACKQ-ERHLIVHKR------------THTGEKPFTCVSCSKCF-RQKQLLTVHFRKHHD-------SNFKPTVYECPKCGKGYSRWNNMHKHAENCGLAR------------------AKVV-------------TR---HKGSKGKKKRWNSLK-QDVKQEGCSE-----------VSCGGLWESAGTVDLGSFQDVSVVNTECCASEIVPVEYGIET-STPRE-----------------------------------------------------------------------------------------------------------------------QKTEMTCE------------MILNMMDK-------------------------------------------------- >gi|171474913|gb|ACB47397.1| brother of regulator of imprinted sites [Pogona vitticeps] ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------A------------------LGEGEKHLVLLKTVH--------------------------LKIE-ENDAQGPSVAN-QH---DGVL---HAVMQRER----------------------CILEPL-------EVMTQSI-------------GIRNN------LEEVVAVGLPEGIYTVQEMEVMHINCLKE-MQAFNEDEKS----------TRKTLDALRIE-----------RDRSIDVATDGDKGQPLL----V-AEEDRISPF-------------------------------CLAEAA-------------------------KNLSS--RSKEDPVSFHNS----EKKDQS--N--------EGDVPQHCPFCTFTCFSIAGLRRHMKKHSEERP--HMCHLCLKAFRTVSLLRNHVNTHTGTKPHKCG--ECDMAFVTSGELSRHRRYKHTLEKPFKCTFCSYCSVEASKLKRHIRSHTGERPYHCTLCSYASRDTYKLKRHMVTHS---GEKPFECLICKARFTQAGTLKFHILHKHETNVPKHQCPH--CQTSVARKGDL-SIHLRNLHSYIEVPLRCNYCDAAFHERYAFRQHKKTHRNEKRFKCDQ-------CNYACKQ-ERHMVIHKW------------THTGEKPFVCVACSKCF-RQKQLLRVHFKKHHD-------SSFKPKVYECSKCSKEYSRWSNMHKHAEKCEDRR------------------AI---------------QP---SKGSKGKKKADKRSS-HNRQREGSN----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- >ANOCA|ENSACAP00000016003 pep:novel scaffold:AnoCar2.0:GL343217.1:1034665:1047855:1 gene:ENSACAG00000016243 transcript:ENSACAT00000016323 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HCRFCTYTSSSVTGLNRHMKRHSDKNP--HMCHLCLKVCRTVALLRNHMNTHTGTKPYKCS--ECDMAFVTGGELSRHRRYKHTHEKPFKCTFCNYSSVEASKLKRHIRSHTGERPYNCTLCSYASRDTYKLKRHMLIHS---GEKPFECLICKARFTQAGTLKFHKLHKHGTNVPKYQCPH--CNTAVARKGDL-RIHLQNLHSYIKVPLKCNFCEDAFHERHAFKQHKKTHINEKKFKCDQ-------CNYACKQ-GRHMVMHKR------------THTGEKPFVCISCSKCF-RQKQLLTIHAKKYHD-------SSFQPKVFECPQCGKEYSRWNNMRKHAANCKGKS------------------VV---------------QP---SKGSKKRKKEEKRS---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- >SARHA|ENSSHAP00000001830 pep:novel scaffold:DEVIL7.0:GL867782.1:14413:31463:1 gene:ENSSHAG00000001630 transcript:ENSSHAT00000001851 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------------------------------------------------------------------------------------------------------------------EEGELDWH-PS------------------NEENERHLLALQMLP--------------------------FTTD-ETG-QEMSQLAVQP---AEGM---HIMEPQGE----------------------SGLQSL-------LGWQQYIN-----VQPELNEMPHQD------LSRCVELNIPEEIFSLEEMEMIENYIRDESADFFNED-KWIFDF-----PFDE--ELLKDQDKGIYG-AEWEAQELCEEREYTDPKEEIV----N-FETPRDGEK---------------------EEIILLLANSEIEEHE-------------------------DIHSP--PQDLDEVSQTAA----NQAETK--G--------T-KWTFHCEICKFTSSKMSSLTRHMKTHTAEKP--HMCHLCPKAFRTGTLLRNHLNTHTGTRPYKCS--DCEMAFVTSGELGRHRRYKHTHEKRFKCSMCNYASVEASKLKRHIRSHTGERPFPCSFCNYASKDIFKLKRHMTSHS---GEKPYECSFCSARFSQSGTLKIHVLQKHSGNAPKHQCPH--CATLITRKSDL-RVHLRNLHSYSAAEMKCRYCTAAFHERYALIQHQKTHRDEKRFKCDV-------CSYACKQ-AQHMTIHKR------------IHTGEKPFTCLSCNKSF-RQKQLLKVHFKKYHD-------ETSVPPVHECPKCGKGFSRLNNMRKHSEHCEVVR------------------GKAV-------------PS---A-----KEKDQKGPE-QGAREEVLIG-----------FQTAQGLRNYKVIEEMPIEDISIVNIENATVEMVPVVYGTTS--DVQE-----------------------------------------------------------------------------------------------------------------------PQTEITFE------------MILNMIEK-------------------------------------------------- >PETMA|ENSPMAP00000004689 pep:putative scaffold:Pmarinus_7.0:GL483954:4616:13063:1 gene:ENSPMAG00000004257 transcript:ENSPMAT00000004708 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------EDTHIITLHPVS--------------------------LEETGEGGGTSIGEITLVQVQADLTV---IIHNVKL-----------------------QLNSVL-------KIDVRRTL--LQGVP-ITHTLPLPE------GVQVVKVGPNGELE-VERAPMG----------------------------SQDERSKEKDP-------DYQL---PVKKPVRKGRKNKLR----Y-KQE---------------------------AADADISVYDFEEQEE-------------------------GRLVSSQDVGVEKAIAP-KPPKPTRIKKK--G--------A-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HCCHLCDRAFRTVTLLRNHVNTHTGTKPHKCM--ECDMAFVTSGELVRHRRYRHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCGLCSYASRDTYKLKRHMRTHS---GEKPYECHVCHARFTQSGTMKMHVLQKHTDNVPKYHCPH--CDAVIARKSDL-GVHLRKQHAVLERELRCRYCRAIFHERYALMQHQRTHRNEKRFKCDQ-------CEYACKQ-ERHMIMHKR------------VHTGEKPFECTLCDKTF-RQKQLLDFHFKRYHD-------PSFVPTTYECSKCHRNFTRRSTMMKHFDMCDGEL------------------ESGEQNGK---------AR---RGRRRGRKRKMQSRK-HGSSSESDEMPTDE---------------------DEEEEELNEE-----SVEVADEE----VEEPEPPPMKRRRGRPPKAKPGRPA------KKVAGSDSVACGIIEIIPVTVGGPDGPDDDEEEE--------------EEEEEAAEGEEGAVETAADEG--------------------------PKNDITPE------------MILSMMDQ-------------------------------------------------- >BRAFL|gi|260819198|ref|XP_002604924.1| hypothetical protein BRAFLDRAFT_217118 [Branchiostoma floridae] ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MR---------------------------------------------------------------------------------------------------------------------VGGKTYQCYKCDYTCQRMAFLERHMKVHTDERP--FKCGTCEREFRTMQSLQNHINSHNGVKPHKCD--QCPMSFVTSGELMRHRRYKHTHEKPHKCTMCDYASVEISKLKRHMRSHTGERPFQCGMCSYASPDSYKLKRHMRTHT---GEKPYECSVCLATFTQSGSLKMH-MQRHLGTAPSYVCDI--CGTALTRKSDL-KSHVRKLHTGDKL-LTCKYCDSAFPDKYNLTKHLKTHQGEKRFRCED-------CNYCCTQ-ERHLINHKR------------CHTGEKPFVCVQCDHTF-RQEQLLKQHIKVHHT-------PGYTPPRYACTNCDKSFTRKGNLR-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- >MACEU|ENSMEUP00000007622 pep:novel genescaffold:Meug_1.0:GeneScaffold_3588:549:7519:-1 gene:ENSMEUG00000008337 transcript:ENSMEUT00000008359 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------------------------------------TEASALAEQFTKRKGPDLIQEKDAESDGGLSSRRGSGRG-LEVGC-----PYGVLQAKLVE---------------------GELQLVP------------------SQEGEKHVLTLQTVH--------------------------LAPE------ERGRVAVPP---AEGM---HVVVQPAE----------------------GGFPAV-------LVLQQDLS-----------VLARPS------VRRCVAISLQEELFSLHEMELVEIDVVEDSTDVSCEDRKLE---------------------------KQGMNQPLSEERELPDQREHLF----T-FEALGEGEE---------------------EEIILLPASSEMEERE-------------------------DVPAS--EQDLDEGSETAE----TQAEST--X--------X-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXASKLKRHMRSHTGERPFHCCLCSYASKDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHSENVPKHQCPH--CATVIARKSDL-RVHLRNLHSYKATEMKCRYC-AVFHERYALIQHQKTHRNEKRFKCDD-------CSYACKQ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- >OIKDI|GSOIDT00001753001 ---------------------------------------------------------------------------------------------------------------------------MSQQYEEEPIMEE----------------------------------------------------------------EDGNTTTVQFIA--------------------------VAPEEHERMVREGELPETIQGADVQM---TIGGTESRPVRLIAVDSNGIPVTDPTILQAAAEQAG-------IMFVSKDENDHENQISIDEAMQLRN---------------EENQQVPGYQDQ----------------------------PPADPYDYQAEP-----------------EYHDSEKGVPLR----NYTSQDVVYEEAPENG----------------GIKQQENIYDYQDVPIQN-----------------------EPKIDNHYKKSQSNTQRTKYPGTIQVGAN--G--------EKRKVYQCSECAFYSHRHSNLIRHMKIHTDERP--YKCHLCARAFRTNTLLRNHINTHLGVKPYKCPEANCEMAFVTSGELTRHRRYKHTHEKPFKCTLCEYASVEISKLRRHFRSHTGERPFSCDICGKAFADSFHLKRHKFSHT---GEKPYECPHCKARFTQHGSLKMHVMQQHTKTAPKFECEREICHTMLGRKSDL-NVHLRKQHSYQEIPMQCRYCEEVFHDRWSLMQHQKTHRS-GRYRIDEDGNQIFDPDYDSEM-EDMEGGHGSGNLVYTEDGHVIGQDGSPNVIVQRVQHIDGQQGVPIEDHQQMQHDEHNMHAVEHQMDHSQEAPPIHQGHHRQAHAEAKSERFEQEQQHDPFAFDDDQNMGNGEYMHAPQMGQTMAQPVE------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- >CIOIN|ENSCINP00000013892 pep:known scaffold:KH:HT000097.1:44901:51453:1 gene:ENSCING00000006765 transcript:ENSCINT00000013892 gene_biotype:protein_coding transcript_biotype:protein_coding ----------------------------------------------------------------------------------------MADDGKDSVEVTSQIESVNGTVAPPNDSIEPEENEVSEKNE-EKIIEEVSNEAPAEAPV-----------------------------------------------------EDGTTTTVQFIA--------------------------VSADEHARMVTAGELPESVHGADVHM---QIPGAEAQQVRLIAVDSNGVPVTESAILQAAAEQAG-------IVFITKDDNGHDSQITIDQAMQLSM------RENPPMITATTAAVVLEQKEK-----------------EVHQDT-----DGNDTEDESETK-----------------KPSVRKPRLKLR----VYSQKDGGVDEVATSFLDDGSLALTDEVETLEEKPNDESVYEFQ-----------------------------DPP----TGGDTEQPSDVTMLDPTMFKKN--SKRATDKSRDRRKIYQCRECSFYSHRHSNLVRHMKIHTDERP--YKCHLCERSFRTNTLLRNHINTHTGVKPYKCTVDGCVMAFVTSGELTRHTRYIHTHEKPFRCTLCDYASVEISKLRRHFRSHTGERPYSCEECGKAFADSFHLKRHRMSHT---GEKPYECPECNQRFTQRGSVKMHIMQQHTKTAPKFKCEI--CRTLLGRKSDL-NVHMRKQHAFQ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ >CIOIN|ENSCINP00000031196 pep:known scaffold:KH:HT000097.1:44901:51453:1 gene:ENSCING00000006765 transcript:ENSCINT00000036493 gene_biotype:protein_coding transcript_biotype:protein_coding ----------------------------------------------------------------------------------------MADDGKDSVEVTSQIESVNGTVAPPNDSIEPEENEVSEKNE-EKIIEEVSNEAPAEAPV-----------------------------------------------------EDGTTTTVQFIA--------------------------VSADEHARMVTAGELPESVHGADVHM---QIPGAEAQQVRLIAVDSNGVPVTESAILQAAAEQAG-------IVFITKDDNGHDSQITIDQAMQLSM------RENPPMITATTAAVVLEQKEK-----------------EVHQDT-----DGNDTEDESETK-----------------KPSVRKPRLKLR----VYSQKDGGVDEVATSFLDDGSLALTDEVETLEEKPNDESVYEFQQRQRGTMEEWRDRECEIILKDILQTLKDEDPP----TGGDTEQPSDVTMLDPTMFKKN--SKRATDKSRDRRKIYQCRECSFYSHRHSNLVRHMKIHTDERP--YKCHLCERSFRTNTLLRNHINTHTGVKPYKCTVDGCVMAFVTSGELTRHTRYIHTHEKPFRCTLCDYASVEISKLRRHFRSHTGERPYSCEECGKAFADSFHLKRHRMSHT---GEKPYECPECNQRFTQRGSVKMHIMQQHTKTAPKFKCEI--CRTLLGRKSDL-NVHMRKQHAFQ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ >STRPU|gi|72028083|ref|XP_797592.1| PREDICTED: uncharacterized protein LOC593001 [Strongylocentrotus purpuratus] ---------------MDENTDQPGSQTEEPVAPADQGEAEGTDAASMNVLETYLQNFNEELSSGPATAAGAVQQAAASTVAATEEDDTESPLAVHVEEVADAEEDVQLEEGAEGDNVAVVKQEIGEEEEMEEEVGEASQPAPDQATIDITHTLQLLANASANISNPNALQENQEGEMGTENTFMQQLDTSNLVDENGAKVDPSRIAGIQTVNGEQVVMVHNMEDGTSGIGQQQVLMVAMQDNGQLSSMDQGVAQIAFSGAPVNMVGDNIVYQTTQ-------NTQYVPVSHNGTTQLAMTQSGGEPGTEVYTILQTVDGAETTTITTPTTMVVSSGGVHHLGDEQTHVIATTSGDHPSYAELQ---------------------------PVSEDAAQQEEGALQMAAAEHEEVALIVQKPVKRKRGRPRKDEAAKMQTQIVIVREVVEG-----------------EDGQDPSVYDFYAGED-------------------------DTAPV--AGGDEK----------SGIEVG--G-------AKKKTRYVVPKFDDGRLLDQVLNRAKKGGPGRRPKVHECHLCGRIFRTSTLLRNHENTHSGTKPYKCE--LCPKAFGTSGELGRHMKYMHTHEKPHKCPLCDYLSVEASKIKRHMRSHTGEKPYKCTLCEYASTDNYKLKRHMRVHT---GERPFNCSQCDQSFSQKSSLKEH-EWKHVGNRPSHKCDH--CDTTFGRYADM-KTHVRKMHTAGE-PMICKICENAFTDRFTYMQHVRGHRGEKIYKCGE-------CGYSAPQ-KRHLVIHMR------------VHTGERPYECEECHETF-KHKQTLINHQRSKHNLIQEADGTKKRKATDEITSPSKRITRRQRMQIQEEEETEEMVEPHTLTTADGNTIQVSMAQGGEGTVQLVQTSDGTMPVILTVGGDGQNVDEALQMMNGSLAAVQGHQDGEGQLMMAVPQGDGDNLHLEGQQASQQLQTSDD--------------------------IADDSQPPELQQEGQVQEVSAPRESSSITQEQAAALQQQMVSQGIISEGSVIAAMEEDEDGTGDGTIYLFVEEQ-------------------------------------------------------------------------------------------------------------------------