>HOMSA|gi|5729790|ref|NP_006556.1| transcriptional repressor CTCF isoform 1 [Homo sapiens] -----------------MEGDAVEAIVEESETFIKGKERKTYQRRRE----------------------------GGQEEDACHLPQNQTDGGEVV-------------------------QDVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVAP---EAE------------------AAVDDTQIITLQVVN--------------------------MEEQ----PINIGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKEGLAESEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELP----------------------------PQEDPSWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCAGPD------------------GVEGENGGE--------T----KKSKRGRKRKMRSKK-EDSSDS-ENA---E-------PDLDD--N------EDEEE--PAV-----EIEPEPEP--QPV-TPAPPPAKKRRGRPPGR-TNQPK------------QNQPTAIIQVEDQNTGAIENIIVEVKKEPD---------------AEPAEGEEEEAQPAAT--------------------------DAPNGDLTPEMILSMMDR-------------------------------- >PELSI|ENSPSIP00000003796 pep:novel scaffold:PelSin_1.0:JH205822.1:16124:37080:-1 gene:ENSPSIG00000003593 transcript:ENSPSIT00000003816 gene_biotype:protein_coding transcript_biotype:protein_coding -----------MEQGDKMEGEAIEAIGEESETFIKGKERKTYQRRRE----------------------------GGQEEDACHMPPNQADGAEVV-------------------------QEVSGGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCSGPD------------------GVEGENGGE--------P----KKGKRGRKRKMRSKK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--EPV-APTPPPAKKRRGRPPGK-ANQPK------------QPQPAAIIQVEDQNTGAIENIIVEVKKEPD---------------AETV-GEEEEAQPAAV--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >gi|171474905|gb|ACB47393.1| CCCTC-binding factor [Pogona vitticeps] -----------------MEGEVVEAVGEESETFIKGKERKTYQRRRE----------------------------GGQEEDACPMPPNQADGSEVV-------------------------QDVNAGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVQQ---EAE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVSV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQSELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PTFVPAAFVCSKCGKTFTRRNTMARHADNCTGPD------------------GVEGENGGE--------P----KKGKRGRKRKMRSKK-EDSSDSEENA---E-------PELDN--N------EEEEE--TAI-----EIEAEPEV--EPV-APVPPPAKKRRGRPPGK-SNQPK------------QTQPTTIIQVEDQNTDAIENIIVEVKKEPE---------------AETV-GATAGTQPAAA--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >gi|327281289|ref|XP_003225381.1| PREDICTED: transcriptional repressor CTCF-like [Anolis carolinensis] -----------------MEGEVVEAIGEESETFIKGKERKTYQRRRE----------------------------GGQEEDVCSMPPNQADGTEVV-------------------------QDVNTGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVQQ---EAE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------DL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQAELQ----------------------------PQEDPGWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCTGPD------------------GVEGENGGE--------P----KKGKRGRKRKMRSKK-ENSSDSEENA---E-------PELYDIEE------EDEEE--TAV-----EIEAEPEIEAEPV-APPPPPAKKRRGRPPGK-ANQPK------------QPQPTAIIQVEDESTGTIENIIVEVKKEPE---------------AETV-GVAAGAQPEAV--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >MONDO|ENSMODP00000007129 pep:novel chromosome:BROADO5:1:685282646:685300408:1 gene:ENSMODG00000005757 transcript:ENSMODT00000007273 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIMEESETFIKGKERKTYQRRRE----------------------------GGQDEDACHISQTQADGSEVV-------------------------QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAD------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVAKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRSKK-EESSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--QPV-TPAPPPAKKRRGRPPGK-SSQPK------------QTQPTAIIQVEDQNTGAIENIIVEVKKEPD---------------AETVEGEEEEPQSAVV--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >SARHA|ENSSHAP00000004296 pep:novel scaffold:DEVIL7.0:GL834762.1:75014:90492:1 gene:ENSSHAG00000003783 transcript:ENSSHAT00000004340 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIMEESETFIKGKERKTYQRRRE----------------------------GGQDEDACHISQTQADGSEVV-------------------------QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAD------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-HGA-------YENEVSKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRSKK-EESSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--QPV-TPAPPPAKKRRGRPPGK-ASQPK------------QTQPTAIIQVEDQNTGAIENIIVEVKKEPD---------------AEAVEGEEEEPPSAVV--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >gi|396094|emb|CAA80319.1| CTCF protein [Gallus gallus] -----------------MEGEAVEAIVEESETFIKGKERKTYQRRRE----------------------------GGQEDEACHIAPNQADGGEVV-------------------------QDVNSGVQ-MVMMEHLDP-TLLQMKTEVME-----------------GAVPQ---ETE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKNKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCSGLD------------------GGEGENGGE--------T----KKGKRGRKRKMRSKK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--SAE-APAPPPSKKRRGRPPGKAATQTK------------QSQPAAIIQVEDQNTGEIENIIVEVKKEPD---------------AETVE-EEEEAQPAVV--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >MELGA|ENSMGAP00000001773 pep:novel chromosome:UMD2:13:1721989:1735383:-1 gene:ENSMGAG00000002200 transcript:ENSMGAT00000002439 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIVEESETFIKGKERKTYQRRRE----------------------------GGQEDEACHIAPNQADGGEVV-------------------------QDVNSGVQ-MVMMEQLDP-TLLQMKTEVME-----------------GAVPQ---ETE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-QGA-------YENEVSKGGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKNKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKKGLG-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHSATVGEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCSGLD------------------GGEGENGGE--------T----KKGKRGRKRKMRSKK-EDSSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--EPE-APAPPPSKKRRGRPPGKAATQTK------------QSQPAAIIQVEDQNTGEIENIIVEVKKEPD---------------AETVE-EEEEAQPAVV--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >SARHA|ENSSHAP00000004295 pep:novel scaffold:DEVIL7.0:GL834762.1:75014:90492:1 gene:ENSSHAG00000003783 transcript:ENSSHAT00000004339 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MEGEAVEAIMEESETFIKGKERKTYQRRRE----------------------------GGQDEDACHISQTQADGSEVV-------------------------QEVNSSVQ-MVMMEQLDP-TLLQMKTEVME-----------------GTVPQ---EAD------------------ATVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQVPVPVTV---PVATTSVE----------------------EL-HGA-------YENEVSKEGLPEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQGELQ----------------------------PQEDPNWQKDP-------DYQP---PAK-KTKKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQGERNGFCIRI------------TVDTIKNFRVSVSSGAFDRQKQLLDMHFKRYHD-------PNFVPAAFVCSKCGKTFTRRNTMARHADNCTGLD------------------GIDGENGGE--------T----KKGKRGRKRKMRSKK-EESSDSEENA---E-------PDLDD--N------EDEEE--TAV-----EIEAEPEV--QPV-TPAPPPAKKRRGRPPGK-ASQPK------------QTQPTAIIQVEDQNTGAIENIIVEVKKEPD---------------AEAVEGEEEEPPSAVV--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >gi|34785484|gb|AAH57697.1| Ctcf protein [Xenopus laevis] -----------------MEGEMAEDIVEDSETFMKRKETKTYQRRRE----------------------------GGVDEENCVIVQSQTDICEVP-------------------------HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GMVSQ---EGD------------------PTVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL-HAA-------FENDVSKEVLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQAELQ----------------------------PQEEPGWQKDP-------DYVP---PMK-KSKKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYALIQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHADSCTGPD------------------GTDGENGEE--------GEVIHKKGKRGRKRKMRSKK-EGSTDSEDNA---E-------PELDD--DDEDEDDDEEEE--TPV-----EIEADPEP-EEPV-SPIPPPAKKRRGRPPGK-ANQAK--------------QNAAVIQVEDHNTRAIENIIVQVKKESD---------------LEAEVVVEAPVLTPAV--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >XENTR|ENSXETP00000062683 pep:known scaffold:JGI_4.2:GL172782.1:1468935:1488905:1 gene:ENSXETG00000015615 transcript:ENSXETT00000060905 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MESEMAEAVVEDSETFMKRKETKTYQRRRE----------------------------GGVDEDNCVIVQSQTDISEVP-------------------------HDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------GVVSQ---EGD------------------PTVDDTQIITLQVVN--------------------------MEEQ----PINLGELQLVQ--VPVAV---PMATTSVG----------------------EL-HAA-------FENEVSKEGLQEGEPMICHTLPLPE------GFQVVKVGANGEVETLEQAELQ----------------------------QQEEPGWQKDP-------DYVP---PIK-KTKKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--DVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDTVFHERYALIQHQKSHKNEKRFKCDQ-------CEYACRQ-ERHMIMHKR------------THTGEKPYACSHCDKTF-RQKQLLDMHFKRYHD-------PSFVPAAFVCSKCGKTFTRRNTMSRHADNCTGPD------------------GTDGENGGE--------SEVVHKKGKRGRKRKMRSKK-EGSSDSDHNN---E-------PFFQD--N------AEPEQ--TPV-----EIEADPEP-EEPL-TPLPPPAKKRRGRPPGK-ANQAK--------------QNAAVIQVDDHSNRAIENIIVQVKKESD---------------LEAEGGVEAAVPTPAV--------------------------EAPNGDLTPEMILSMMDR-------------------------------- >LATCH|ENSLACP00000011174 pep:novel scaffold:LatCha1:JH126699.1:452823:489575:1 gene:ENSLACG00000009833 transcript:ENSLACT00000011258 gene_biotype:protein_coding transcript_biotype:protein_coding MQIYCTFLFLRERGREKMENEPSEVILEENETFSKGKERKTYQRRRE----------------------------GGQEEDNGAVIQNHPDGIEVVQDLQNQPDGTEEAQDLQNQSDGIEAQDVNSNVQ-MVMMEQLDP-TLLQMKTEVME-----------------AGVNQ---EGE------------------ATVDDTQIITLQVVN--------------------------MEEQ----PISLGELQLVQVPVPVSV---PVTATTVG----------------------QL-QGT-------YENDVSKEGLQ-GEPVICHTLPLPE------GFQVVKVGANGEVETLEQEELQPP-------------------------PPQEDPNWAKDP-------EFQP---PAK-K-KKTKKSKLR----Y-TEE-----------GKDVDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCP--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMIMHKR------------THTGEKPYACSHCEKTF-RQKQLLDMHFKRYHD-------PTFVPATFVCTKCGKTFTRRNTMARHAENCTGPD------------------SVEGENGGE--------P----KKSKRGRKKKMRSKR-DDSSGSDENA---E-------PELDD--I------DEEEE--EAVVINDEEMEGGPEA------LPAPPPAKKKRGRPPGK-SNQAK------------STQTAAIIQVEDQNAGTIENIIVEVKKEPD---------------TEEEEGEVEQAQPVVV--------------------------EAPNGDLTPEMILSMMDRKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK >ORENI|ENSONIP00000007230 pep:novel scaffold:Orenil1.0:GL831150.1:2536970:2540553:1 gene:ENSONIG00000005736 transcript:ENSONIT00000007235 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------EGGEALTQ---------------------------GEVAGNME-MMVMDALDP-TLLQMKTEVLE-----------------GGGT----MTV------------------SGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVD----------------------GL-QAT-------YVETSAAN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQ----------------------------PQDDPEWTKDP-------DYQPIT-AVR-KGKKGKKSRLR----Y-AEG-----------DRDMDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCDKTF-RQKQLLDMHFKRYHD-------PSFIPTAFVCDKCSKTFTRRNTMLRHADNCTGDA------------------TLE-ENGTP--------PP---KKGRRGRKRKMQSRR-DDDDDDTVNI---E-------GELDE--------AEEEEDMLTEI-----EVEQAPSV--VPIPAPVEPPVKRKRGRPPKSKPDSK-------------RIIAAAIIRVEDETTGEVDDIIV--KKEVG------------ADQDDD---GNEAAQEVVV--------------------------APPNGDLTPEMILSMMDR-------------------------------- >TAKRU|ENSTRUP00000045998 pep:novel scaffold:FUGU4:scaffold_14:1533474:1536803:-1 gene:ENSTRUG00000017943 transcript:ENSTRUT00000046152 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------------------------------------------EVTGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL-QAT-------YVDTSTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQAA----------------------------EDPDWSKDP-------DYQPIT-TVR-KGKKGKKSRLR----Y-GEG-----------DRDMDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCEKTF-RQKQLLDMHFKRYHD-------PTFVPTAFVCSKCSKTFTRRNTMLRHAENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQSRK-DDDDDDDDDT---E-------PDQED--------MDDEDEMLSEI-----EVEQAPPV--VPIPAPVEPPVKRKRGRPPKNKPAGEF------------QDQPAAIIRVEDEVTGEVDDIIV--KKEVG------------ADQDDQEICNEEAVEQVVV--------------------------APPNGDLTPEMILSMMDR-------------------------------- >TETNI|ENSTNIP00000012180 pep:novel chromosome:TETRAODON8:5:5277710:5281213:1 gene:ENSTNIG00000009314 transcript:ENSTNIT00000012371 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------------------------------------------EVAGNME-MMVMDALDP-TLLQMKTEVLD-----------------GGGT----MTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL-QAT-------YVDASTTN--KDAEPVICHTLPLPE------GFQVVKVGANGEVETVEQEELQAAHEEL-----------------------QEDPDWSKDP-------DYQPIT-TVR-KGKKGKKSRLR----Y-GEG-----------DRDMDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIEMGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACSQCEKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCSKTFTRRNTMLRHAENCMGDV------------------E-D-ENGTP--------TP---KKGRRGRKRKMQSRK-DDDDDDTGS----E-------PEPEE--------MDEEDEMLSEI-----EVEQAPPV--VPIPAPVEPPVKRKRGRPPKNKPA------------------TAAIIRVEDEATGEVDDIIV--KKEVG------------ADQDDQEICSEEAVERCSC------------------------LRRRRNGDLTPEMILSMMDR-------------------------------- >GASAC|ENSGACP00000020939 pep:novel group:BROADS1:groupII:11613372:11616719:-1 gene:ENSGACG00000015865 transcript:ENSGACT00000020979 gene_biotype:protein_coding transcript_biotype:protein_coding -------------------------------------------------------------------------------------------------------------------------GEVVDDMG-MVVMDALDP-TLLQMKTEVLE-----------------GGGT----VTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-TGAALGLGQLQLVQ------V---PVTRATVE----------------------GL-QAT-------YVDASTAN--KDADPVICHTLPLPE------GFQVVKVGANGEVETVEQ-------------------------EVEATVPLEEDPEWSKDP-------DYQPIS-SLRNKGKKGKKSRLR----Y-GEG-----------NRDMDVSVYDFEEEQQ--EGMLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYCSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFLEKGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCEQ-------CDYCCRQ-ERHMVMHKR------------THTGEKPFACSQCDKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCSKCNKTFTRRNTMLRHTENCSGEI------------------E-E-ENGTP--------AP---KKARRGRKRKMQTRR-DDDDTGSNAK---E-------DELDE--------VEEEEE-LSEL-----EVEQDPPV--VPIPAPVEPPVKRKRGRPPKNKPNIPKSDLK--------LLTAAAIIRVEDEVTGEVDDIIV--KKEVG------------VDRDDQEEATDGAVEE-----------------EAVAAPEV--SEAPPNGDLTPEMILSMMDR-------------------------------- >ORYLA|ENSORLP00000011017 pep:known chromosome:MEDAKA1:3:20410268:20415910:-1 gene:ENSORLG00000008771 transcript:ENSORLT00000011018 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------METGQATAL----------------------------------------------------ASDGKVLSEGGEALIQTG------------------------QGDEAGTME-MMVMDALDP-ALLQMKTEVLE-----------------GGGT----VTV------------------TGGDEGQIITLQVVN--------------------------MEEQ-AGAALGLGQLQLVQ------V---PVTTTTVE----------------------GL-QAT-------YVEASAAN--KDA--VICHTLPLPE------GFQVVKVGANGEVETVEQDELQAAQEDLQGQEGEEVEEDEEAAEIVTSV-PQDDPEWTKDP-------DYQPIT-AVR-KGKKGKKSRLR----Y-AEG-----------DRDMDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYSSVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKYHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKTHKNEKRFKCDQ-------CDYCCRQ-ERHMIMHKR------------THTGEKPFACEQCEKTF-RQKQLLDMHFKRYHD-------PTFVPTAFVCTKCSKTFTRRNTMLRHAEGCTGEA------------------SGD-ENGTP--------TP---KKGRRGRKRKMQARE-KKPDKVDSDT---E-------GELDE--------IEEEDDLLTEI-----EVEQAAPV--IPIPAPIEPPVKRKRGRPPKNKPEVCPCFSGIS------NLSVAAIIHVEDEVQ-EVEELV---KKEVG---------------AEQVNCTDETTEQVITGGGKPGAQSEELSQADAAAQEVQLSAAPSNGDLTPEMILSMMDR-------------------------------- >ORENI|ENSONIP00000005164 pep:novel scaffold:Orenil1.0:GL831206.1:2751803:2762795:1 gene:ENSONIG00000004097 transcript:ENSONIT00000005168 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------MDGRPTDGV---GVVDVPTKEFPSIQAVHSQDAMVADLLQQAAEAG-------------GHGEGMAAATQSQQQLME------------------------GVGVEGGTGVE-MMVMDSLDP-TLLQMKTEVIDAAVGGSSAAVGVVGGVPGSAHQ---ATV------------------TTVDQTQIITLQVVN--------------------------MEEQ---AALGLGELQLVQ------V---PVSATTVE----------------------ALQQGN-------FVDTTAMP--KDGDPVICHTLPLPE------GFQVVKVGANGEVETVEQEEEG-----AETQPDEEEDEEEEPVQ-----PPNDDPNWAKDP-------DYQPPSGVVK-KIKKGKKSRLR----Y-AEG-----------DKDMDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKPYACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPATFVCPKCNKTFTRRNTMARHAENCSGEV------------------E-DAENGAT--------IP---KKGRRGRKRKMRSRRDDDDDSDEDHA---EQDDDEEEGEGEE--ESSLLQEEEDP---ESM-----ELDQAPAA--IPVPAPDEPPVKRKRGRPPKNAPK-PPTPSKSVRVATKTTASAAAIIQVEDESTGAVENIIV--KKEEGDASAATPLDQGVALTVEGVGLD-EGVETVEL----------------PVNEET--AAASANGDLTPEMILSMMDR-------------------------------- >ORYLA|ENSORLP00000022986 pep:novel ultracontig:MEDAKA1:ultracontig72:341588:352749:1 gene:ENSORLG00000018357 transcript:ENSORLT00000022987 gene_biotype:protein_coding transcript_biotype:protein_coding ----------------------------------------------------------------------------------MVAGQTQQQLMDAG---------------------VGVAVDGGAGVD-MMVMDSLDP-TLLQMKTEVMDAAVGASSPSAAVVGGVAGAAHQ---ATV------------------TTVDQTQIITLQVVN--------------------------MEEQ---AALGIGELQLVQ------V---PVSATTVE----------------------ALQQGT-------FVDASSIP--KDGDPVICHTLPLPE------GFQVLEDQKG-----------------FSRSVRAHQHSAGAPMQ-----PPNNDPSWAKDP-------DYQPPSGVVK-KVKKGKKSRLR----Y-AEG-----------DKDMDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKPFGCSQCEKTF-RQKQLLDMHFKRYHD-------PNFVPTAFVCPKCSKTFTRRNTMARHAENCSGEV------------------D-DAENGAP--------TP---KKGRRGRKKKMRSRR-DEDDSDEDQL---E-PDDEDEAEEEE--EASLLLEEDEP---ESL-----ELDQAPAA--VPVPAPEEPPVKRKRGRPPKNAPK-VPAPSKPVRTPSK-TSSAAAVIQVEDESTGAV-DIIV--KKEEADGAAEAPLQGGVALAVEDAAMDAEGAETVEL----------------ADGEET---VAAANGDLTPEMILSMMDR-------------------------------- >GADMO|ENSGMOP00000006796 pep:novel genescaffold:gadMor1:GeneScaffold_4125:40745:45236:1 gene:ENSGMOG00000006386 transcript:ENSGMOT00000006994 gene_biotype:protein_coding transcript_biotype:protein_coding ----------------------------------------------------------------------------------MEAVHNQQQLLE----------------------------EGGGGVE-MMVMESLDP-ALLQMKTE----------------GGVAGGAHQ---ATV------------------TTVDQTQIITLQVVN--------------------------MEEQ---AALGLGELQLVQ------V---PVSASTVE----------------------ALQQGT-------FVDATAMP--KDGDPVICHTLPLPE------GFQVGKQ------------------------------------------VQNDDSAWSKDP-------DYQPPSAALK-KSKKGKKSRLR----Y-AEG-----------DKDMDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNIKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDLGGVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKPYACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPTSFVCPKCSKTFTRRNTMARHAENCNGEI------------------D-DAENGTP--------TP---KRGRRGRKRKMRSRR-DEEEDSEDHA---D--------------PDLLLQEEEEQ---DAM-----ELDQAPAT--VPVPAPEEPPVKRKRGRPPKNAPKPAPTPTKSPRVAAKAAATAAAIIQVEDESTGAVENIIV--KKED-------PRAPGAGLAVEAVGLEAEEVEAVEV----------------QGTEDEAGAAAAANGDLTPEMILSMMDR-------------------------------- >GASAC|ENSGACP00000003270 pep:novel group:BROADS1:groupXIX:1433175:1439803:1 gene:ENSGACG00000002504 transcript:ENSGACT00000003281 gene_biotype:protein_coding transcript_biotype:protein_coding -----------------------------------------------------------------------------------MVAAAQHQLMEAA---------------------VGVGVDGGASVE-MMVMDSLDPIPLLQMKTEVIDSAVGGSSATLGVVGGVAGAAHQ---ATV------------------TTVDQTQIITLQVVN--------------------------MEEQ---AALGLGELQLVQ------V---PVSATTVE----------------------ALQQGT-------FVDTTAMP--KDGDPVICHTLPLPE------GFQVNKQTHS-----------------QSTFGEERKEPPAALLH-----PQNDDASWAKDP-------DYQPPHGAFK-KPKKGKKSRLR----Y-GEG-----------DKDMDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDLGGVHLRKQHSFIETGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDM-------CDYCCRQ-ERHMVMHRR------------THTGEKPYACSQCEKTF-RQKQLLDMHFKRYHD-------PNFIPTAFVCPKCSKTFTRRNTMVRHSENCNGEV------------------E-DAENGAP--------AP---KKGRRGRKRKMRSRR-DEEDSEDDNA---E--------FGEE--ETSLLQEEEEEEEPESM-----ELDQAPAA--IPVPAPDEPPVKRKRGRPPKNAPKPPPTPSRSARVAAKAAASAAALVQLEDESTGAVENITV--KKEDSQAPEATPAEQGAAPA-------AEGAETVEL----------------PVNEDTASAAAAANGDLTPEMILSMMDR-------------------------------- >gi|126632718|emb|CAM56716.1| CCCTC-binding factor (zinc finger protein) [Danio rerio] -----------------MEGGPTEAVVEDAGDAFKAKECKTYQRRREDEEVGAELLQAAVIEQ--------------AQAEVEPVVEAQQQLVESV-------------------------VSVNSSVD-MMMMETLDP-ALLQMKTEVMEAAVGAPVA-------VAGAAHE---ATV------------------TTVDDTQIITLQVVN--------------------------MEEQ----QLGLGELQLVQVPVSA-V---PVTAATVE----------------------EL-QGT-------LVDATAMP--KDGEPVICHTLPLPE------GFQVVKVGANGEVETVEQDELQ-----PQDDQPPHQEEEEEMAE-----PQNEDPAWSKDP-------DYTP---PVK-KVKKTKKSKLR----YNTEG-----------DKDMDVSVYDFEEEQQ--EGLLS--EVNAEKVVGNMKPPKPTKIKKK--G-V-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCT--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHS---GEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPH--CDTVIARKSDL-GVHLRKQHSYIEQGRKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQ-------CDYACRQ-ERHMVMHKR------------THTGEKPYACSQCEKTF-RQKQLLDMHFRRYHD-------PNFVPTSFVCTKCGKTFTRRNTMARHAENCTGMD------------------SADGENGTP--------P----KRGRGGRKRKMRSRK-DDDDDDDSDEHGEP--------------DLDDIDEEDEDDLLDEDQMG--LLDQAPPS--VPIPAPAEPPIKRKRGRPPKNAPKVSPTKSITK------TTTAAAIIQVEDESTGAIENIIV--KKEPE--------------------------GTDAVVAAQPIIEEVEAVEADVETVQLTVPEAAPNGDLTPEMILSMMDR-------------------------------- >SARHA|ENSSHAP00000015681 pep:novel scaffold:DEVIL7.0:GL834666.1:1222728:1242523:1 gene:ENSSHAG00000013360 transcript:ENSSHAT00000015809 gene_biotype:protein_coding transcript_biotype:protein_coding ---------------------------------------------------------------------------------------RMGTEASASPEQFTKIKGTDVIQEKA-KENDVDKVSKLKERQ-SSCGLEVDC-SYGVLQAKIVE-------------------GELELA-PS------------------NEENEKHILTLQTVH--------------------------FATD-ETDHQEMSQLTVQP---AEGM---HVMVQQGE----------------------SGLQSL-------LVLQQDIN-----VQAELNEIPHQN------LHQCVAISIQEEVFSLHEMEVMEINVVEESVEVSSEEDKLTVNS-----PLDENTELIK----------------LCEEREFTDQKEEIF----T-FEKLREGEK-----EEIILLPANSEIEEHE--DVHSS--EQDIDEVSGTAK----NQAKSK--G-M-KRTFHCEICIFTSSRISSFNRHMKTHSDEKP--HMCHLCLKAFRTVTLLRNHVNTHTGTRPYKCS--DCDMAFVTSGELVRHRRYKHTHEKPFKCSMCKYASVEASKLKRHIRSHTGERPFHCCLCSYASKDTYKLKRHMRTHS---GEKPYECYVCHARFTQSGTMKIHILQKHSENVPKHQCPH--CSTVIARKSDL-RVHLRNLHSYKATEMKCRYCPAVFHERYALIQHQKTHRNEKRFKCDD-------CNYACKQ-ERHMTVHKR------------THTGEKPFTCLSCNKCF-RQKQLLNVHFKKYHD-------KNFIPTVYECPKCGKGFSRWNNMRKHSEHCEAVK------------------GKSI-------------PS---AKGRKNKKKKQKDPK-QDAKEEGRQT---------------RNFRSDKVVEQMPIEDTSIVNIEHHPNEIVPVVYGMAA-DV-EE-----------------------------------------------------------------------------------------------------------------------PKTEVTCEMILNMMDK-------------------------------- >MONDO|ENSMODP00000020611 pep:novel chromosome:BROADO5:1:486248077:486266900:1 gene:ENSMODG00000016490 transcript:ENSMODT00000020976 gene_biotype:protein_coding transcript_biotype:protein_coding ---------------------------------------------------------------------------------------RMGTEASASPEHFTKIKGTDLIQEKA-KESDVDKVSRLKERQ-SSCGLEVDC-SYGVLQAKIVE-------------------GELELT-PQ------------------TQENEKHILTLQTVH--------------------------FATD-EMDHQEM---TVEP---AEGM---HVMVQQGE----------------------SGLQSL-------L----------------LNEIPHQN------LHHCVAISIQEEVFSLHELEVMEINVVEESVEISSEEDKLTVNP-----PLDENTESVKVE-------KNYEVPQLCEEREITDQKEGLF----T-FDKLREGEK-----EEIILLPANSEIEEHE--DIPSS--EQDTDEVSGTAK----NQAKTK--D-V-KQTFHCEICIFTSSKISCFNRHMKTHSDEKP--HMCHLCLKAFRTVTLLRNHVNTHTGTRPYKCS--DCDMAFVTSGELVRHRRYKHTHEKPFKCTMCKYASVEASKLKRHIRSHTGERPFHCCLCNYASKDTYKLKRHMRTHS---GEKPYECYVCHARFTQSGTMKIHILQKHSENVPKHQCPH--CATVIARKSDL-RVHLRNLHSYKAAEMKCRYCTDVFHERYALIQHQKTHRNEKRFKCDD-------CSYACKQ-ERHMRVHKR------------THTGEKPFTCLSCNKCF-RQKQLLNVHFKKYHD-------KNFIPTVYECPKCGKGFSRWNNMRKHSELCEVIR------------------GKAV-------------QS---AKGRKTKKKKQKGPK-QDVKEEGK-------------------------FEQMPIEDISNVNIERHTNEIVPVGYGIAT-DVAEE-----------------------------------------------------------------------------------------------------------------------QKTEVTCEMILNMMDK-------------------------------- >HOMSA|gi|29570785|ref|NP_542185.2| transcriptional repressor CTCFL [Homo sapiens] ---------------------------------------------------------------------------------------MAATEISVLSEQFTKIKELELMPEKGLKEEEKDGVCREKDHR-SPSELEAER-TSGAFQDSVLE-------------------EEVELVLAP------------------SEESEKYILTLQTVH--------------------------FTSE-AVELQDMSLLSIQQ---QEGV---QVVVQQPG----------------------PGL-------------------------LWLEEGPRQS------LQQCVAISIQQELYSPQEMEVLQFHALEENVMVASEDSKLAVS-------LAETTGLIKLE-------EEQEKNQLLAER----TKEQLF----F-VETMSGDERS----DEIVLTVSNSNVEEQE--DQPTA--GQADAEKAKSTK----NQRKTK--G-A-KGTFHCDVCMFTSSRMSSFNRHMKTHTSEKP--HLCHLCLKTFRTVTLLRNHVNTHTGTRPYKCN--DCNMAFVTSGELVRHRRYKHTHEKPFKCSMCKYASVEASKLKRHVRSHTGERPFQCCQCSYASRDTYKLKRHMRTHS---GEKPYECHICHTRFTQSGTMKIHILQKHGENVPKYQCPH--CATIIARKSDL-RVHMRNLHAYSAAELKCRYCSAVFHERYALIQHQKTHKNEKRFKCKH-------CSYACKQ-ERHMTAHIR------------THTGEKPFTCLSCNKCF-RQKQLLNAHFRKYHD-------ANFIPTVYKCSKCGKGFSRWINLHRHSEKCGSGE------------------AKSA-------------AS---GKGRRTRKRKQTILK-EATKGQKEAA---------------KGWKEAANGDEAAAEEASTTKGEQFPGEMFPVACRETTARVKEE-----------------------------------------------------------------------------------------------------------------------VDEGVTCEMLLNTMDK-------------------------------- >PELSI|ENSPSIP00000012493 pep:novel scaffold:PelSin_1.0:JH209331.1:1517019:1535688:1 gene:ENSPSIG00000011195 transcript:ENSPSIT00000012554 gene_biotype:protein_coding transcript_biotype:protein_coding ---------------------------------------------------------------------------------------MMAAQDSHLPEPFTKIKGAERIWDRAREDDGGDRLPWVKERN-SICDPDVEV-LNGAPPAKALE-----------------GGRNLELS-PS------------------LIQSEKHLIMLQTVR--------------------------LKEG-EEDLQAVSQLNIQQ---QSGL---HMVVQRGA----------------------SVLQPL-------VVVQQGV-------------GAQQN------IPTGVAISLQDGVYTFHDMEVMQINVLQEKVQAKDEENKS----------MDKSPGMLLIK---------KLVPKNLKNSVKIDRTKDLH----A-VEEILSCTA-----KDDISVSLNEPKEQGE----QSV--VKKTDTLEAHTN----TQHRKK--G-E-KVTIHCDLCAFTSLRMSSLNRHMKTHSDEKP--HLCHLCLKAFRTVTLLRNHVNTHTGTRPYKCS--DCEMAFVTSGELARHRRYKHTLEKPFKCSVCKYSSVEASKLKRHIRSHTGERPYNCCLCSYASKDTYKLKRHMVTHS---GEKPYECYVCQARFTQSGTMKIHILQKHSENVPRYQCPH--CNAFIARKSDL-GVHLRNLHSYLAVAMKCSYCEAVFHERYALIQHKKTHRNEKRFKCDR-------CSYACKQ-ERHLIVHKR------------THTGEKPFTCVSCSKCF-RQKQLLTVHFRKHHD-------SNFKPTVYECPKCGKGYSRWNNMHKHAENCGLAR------------------AKVV-------------TR---HKGSKGKKKRWNSLK-QDVKQEGCSE-----------VSCGGLWESAGTVDLGSFQDVSVVNTECCASEIVPVEYGIET-STPRE-----------------------------------------------------------------------------------------------------------------------QKTEMTCEMILNMMDK-------------------------------- >SARHA|ENSSHAP00000001830 pep:novel scaffold:DEVIL7.0:GL867782.1:14413:31463:1 gene:ENSSHAG00000001630 transcript:ENSSHAT00000001851 gene_biotype:protein_coding transcript_biotype:protein_coding ------------------------------------------------------------------------------------------------------------------------------------------------------------------------EEGELDWH-PS------------------NEENERHLLALQMLP--------------------------FTTD-ETG-QEMSQLAVQP---AEGM---HIMEPQGE----------------------SGLQSL-------LGWQQYIN-----VQPELNEMPHQD------LSRCVELNIPEEIFSLEEMEMIENYIRDESADFFNED-KWIFDF-----PFDE--ELLKDQDKGIYG-AEWEAQELCEEREYTDPKEEIV----N-FETPRDGEK-----EEIILLLANSEIEEHE--DIHSP--PQDLDEVSQTAA----NQAETK--G-T-KWTFHCEICKFTSSKMSSLTRHMKTHTAEKP--HMCHLCPKAFRTGTLLRNHLNTHTGTRPYKCS--DCEMAFVTSGELGRHRRYKHTHEKRFKCSMCNYASVEASKLKRHIRSHTGERPFPCSFCNYASKDIFKLKRHMTSHS---GEKPYECSFCSARFSQSGTLKIHVLQKHSGNAPKHQCPH--CATLITRKSDL-RVHLRNLHSYSAAEMKCRYCTAAFHERYALIQHQKTHRDEKRFKCDV-------CSYACKQ-AQHMTIHKR------------IHTGEKPFTCLSCNKSF-RQKQLLKVHFKKYHD-------ETSVPPVHECPKCGKGFSRLNNMRKHSEHCEVVR------------------GKAV-------------PS---A-----KEKDQKGPE-QGAREEVLIG-----------FQTAQGLRNYKVIEEMPIEDISIVNIENATVEMVPVVYGTTS--DVQE-----------------------------------------------------------------------------------------------------------------------PQTEITFEMILNMIEK-------------------------------- >PETMA|ENSPMAP00000004689 pep:putative scaffold:Pmarinus_7.0:GL483954:4616:13063:1 gene:ENSPMAG00000004257 transcript:ENSPMAT00000004708 gene_biotype:protein_coding transcript_biotype:protein_coding --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------EDTHIITLHPVS--------------------------LEETGEGGGTSIGEITLVQVQADLTV---IIHNVKL-----------------------QLNSVL-------KIDVRRTL--LQGVP-ITHTLPLPE------GVQVVKVGPNGELE-VERAPMG----------------------------SQDERSKEKDP-------DYQL---PVKKPVRKGRKNKLR----Y-KQE-----------AADADISVYDFEEQEE--GRLVSSQDVGVEKAIAP-KPPKPTRIKKK--G-A-KKTFQCELCSYTCPRRSNLDRHMKSHTDERP--HCCHLCDRAFRTVTLLRNHVNTHTGTKPHKCM--ECDMAFVTSGELVRHRRYRHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCGLCSYASRDTYKLKRHMRTHS---GEKPYECHVCHARFTQSGTMKMHVLQKHTDNVPKYHCPH--CDAVIARKSDL-GVHLRKQHAVLERELRCRYCRAIFHERYALMQHQRTHRNEKRFKCDQ-------CEYACKQ-ERHMIMHKR------------VHTGEKPFECTLCDKTF-RQKQLLDFHFKRYHD-------PSFVPTTYECSKCHRNFTRRSTMMKHFDMCDGEL------------------ESGEQNGK---------AR---RGRRRGRKRKMQSRK-HGSSSESDEMPTDE---------------------DEEEEELNEE-----SVEVADEE----VEEPEPPPMKRRRGRPPKAKPGRPA------KKVAGSDSVACGIIEIIPVTVGGPDGPDDDEEEE--------------EEEEEAAEGEEGAVETAADEG--------------------------PKNDITPEMILSMMDQ-------------------------------- >OIKDI|GSOIDT00001753001 ---------------------------------------------------------------------------------------------------------------------------MSQQYEEEPIMEE----------------------------------------------------------------EDGNTTTVQFIA--------------------------VAPEEHERMVREGELPETIQGADVQM---TIGGTESRPVRLIAVDSNGIPVTDPTILQAAAEQAG-------IMFVSKDENDHENQISIDEAMQLRN---------------EENQQVPGYQDQ----------------------------PPADPYDYQAEP-----------------EYHDSEKGVPLR----NYTSQDVVYEEAPENGGIKQQENIYDYQDVPIQNEPKIDNHYKKSQSNTQRTKYPGTIQVGAN--G-EKRKVYQCSECAFYSHRHSNLIRHMKIHTDERP--YKCHLCARAFRTNTLLRNHINTHLGVKPYKCPEANCEMAFVTSGELTRHRRYKHTHEKPFKCTLCEYASVEISKLRRHFRSHTGERPFSCDICGKAFADSFHLKRHKFSHT---GEKPYECPHCKARFTQHGSLKMHVMQQHTKTAPKFECEREICHTMLGRKSDL-NVHLRKQHSYQEIPMQCRYCEEVFHDRWSLMQHQKTHRS-GRYRIDEDGNQIFDPDYDSEM-EDMEGGHGSGNLVYTEDGHVIGQDGSPNVIVQRVQHIDGQQGVPIEDHQQMQHDEHNMHAVEHQMDHSQEAPPIHQGHHRQAHAEAKSERFEQEQQHDPFAFDDDQNMGNGEYMHAPQMGQTMAQPVE------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- >STRPU|gi|72028083|ref|XP_797592.1| PREDICTED: uncharacterized protein LOC593001 [Strongylocentrotus purpuratus] ---------------MDENTDQPGSQTEEPVAPADQGEAEGTDAASMNVLETYLQNFNEELSSGPATAAGAVQQAAASTVAATEEDDTESPLAVHVEEVADAEEDVQLEEGAEGDNVAVVKQEIGEEEEMEEEVGEASQPAPDQATIDITHTLQLLANASANISNPNALQENQEGEMGTENTFMQQLDTSNLVDENGAKVDPSRIAGIQTVNGEQVVMVHNMEDGTSGIGQQQVLMVAMQDNGQLSSMDQGVAQIAFSGAPVNMVGDNIVYQTTQ-------NTQYVPVSHNGTTQLAMTQSGGEPGTEVYTILQTVDGAETTTITTPTTMVVSSGGVHHLGDEQTHVIATTSGDHPSYAELQ---------------------------PVSEDAAQQEEGALQMAAAEHEEVALIVQKPVKRKRGRPRKDEAAKMQTQIVIVREVVEG-EDGQDPSVYDFYAGED--DTAPV--AGGDEK----------SGIEVG--GAKKKTRYVVPKFDDGRLLDQVLNRAKKGGPGRRPKVHECHLCGRIFRTSTLLRNHENTHSGTKPYKCE--LCPKAFGTSGELGRHMKYMHTHEKPHKCPLCDYLSVEASKIKRHMRSHTGEKPYKCTLCEYASTDNYKLKRHMRVHT---GERPFNCSQCDQSFSQKSSLKEH-EWKHVGNRPSHKCDH--CDTTFGRYADM-KTHVRKMHTAGE-PMICKICENAFTDRFTYMQHVRGHRGEKIYKCGE-------CGYSAPQ-KRHLVIHMR------------VHTGERPYECEECHETF-KHKQTLINHQRSKHNLIQEADGTKKRKATDEITSPSKRITRRQRMQIQEEEETEEMVEPHTLTTADGNTIQVSMAQGGEGTVQLVQTSDGTMPVILTVGGDGQNVDEALQMMNGSLAAVQGHQDGEGQLMMAVPQGDGDNLHLEGQQASQQLQTSDD--------------------------IADDSQPPELQQEGQVQEVSAPRESSSITQEQAAALQQQMVSQGIISEGSVIAAMEEDEDGTGDGTIYLFVEEQ-------------------------------------------------------------------------------------------